I would like to know how octoparse can scrape a website where the lead page is not all the same, some pages have one contact and others have two or three contacts, and I would like to know how I can get all the data from the pages from the entire website.
I have attached three pages form the same website that I am looking to download information from so that you can understand the problem I am having.
this is the parent directory.
On this page you can see there are three contact people to scrape.
On this page there is two contact people to scrape.
On this page there is one contact person to scrape.
So I find if I want to scrape the address, phone number, and all of the contact people it is difficult because many of the pages are different, how do you program octoparse to do this, plus the link from the parent page to each alphabetic directory page looped so it is difficult to create a task that will loop from one page to the next.
If you can help with this I would appreciate it, I could not find a tutorial dealing with this specific problem.
Please sign in to leave a comment.