Not able to extract parent and child dataset (2 datasets)
I have been trying to extract parent and child datasets from a website, without success.
The target is a bulk wholesale website. As a first step, I successfully extracted the sales batches as the parent dataset. As a second step, I set up a loop click on each parent item. The first click brings up the detail page of the batch, and a second click on that page opens the list of items in the batch. However, the task only extracted the items of the first batch and then merged them into the parent dataset.
What I want is two separate datasets. The child dataset should contain a batch ID that links back to the parent dataset, so that I can analyze the data in a database.
Is this doable? Or is there a workaround? Thanks!
-
Hi,
The cloud parent-child task works like this.
For example, you set the parent task to scrape the product detail links.
You then import those detail links into your child task. When the parent task finishes, the child task starts automatically and scrapes the data on each detail page.
In your case, this is only doable if the parent task and the child task both contain at least one identical primary key (for example, the batch ID). Without such a shared key, such tasks cannot be created.
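To illustrate why the shared key matters, here is a minimal Python sketch of how the two exported datasets could be loaded into a database and joined once both carry the same batch ID. It is only an illustration, not part of the scraping tool: the field names (`batch_id`, `batch_name`, `detail_url`, `item_name`, `quantity`), the sample rows, and the helper `load_and_join` are all hypothetical, and your real exports will have different columns.

```python
import sqlite3

# Hypothetical sample exports; field names and values are illustrative only.
parent_rows = [
    {"batch_id": "B001", "batch_name": "Batch one",
     "detail_url": "https://example.com/batch/B001"},
    {"batch_id": "B002", "batch_name": "Batch two",
     "detail_url": "https://example.com/batch/B002"},
]
child_rows = [
    {"batch_id": "B001", "item_name": "Item A", "quantity": 10},
    {"batch_id": "B001", "item_name": "Item B", "quantity": 5},
    {"batch_id": "B002", "item_name": "Item C", "quantity": 7},
]


def load_and_join(parents, children):
    """Load both datasets into an in-memory SQLite database and
    return one summary row per batch, joined on the shared batch_id."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE batches ("
        "batch_id TEXT PRIMARY KEY, batch_name TEXT, detail_url TEXT)"
    )
    conn.execute(
        "CREATE TABLE items ("
        "batch_id TEXT REFERENCES batches(batch_id), "
        "item_name TEXT, quantity INTEGER)"
    )
    # Named placeholders are filled from each dict's keys.
    conn.executemany(
        "INSERT INTO batches VALUES (:batch_id, :batch_name, :detail_url)",
        parents,
    )
    conn.executemany(
        "INSERT INTO items VALUES (:batch_id, :item_name, :quantity)",
        children,
    )
    rows = conn.execute(
        "SELECT b.batch_id, b.batch_name, "
        "COUNT(i.item_name), SUM(i.quantity) "
        "FROM batches b LEFT JOIN items i ON i.batch_id = b.batch_id "
        "GROUP BY b.batch_id, b.batch_name "
        "ORDER BY b.batch_id"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    for row in load_and_join(parent_rows, child_rows):
        print(row)
```

If the item list page does not show the batch ID itself, the detail page URL could play the same role, provided both tasks capture it in an identical form.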