Extracting from Multiple Sites while keeping data consistent
Hi,
How do I extract data from multiple sites (with different formats), while keeping the data consistent in the right columns?
For example, I would like to extract data from the following sites:
https://www.totaldirectbank.com/personal/money-market-accounts
https://www.hsbcdirect.com/savings/
I want to keep the APYs in one column, the minimum balance in another column, and the URL in a third column.
Thanks
-
Hi rayw1126sf,
Thank you for reaching out.
Usually, one task/crawler is used to scrape one website or a set of URLs under one domain. To scrape different websites with one task, the target data must share the same XPath on every site. It looks like the APYs are located differently on these sites, so unfortunately we are not able to scrape them this way.
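To illustrate why a single shared XPath does not work here, and what keeping the columns consistent would involve, here is a minimal sketch in plain Python (not a feature of the task/crawler itself). It keeps a separate XPath per site and writes every result into the same apy, min_balance, and url columns. The markup, XPaths, and values below are made-up placeholders, not the real structure or rates of either site, and the names SITE_RULES, SAMPLE_PAGES, and extract_row are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical per-site rules: each site gets its own XPath for each column,
# because the same field sits in a different place on each page.
SITE_RULES = {
    "https://www.totaldirectbank.com/personal/money-market-accounts": {
        "apy": ".//td[@class='apy']",
        "min_balance": ".//td[@class='min']",
    },
    "https://www.hsbcdirect.com/savings/": {
        "apy": ".//span[@id='rate']",
        "min_balance": ".//span[@id='minimum']",
    },
}

# Made-up, well-formed markup standing in for the downloaded pages.
# The values are placeholders, not real rates.
SAMPLE_PAGES = {
    "https://www.totaldirectbank.com/personal/money-market-accounts": (
        "<html><body><table><tr>"
        "<td class='apy'>2.50%</td><td class='min'>$25,000</td>"
        "</tr></table></body></html>"
    ),
    "https://www.hsbcdirect.com/savings/": (
        "<html><body><div>"
        "<span id='rate'>2.30%</span><span id='minimum'>$1</span>"
        "</div></body></html>"
    ),
}


def extract_row(url, markup):
    """Apply the site's own XPaths and return one row with fixed column names."""
    root = ET.fromstring(markup)
    row = {"url": url}
    for column, xpath in SITE_RULES[url].items():
        node = root.find(xpath)
        # A missing element becomes None so the column still exists in the row.
        row[column] = node.text.strip() if node is not None and node.text else None
    return row


# One row per site, all sharing the same three columns.
rows = [extract_row(url, SAMPLE_PAGES[url]) for url in SITE_RULES]
for row in rows:
    print(row["apy"], row["min_balance"], row["url"], sep="\t")
```

Real pages are rarely well-formed XML, so an actual script would likely need an HTML parser that tolerates broken markup; the standard-library parser is used here only to keep the sketch self-contained.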
Best regards,