Scrape product reviews from Amazon(V8.4)
FollowYou are browsing a tutorial guide for Octoparse's latest version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier and more robust! Download and upgrade here if you haven't already done so!
Product reviews are a good resource for improving your product performance. In this tutorial, we will show you how to scrape the product reviews from Amazon.com.
For Amazon product scraping, you could use our ready-to-use Task Template available on the home page or follow this tutorial to build the task from scratch.
Here are the main steps in this tutorial: [Download demo task file here]
- Open target webpage
- Click to see all reviews
- Auto-detect webpage and create workflow
- Adjust AJAX timeout for Paginate
- Check the data and workflow
- Run task to extract data
1) Open target webpage
- Paste the URL and click Start
2) Click to see all reviews
- Scroll down the page to find the See all reviews button
- Click on it and choose Click URL
3) Auto-detect webpage and create workflow
- Select Auto-detect web page data
- Wait for the detection - uncheck Add a page scroll - Create workflow
4) Adjust AJAX timeout for Paginate
- Click on Click to Paginate - adjust Timeout to 10s
5) Check the data and workflow
- Go to Data preview to check if the current data output, double click on the header to rename it, or click ... to delete a field
- Below is what the final workflow looks like. Once everything is in place, you can continue to run the task
6) Run task to extract data
- Run the task on the top right corner: Run task on your device to run the task on your local device, or select Run task in the cloud to run the task on the Cloud (for premium users only)
Here is the sample output -