You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier and more robust! Download and upgrade here if you haven't already done so!
Clutch is a website collecting ratings and reviews from leading IT, marketing, and business services companies. It is a data-driven field guide for B2B buying and hiring decisions.
This tutorial will show you how to use auto-detact to scrape company reviews from Clutch.
URL being used in this tutorial is: https://clutch.co/profile/webfx#reviews
Here are the main steps of this tutorial: [Download task file here]
1. Create a Go to Web Page - to open the target website
Enter the target URL into the search bar on the home screen and click Start
2. Auto-detect webpage - to generate a workflow
Click Auto-detect webpage data on the Tips
Delete unwanted data by clicking the delete icon after the auto-detect is complete
Untick Add a page scroll
Click Create workflow
The workflow will generate the following:
3. Modify the data field - to rename them if needed
Double click the header to rename the field
4. Run the task - to get the target data
Click the Save button first to save all the settings you have made
Then click Run to run your task either locally or cloudly
Select Run on your device and click Run Now to run the task on your local device
Waiting for the task to complete
Below is a sample data run from the local. Excel, CSV, HTML, and JSON formats are available for export.