You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier and more robust! Download and upgrade here if you haven't already done so!
💡 Try our pre-built DuckDuckGo Search template for faster setup!
To follow through with the tutorial, kindly use the following Example Search URL::
The main steps are shown in the menu on the right and you can download the demo task file here.
1. Task Setup
2. Auto-Detect Workflow
Click "Auto-detect web page data"
Select "Create workflow" after detection
Clean up detected fields:
3. Configure Pagination
Edit Loop Item XPath for "Load More" button:
//button[@id="more-results"]
Update results container XPath:
//ol[@class="react-results--main"]/li[@data-layout="organic"]
4. Refine Data Fields
Switch to Vertical View and update XPaths:
5. Optimize Workflow
To prevent duplicate data, move the Extract Data outside the pagination loop
6. Run & Export Data
Save your workflow
Run in Standard Mode (local) or Cloud
Export formats available:
Excel
CSV
HTML
JSON
Sample data: