Beyond Basics
XPath, pagination, data cleaning, anti-blocking, API, and more
61 articles
What is XPath and how to use it in Octoparse?
Use relative XPath to locate data outside a loop item
Use XPath to locate email addresses from "mailto" links on any website
Fix field issues (missing, blank or misplaced fields)
Customize element XPath
Locate elements based on nearby text ("following-sibling" function)
Set up an alternative XPath
XPath Cheatsheet for Web Scraping with Octoparse
Regular Expression (Regex) Cheatsheet for Data Extraction
How can I address anti-scraping measures and restrictions when using Octoparse?
Set up IP Proxies
Switch IP Pools for Cloud Runs
Resolve Captcha
How to solve CAPTCHA in Octoparse?
What is Cloudflare verification and how to deal with it?
How to deal with Cloudflare security checks manually
Add custom User Agent
Smart Hacks in Octoparse
Retry actions