You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier, and more robust! Download and upgrade here if you haven't already done so!
Lazada is an international e-commerce platform and one of the largest e-commerce operators in Southeast Asia. It currently has over 10,000 third-party sellers and 50 million annual active buyers.
This tutorial will show you how to scrape product information, such as product title, price, rating, shipping info, etc., from Lazada with the Octoparse.
To follow through, here is the example URL:
The main steps are shown in the menu on the right, and you can download the sample task file here.
1. Create a Go to Web Page - to open the target website
Enter the target URL on the homepage of Octoparse and click Start
Lazada uses Captcha as an anti-scraping measure; therefore, before setting up the task, it is essential to resolve the Captcha.
Turn on the Browser Mode
Resolve the Captcha manually
Turn off the Browser Mode
For more information on how to resolve Captcha in Octoparse, please check this tutorial: How to solve Captcha in Octoparse?
2. Auto-detect the webpage - to create a workflow
Click Auto-detect web page data and wait for it to complete
Create Workflow
Go to Data Preview to see if you're okay with the current data output
Delete unnecessary data fields directly by clicking the More (three-dot) icon next to the field name and selecting Delete Field
Modify the data field names by double-clicking the headers
NOTE: For this specific task, please keep the Title_URL for the details page in the data preview section, as it will be used for the following steps.
3. Select Subpage URL - to extract data from the details page
Select Subpage URL in the Tips panel
Choose Title_URL as the data field to click on > Confirm
Click on your desired data field (e.g. Shipping info)
Choose Text on the Tips panel
Repeat the above steps to extract any other data on the details page
4. Run the task - to get your target data
Click Save and click Run on the upper right side
Select Run on your device to run the task on your computer.
Here's the sample data output for your reference.