Skip to main content

Scrape product info from Lazada

Updated over a month ago

You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier, and more robust! Download and upgrade here if you haven't already done so!

Lazada is an international e-commerce platform and one of the largest e-commerce operators in Southeast Asia. It currently has over 10,000 third-party sellers and 50 million annual active buyers.

This tutorial will show you how to scrape product information, such as product title, price, rating, shipping info, etc., from Lazada with the Octoparse.

To follow through, here is the example URL:

The main steps are shown in the menu on the right, and you can download the sample task file here.


1. Create a Go to Web Page - to open the target website

  • Enter the target URL on the homepage of Octoparse and click Start

Lazada uses Captcha as an anti-scraping measure; therefore, before setting up the task, it is essential to resolve the Captcha.

  • Turn on the Browser Mode

  • Resolve the Captcha manually

  • Turn off the Browser Mode

For more information on how to resolve Captcha in Octoparse, please check this tutorial: How to solve Captcha in Octoparse?


2. Auto-detect the webpage - to create a workflow

  • Click Auto-detect web page data and wait for it to complete

  • Create Workflow

  • Go to Data Preview to see if you're okay with the current data output

    • Delete unnecessary data fields directly by clicking the More (three-dot) icon next to the field name and selecting Delete Field

    • Modify the data field names by double-clicking the headers

NOTE: For this specific task, please keep the Title_URL for the details page in the data preview section, as it will be used for the following steps.


3. Select Subpage URL - to extract data from the details page

  • Select Subpage URL in the Tips panel

  • Choose Title_URL as the data field to click on > Confirm

  • Click on your desired data field (e.g. Shipping info)

  • Choose Text on the Tips panel

  • Repeat the above steps to extract any other data on the details page


4. Run the task - to get your target data

  • Click Save and click Run on the upper right side

  • Select Run on your device to run the task on your computer.

  • Click Pause and Show Browser, resolve the captcha manually, and resume the process


Here's the sample data output for your reference.

Did this answer your question?