You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier, and more robust! Download and upgrade here if you haven't already done so!

Jabong was an Indian fashion, and lifestyle e-commerce portal acquired by Flipkart through its unit Myntra and later shut down to shift focus completely on Myntra. So don't panic if you find the website automatically skipping to myntra.com. :)

This tutorial will introduce how to scrape product information from Jabong.

list.jpg

To follow through the tutorial, you may want to use the URL below:

https://www.myntra.com/bracelet

Here are the main steps in this tutorial: [Download task file here]

  1. Create a Go to Web Page - to open the target website

  2. Auto-detect the webpage - to create a workflow

  3. Set backup XPath for the data field(s) - to locate the field(s) accurately

  4. Run the task - to get the desired data


1. Create a Go to Web Page - to open the target website

  • Enter the target URL into the search bar on the home screen and click Start

search.jpg

2. Auto-detect the webpage - to create a workflow

Octoparse's auto-detection function can help you quickly create a workflow according to the target website's design.

  • Click Auto-detect web page data in Tips and wait for the detection to complete

auto_detect.jpg
  • Check the data fields in Data Preview and delete unwanted data or rename them if needed

data_preview.jpg
  • Click Create workflow

create.jpg

Now, you will see a workflow created like the one below:

workflow.jpg

3. Set backup XPath for the data field(s) - to locate the field(s) accurately

Sometimes, there is more than one XPath for a specific data field; thus, it is essential to set up an alternative XPath for such field(s) to extract the data more precisely.

In this case, the data field Original_price needs a backup XPath as the location of this field may change according to whether the product has a discount.

  • Click

    dot.jpg

    next to the data field to change its settings

  • Choose Customize XPath

original_price.jpg
  • Tick Try alternate XPath

  • Choose Relative XPath to the Loop Item

  • Input the XPath as: //div[@class='product-price']/span[1]

  • Click Apply to save the change

backup_xpath.jpg

Note: If you want to learn more about setting up alternative XPath, please check out here.


4. Run the task - to get the desired data

  • Click Save on the upper right to save your task

  • Click Run next to it and wait for a Run Task window to pop up

  • Select Run on your device to run the task on your local device

  • Wait for the task to complete

Here is the sample output from a local run:

Jabong_data.jpg

Tip: Local runs are great for task troubleshooting and quick runs. If you are dealing with more complicated tasks, it is recommended that you select Run in the Cloud to run the task in Octoparse's cloud-based platform for higher speed. Try out this premium feature by signing up for the 14-day free trial here. You can also schedule your tasks to run hourly, daily, or weekly and get data delivered to you regularly.

Did this answer your question?