This tutorial is written for the latest version of Octoparse. If you are running an older version, we strongly recommend upgrading, because the latest version is faster, easier to use, and more robust. Download and upgrade here if you haven't already done so!

Coinmarketcap is a website that provides free access to current and historical data for Bitcoin and thousands of altcoins. It gives visitors an overall view of the constantly changing coin market.

In this tutorial, we are going to scrape crypto coin prices and other info from Coinmarketcap.

DATA.png

Sample URL: https://coinmarketcap.com/

Here are the steps in this tutorial: [Download task file here]

  1. Create a Go to Web Page - to open the target website

  2. Auto-detect web page data - to create a workflow

  3. Modify the action order - to correctly load more information

  4. Run your task - get the data you want


1. Create a Go to Web Page - to open the target website

  • Create your task by entering the sample URL in the search box on the home screen

  • Click the "Start" button to move on

2022-05-14_9-24-35.png

2. Auto-detect web page data - to create a workflow

  • Click Auto-detect web page data and wait for it to complete

auto_detect.jpg
  • Go to Data preview to remove unwanted data fields or rename them if needed

    • Delete unwanted data fields directly by clicking the delete icon

    • Modify the data field names by double-clicking the headers

FIELD.png
  • Click Create a workflow to generate a workflow

CREATE.png

The automatically generated workflow should look like this:

workflow.png

3. Modify the action order - to correctly load more information

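In the auto-generated workflow, the Loop Item sits inside the Scroll Page step. We need to drag the Loop Item out of Scroll Page so that data is extracted only after the whole page has loaded, and then set Scroll Page to scroll one screen at a time.

  • Drag the Loop Item out of the Scroll Page step and drop it below Scroll Page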

loop.png
  • In the Scroll Page settings, set the scroll to one screen at a time

  • Set Repeats to 30 times

  • Set the wait time to 2s

  • Click Apply

scroll_page.png
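
Why the order matters can be illustrated with a toy simulation. The Python sketch below is not Octoparse code: `LazyPage`, the 100-row total, and the 10 rows per screen are all hypothetical stand-ins for a page that renders more rows as you scroll. It compares extracting right away with scrolling 30 times first, and estimates the waiting time added by a 2-second wait per scroll.

```python
class LazyPage:
    """Toy model of a page that renders more rows each time it is scrolled."""

    def __init__(self, total_rows, rows_per_screen):
        self.total_rows = total_rows
        self.rows_per_screen = rows_per_screen
        # Assumption: only the first screen of rows is rendered on page open.
        self.loaded = min(total_rows, rows_per_screen)

    def scroll_one_screen(self):
        # Each scroll renders one more screen of rows, capped at the total.
        self.loaded = min(self.total_rows, self.loaded + self.rows_per_screen)

    def visible_rows(self):
        return [f"coin-{i}" for i in range(1, self.loaded + 1)]


def extract_without_scrolling(page):
    # Extraction runs before any scrolling has happened.
    return page.visible_rows()


def scroll_then_extract(page, repeats):
    # Scroll the requested number of times, then extract once.
    for _ in range(repeats):
        page.scroll_one_screen()
    return page.visible_rows()


REPEATS = 30       # value used in this tutorial
WAIT_SECONDS = 2   # value used in this tutorial

early = extract_without_scrolling(LazyPage(100, 10))
late = scroll_then_extract(LazyPage(100, 10), REPEATS)
scroll_time = REPEATS * WAIT_SECONDS  # seconds spent waiting between scrolls

print(len(early), len(late), scroll_time)  # prints: 10 100 60
```

In this toy model, extracting before scrolling sees only the first screen of rows, while scrolling first exposes every row at the cost of about a minute of waiting. The real numbers depend on how the page actually loads.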

TIP: For more tutorials on page scroll-down settings, you can check here.


4. Run your task - get the data you want

  • Click the Save button first to save all the settings you have made

  • Then click Run to run your task either locally or in the cloud

mceclip8.png
  • Select Run on your device and click Run Now to run the task on your local device

  • Wait for the task to complete

mceclip9.png

Below is sample data from the local run:

sample_data.png
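
If you export the results to CSV, prices typically arrive as text rather than numbers. Here is a minimal Python sketch of cleaning them, assuming hypothetical column names `Name` and `Price` (yours depend on the field names you kept in Data preview) and made-up sample values rather than real market data.

```python
import csv
import io


def parse_price(text):
    """Turn a price string such as "$29,345.10" into a float."""
    return float(text.replace("$", "").replace(",", "").strip())


def load_prices(fileobj):
    """Read rows from a CSV file object and return {name: price}."""
    reader = csv.DictReader(fileobj)
    # "Name" and "Price" are assumed column headers, not guaranteed ones.
    return {row["Name"]: parse_price(row["Price"]) for row in reader}


# Made-up sample standing in for an exported file; not real market data.
SAMPLE_EXPORT = 'Name,Price\nCoin A,"$29,345.10"\nCoin B,$0.5021\n'

prices = load_prices(io.StringIO(SAMPLE_EXPORT))
print(prices)
```

To run it on a real export, pass an opened file instead of the `io.StringIO` sample, and adjust the column names to match your own headers.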

TIP: Local runs are great for quick runs and small amounts of data. If you are dealing with more complicated tasks or large amounts of data, Run in the Cloud is recommended for higher speed. You are very welcome to try this premium feature by signing up for the 14-day free trial here. Tasks can be scheduled hourly, daily, or weekly, with data delivered regularly.
