Tasks created in Octoparse can be run on your local device (Local Extraction) or in the cloud (Cloud Extraction). Running tasks locally can help you:
This tutorial will cover:
How to run a task on your device?
There are two ways to run a task on your device. The first is to click the "Run" button to open the task editing panel, and then choose the "Run task on your device" option to execute the task.
The second is to start a local run from the task dashboard. Hover over the task details first, and then click the second button to execute the task on your device.
The "Run the task on your device" option is available to both free and premium users.
Free users are limited to 10,000 exported records per export and 2 concurrent local runs. Premium users (Standard & Professional) have no limit on exported records or concurrent local runs.
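The limits above can be summarized as a small lookup table. The sketch below is only an illustration of the stated numbers, not Octoparse code: the names `LocalRunLimits`, `exportable_records`, and `can_start_run` are hypothetical, and it assumes that an export over the free limit is simply capped at 10,000 records.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class LocalRunLimits:
    """Limits for local runs; None means "no limit"."""
    max_records_per_export: Optional[int]
    max_concurrent_runs: Optional[int]


# Values taken from the text above; the plan keys are hypothetical labels.
LIMITS = {
    "free": LocalRunLimits(max_records_per_export=10_000, max_concurrent_runs=2),
    "standard": LocalRunLimits(None, None),
    "professional": LocalRunLimits(None, None),
}


def exportable_records(plan: str, extracted: int) -> int:
    """Return how many records one export may contain (assumes capping)."""
    cap = LIMITS[plan].max_records_per_export
    return extracted if cap is None else min(extracted, cap)


def can_start_run(plan: str, running: int) -> bool:
    """Return True if another local run may start alongside `running` runs."""
    cap = LIMITS[plan].max_concurrent_runs
    return cap is None or running < cap
```

For example, under these assumptions a free user who extracted 25,000 records could export 10,000 of them at a time, and could not start a third local run while two are already running.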
How to stop a task?
There are three ways to stop a running task. You can exit the data extraction process by clicking the "Exit" button on the top right, or you can click the "Stop Run" button on the bottom right.
You can also stop a running task from the task dashboard by clicking the "Stop" button.
Important settings of local runs
Disable image loading
When a page on the website you're scraping contains a lot of images, Octoparse takes longer to load all of the page's content. You can disable image loading to shorten the page load time.
On some websites, however, this may affect the loading of the whole page. If you find that page loading is affected, please don't select this option.
This setting won't affect the image URLs you extract from the page, because the URLs of the images are still in the source code.
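To see why image URLs survive when image loading is disabled, note that the URL lives in the page markup itself, independent of whether the image file is ever downloaded. The following is a minimal sketch using Python's standard `html.parser`, not Octoparse's own extraction logic; the class name `ImageURLCollector` and the sample HTML are made up for illustration.

```python
from html.parser import HTMLParser


class ImageURLCollector(HTMLParser):
    """Collect the src attribute of every <img> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; no image data is fetched here.
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    self.urls.append(value)


def image_urls(html: str) -> list:
    """Return image URLs found in the markup, in document order."""
    collector = ImageURLCollector()
    collector.feed(html)
    collector.close()
    return collector.urls


# Hypothetical page source; the images themselves are never requested.
sample = '<div><img src="https://example.com/a.jpg" alt="A"><img src="/b.png"></div>'
print(image_urls(sample))  # ['https://example.com/a.jpg', '/b.png']
```

The parser only reads text, so the result is the same whether or not a browser would have rendered the images.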
You can set up IP proxies to prevent your local IP from being blocked. To learn how to set up proxies, see Set up IP proxies (Version 8).
1. What IP does a task use if I run it on my device?
The task runs with your own IP address, and some websites limit the number of visits from the same IP. In that case, the crawler is likely to be blocked if it exceeds the website's limit.
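As an illustration of how such a per-IP limit can work, here is a minimal sliding-window sketch. It is an assumption about how a website might count visits, not a description of any particular site or of Octoparse; `PerIPRateLimiter`, the limit of 3 requests per 60 seconds, and the IP address (from a documentation-only range) are all hypothetical.

```python
from collections import defaultdict, deque


class PerIPRateLimiter:
    """Allow at most max_requests per IP within a sliding time window."""

    def __init__(self, max_requests: int, window_seconds: float):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._hits = defaultdict(deque)  # ip -> timestamps of allowed requests

    def allow(self, ip: str, now: float) -> bool:
        hits = self._hits[ip]
        # Drop timestamps that have fallen out of the window.
        while hits and now - hits[0] >= self.window_seconds:
            hits.popleft()
        if len(hits) >= self.max_requests:
            return False  # this IP is over the limit and would be blocked
        hits.append(now)
        return True


limiter = PerIPRateLimiter(max_requests=3, window_seconds=60.0)
# Four quick requests from one IP, then one more after the window has passed.
results = [limiter.allow("203.0.113.7", t) for t in (0.0, 1.0, 2.0, 3.0, 61.0)]
print(results)  # [True, True, True, False, True]
```

Because every request in a local run comes from the same address, a fast crawl can hit this kind of limit quickly, while requests from a different IP are counted separately.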
2. What will affect Local Extraction?
As the task is running on your device, it will be affected by the local network speed and hardware configuration.