Lesson 5: Get data
FollowNow that your first scraping task is built and fully tested. You can go ahead and run the task to extract some real data.
Ways to get data
There are two ways you can run the task:
- Run on your device (also known as local extraction)
- Run in the Cloud (as known as Cloud extraction)
If you run a task on your device, you will need to have the Octoparse App open during the extraction process. There will be an extraction window running on your PC, and you can watch the data getting extracted and wait for it to complete.
On the other hand, when you run a task in the Cloud, the task will be run on the Octoparse Cloud Platform, which means you can shut off the Octoparse App or even your computer and come back for your data when the job is done. Tasks running in the Cloud generally run 6x to 20x faster compare to local extractions. Depending on your project requirements, you can always choose a plan that works for you.
Start a run
Once you are done building a task, you can click the "Run" button to start a run.
Alternatively, you can also access the task on the Dashboard and use the and
buttons to run/stop a task.
Check your data
Now that your run is completed, you can go ahead and check your data.
Go to the Dashboard and find your task. Hover over the task status and click on it.
Or, you can also check your data by clicking the "show more" icon on the Dashboard, select "View data", and then choose if you'd like to view "Cloud data" or "Local data".
Export your data
If the data looks good to go, you can export the data directly by clicking on "Export Data" at the lower right hand corner of the Data View tab. Octoparse supports exporting data to Excel, CSV or HTML file or to a database.
Tips!
What if the task does not work well? Check out the FAQs below. |
>> Lesson 6: Schedule regular runs
Artículo en español: Lección 5: Obtener datos
También puede leer artículos de web scraping en el sitio web oficial