Scrape trending video information from YouTube
YouTube is the second most popular search engine in the world, directly behind Google. It hosts a large number of videos, and their associated data and comments can be really valuable.
Octoparse can scrape YouTube easily with pre-built templates. You may want to check them out here: Task Templates. Just enter a keyword or URL to get data extracted in minutes!
In this tutorial, we are going to show you how to scrape trending video information from YouTube in only 3 steps with the Octoparse auto-detection feature.
Here is the YouTube trending video link that we will be using as an example.
https://www.youtube.com/feed/trending?gl=US
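If you want to reuse this tutorial for another country, the only part of the link that should need changing is the gl query parameter. The sketch below is a minimal, optional helper that rebuilds the link with a different value. It assumes that gl carries a two-letter country code, which is our reading of the example URL rather than something Octoparse or YouTube documents here, and with_region is a hypothetical name used only for illustration.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit

# The example link used in this tutorial.
TRENDING_URL = "https://www.youtube.com/feed/trending?gl=US"


def with_region(url: str, region: str) -> str:
    """Return `url` with its `gl` query parameter set to `region`.

    Assumption: `gl` holds a two-letter country code such as US or GB.
    """
    parts = urlsplit(url)
    query = parse_qs(parts.query)       # e.g. {"gl": ["US"]}
    query["gl"] = [region.upper()]      # overwrite or add the region
    new_query = urlencode(query, doseq=True)
    return urlunsplit(parts._replace(query=new_query))


if __name__ == "__main__":
    # Paste the printed link into the Octoparse search box in step 1.
    print(with_region(TRENDING_URL, "GB"))
```

Whether the page actually shows different videos for a given code is worth checking in a browser before you build the task.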
Here are the steps in this tutorial:
- "Go to Web Page" - to open the target website
- Auto-detect web page data - to create a workflow
- Run your task - to get the data you want
1. "Go to Web Page" - to open the target website
- Create your task by inputting the URL in the search box on the homepage
- Click the Start button nearby to move on
2. Auto-detect web page data - to create a workflow
- Click Auto-detect web page data and wait for it to complete
Next, check the data that auto-detection has selected.
- Go to Data preview to see if you're okay with the current data output
- Delete unnecessary data fields directly by clicking the delete icon
- Modify the data field names by double-clicking the headers
- Click Edit under Add a page scroll and set the page to scroll to the bottom 20 times, waiting 1s after each scroll (you can set more repeats to get more videos)
- Confirm the settings
- Click Create workflow
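To make the scroll settings above more concrete, here is a rough Python model of what "scroll to the bottom, 20 times, wait 1s per scroll" amounts to. This is only a sketch of the idea, not how Octoparse implements it. The names scroll_page and scroll_once are hypothetical, and scroll_once stands in for whatever actually moves the page.

```python
import time
from typing import Callable


def scroll_page(
    scroll_once: Callable[[], None],
    repeats: int = 20,
    wait_seconds: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Call `scroll_once` `repeats` times, pausing after each call.

    Returns the total time spent waiting, in seconds.
    `scroll_once` is a hypothetical stand-in for "scroll to the bottom".
    """
    total_wait = 0.0
    for _ in range(repeats):
        scroll_once()            # load the next batch of videos
        sleep(wait_seconds)      # give the page time to render them
        total_wait += wait_seconds
    return total_wait


if __name__ == "__main__":
    scrolls = []
    # A no-op sleep keeps the demo instant; the real wait would be 20 x 1s.
    waited = scroll_page(lambda: scrolls.append(1), sleep=lambda s: None)
    print(len(scrolls), "scrolls,", waited, "seconds of waiting in total")
```

With the settings from this tutorial, the scroll step adds roughly 20 seconds of waiting to each run. Raising the number of repeats loads more videos at the cost of a proportionally longer run.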
3. Run your task - to get the data you want
- Click Save, and click Run on the upper right side
- Select Run on your device to run the task on your computer, or select Run in the Cloud to run the task in the Cloud (for premium users only)
Here is the sample data.
If you have further issues with the task or have a suggestion that would make this a better resource for you, we’d love to hear about it. Submit a request here.
Author: Kiki
Editor: Yina