Extract page-level data
The updated tutorial for the latest version, 8.1, is available here. Check it out now!
In this tutorial, we will show you how to use Octoparse to extract page-level data, including webpage URL, page title, meta description, and keywords.
Octoparse offers the web page URL, page title, meta description, and keywords as predefined fields, so adding them takes only a few steps:
1. In the "Extract Data" action, click "Add predefined fields"
2. Select "Add current page information"
3. Select the page-level data that you want
The selected page-level data is added automatically to "Data Field".
4. Rename the data field as needed
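For readers curious where these values come from in a page's HTML, the following is a minimal Python sketch using only the standard library. It is an illustration, not Octoparse's own implementation: the names `PageInfoParser`, `extract_page_info`, and `SAMPLE_HTML`, the output field labels, and the example URL are all assumptions made for this example. It assumes the title comes from the `<title>` element and the description and keywords come from `<meta name="description">` and `<meta name="keywords">` tags.

```python
from html.parser import HTMLParser


class PageInfoParser(HTMLParser):
    """Collects the <title> text and the description/keywords meta tags."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta":
            # Only keep the two meta names this tutorial is concerned with.
            name = (attrs.get("name") or "").lower()
            if name in ("description", "keywords"):
                self.meta[name] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Title text may arrive in several chunks, so accumulate it.
        if self._in_title:
            self.title += data


def extract_page_info(url, html):
    """Return the four page-level fields as a dict (hypothetical labels)."""
    parser = PageInfoParser()
    parser.feed(html)
    parser.close()
    return {
        "Page URL": url,
        "Page title": parser.title.strip(),
        "Meta description": parser.meta.get("description", ""),
        "Meta keywords": parser.meta.get("keywords", ""),
    }


# Hypothetical sample page used only for illustration.
SAMPLE_HTML = """<html><head>
<title>Example Store</title>
<meta name="description" content="A sample page.">
<meta name="keywords" content="sample, demo">
</head><body><h1>Hello</h1></body></html>"""

info = extract_page_info("https://example.com/", SAMPLE_HTML)
for field, value in info.items():
    print(f"{field}: {value}")
```

A page that omits one of the meta tags simply yields an empty string for that field in this sketch, which is a design choice made here for simplicity.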
This article is also available in Japanese (ページレベルのデータを抽出する) and in Spanish (Extraer datos a nivel de página).
More articles about web scraping are available on the official website.
Related articles:
Select and extract data/URL/image/HTML
Extract data from the source code
From: https://www.octoparse.com/tutorial-7/extract-page-level-data