How to extract the webpage URL along with the other data?
It's often useful to extract the webpage URL along with the corresponding data. In particular, adding the webpage URL to the extracted dataset helps with 1) workflow debugging and 2) data validation. Extracting the webpage URL is straightforward with Octoparse.
1. Select Extract Data from the workflow. Under Advanced Options, click Add predefined field.
2. Select Add current page information, then Web page URL.
3. There you have it!
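Once the URL is part of the dataset, it can be used for the debugging and validation mentioned above. The following is a minimal sketch in Python, not an Octoparse feature: it assumes the data was exported as CSV, and the column names (`Title`, `Price`, `Page_URL`), the sample rows, and the helper functions are all hypothetical. Adjust them to match the field names in your own task.

```python
import csv
import io
from collections import Counter

# Hypothetical sample of an exported dataset. The column names and
# values are assumptions for illustration only.
SAMPLE_EXPORT = """Title,Price,Page_URL
Widget A,9.99,https://example.com/products?page=1
Widget B,,https://example.com/products?page=1
Widget C,4.50,https://example.com/products?page=2
"""


def rows_per_url(csv_text, url_column="Page_URL"):
    """Count how many rows were extracted from each page URL."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row[url_column] for row in reader)


def urls_with_missing(csv_text, field, url_column="Page_URL"):
    """Return the sorted page URLs where `field` came back empty."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return sorted({row[url_column] for row in reader if not row[field].strip()})


counts = rows_per_url(SAMPLE_EXPORT)
missing_price = urls_with_missing(SAMPLE_EXPORT, "Price")

print(counts)         # rows extracted per page
print(missing_price)  # pages to revisit when debugging the workflow
```

A page with an unexpectedly low row count, or one that shows up in the missing-field list, is a good place to start when checking the workflow.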
This article is also available in Spanish.
You can also read web scraping articles on the official website.