I haven’t found a workaround for getting the .pdf or .img of a file yet ;(

Comments

2 comments

  • Scarlett

    Sorry that Octoparse can't help you get the PDF or image file directly. Octoparse is only able to get the download link.
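
    Since only the download link is available, one option is to export the links and fetch the files with a separate script. Below is a minimal sketch of that idea in Python, not an Octoparse feature. The export file layout and the column name `download_link` are assumptions for illustration; adjust them to match your own export.

```python
import csv
import os
import urllib.request
from urllib.parse import urlparse, unquote


def filename_from_url(url, index):
    """Derive a local file name from the URL path; fall back to a numbered name."""
    name = os.path.basename(unquote(urlparse(url).path))
    return name or f"download_{index}"


def read_links(csv_path, column="download_link"):
    """Read non-empty links from one column of an exported CSV.

    The column name is hypothetical; use whatever your export calls it.
    """
    with open(csv_path, newline="", encoding="utf-8-sig") as f:
        links = []
        for row in csv.DictReader(f):
            value = (row.get(column) or "").strip()
            if value:
                links.append(value)
        return links


def fetch_bytes(url):
    """Fetch the raw bytes behind a URL using only the standard library."""
    with urllib.request.urlopen(url, timeout=30) as resp:
        return resp.read()


def download_all(urls, out_dir, fetch=fetch_bytes):
    """Save each URL into out_dir and return the list of paths written."""
    os.makedirs(out_dir, exist_ok=True)
    saved = []
    for i, url in enumerate(urls, start=1):
        path = os.path.join(out_dir, filename_from_url(url, i))
        with open(path, "wb") as out:
            out.write(fetch(url))
        saved.append(path)
    return saved
```

    For example, `download_all(read_links("octoparse_export.csv"), "downloads")` would save every linked file into a `downloads` folder, where `octoparse_export.csv` is a placeholder file name. This only helps when the link is actually present on the scraped page, and files that share a name will overwrite each other.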

  • newkirkpartners

    Same here. I am trying to extract the link to the PDF so that another batch program can download it, but it appears Octoparse cannot open a PDF file to extract its URL.

    Unfortunately, in my case the hyperlink to the PDF is not shown on the originating page, so I have to try to extract the page data from the PDF itself. Since Octoparse doesn't seem able to open the PDF, my data comes back blank.

    Hopefully this, along with printing a PDF and downloading files, will make it into a future update.
