Can't select the content of a page

Comments

3 comments

  • Kara

    Hi livekth,

    Thank you for reaching out.

    So after inputting the target page URL, you do need to add the step to click on "공지사항" to get into the detail page, for the info from table one here, cause we are able to copy and post the content, meaning they are displayed as text in the source code, we can scrape it as texts for sure, but for the info from table2, it was an image, so we can only find and scrape the image URL from the source code, but can not get those content as texts:

     

    Best regards,

    0
    Comment actions Permalink
  • livekth

    Thank you for answering this.

     

    Now I have another issue (which was actually submitted to the customer success team):

    (I recorded the video: https://youtu.be/-LJ7teCiv4g)

    I spent the entire last week to find a tool to extract data from Jejuair.net.
    • Dexi.io and Octoparse.com both fail like the above video (it’s 3min, so take a look for fun).
    • Parsehub.com just stopped when loading the page.
    • Automatio.co looks promising and I’m looking forward to the upcoming features but currently it doesn’t work for the webpage.
    • The problem is that the site uses AJAX and hides the URLs inside it by using Javascript. Because of this, Dexi.io and Octoparse both can extract the data in the workflow (or bot) editor for a single detail page, but when I run the entire workflow they can’t get the data.
    How can I get the data extracted if the data is on a detail page of which URL is hidden by Javascript?
     
     
    0
    Comment actions Permalink
  • Kara

    Hi livekth,

    Thank you for your reply.

    I did a test run with the webpage and looks like the data you aim to capture in the screen recording is feasible with Octoparse, we just need to add "wait time" for some steps, in case the webpage loads too slowly, and the crawler didn't wait for it to be fully loaded and moved on to the next item, to learn how, please refer to the GIFs below:

     

     

     

    Best regards,

     

    0
    Comment actions Permalink

Please sign in to leave a comment.