How to scrap job postings when each posting has his own layout?

Comments

3 comments

  • Kara

    Hi krafre,

    Thank you for reaching out.

    Looks like the first three links have expired already:

     

    And ould you please use a screenshot to show us what is this "whole text-information" you want to grab from the detail page? 

    Best regards,

    0
    Comment actions Permalink
  • krafre

    Hello Kara,

    thanks for the feedback. That's a bit strange because the links for example 1 & 2 are still working for me, number 3 seems to expired. But no worries, here are some more examples: 4, 5, 6. Here you can also find some screenshots. I marked the area in red where the job, the company and the applicants-profile are described. That's what I would like to scrap. These text boxes are always different for each job.

    Example 1:

    Example 2:

     

    Example 4:

     

    Example 5:

     

    Example 6:

    Thanks for the support. Best regards, Fred.

    0
    Comment actions Permalink
  • Kara

    Hi Fred,

    Thank you very much for the info.

    Please try to modify the XPath of the data field into:

    //div[@class="js-app-ld-ContentBlock"]

    To learn more about XPath, this tutorial would be very helpful, please check: What is XPath and how to use it in Octoparse

    Best regards,

    0
    Comment actions Permalink

Please sign in to leave a comment.