Scraping from Hilton.com
Hi. I am trying to use Octoparse to scrape data from Hilton.com. I want to enter a set of search criteria (location, dates, search radius, number of rooms, number of people) and then scrape hotel information for different discount codes. For each discount code, I want to capture the discount code in use, the hotel name, the distance from my search location, the hotel price, and the rate type.
I have gotten through most of the setup to loop through 3 discount codes, but for some reason Octoparse is only capturing the hotel information from the 3rd code. It loops through the first 2 without scraping any data. Also, Octoparse doesn't recognize that the "distance" field is a repeating field, so it repeats the distance for the first hotel in the list for all hotels. In addition, I can't figure out how to get Octoparse to add a field to each row that contains the discount code in use.
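For clarity about the output being described, here is a minimal Python sketch of the target row shape: one row per hotel per discount code, with each hotel keeping its own distance and every row carrying the code in use. This is not Octoparse configuration and does not touch Hilton.com; the names `HotelRow` and `build_rows`, the codes, and all sample values are hypothetical placeholders.

```python
from dataclasses import dataclass


@dataclass
class HotelRow:
    discount_code: str  # the code that was active for this search
    hotel_name: str
    distance: str       # kept as text; the unit depends on the site
    price: str
    rate_type: str


def build_rows(results_by_code):
    """Flatten {discount_code: [hotel dicts]} into one row per hotel per code."""
    rows = []
    for code, hotels in results_by_code.items():
        for hotel in hotels:
            rows.append(
                HotelRow(
                    discount_code=code,
                    hotel_name=hotel["name"],
                    distance=hotel["distance"],  # read per hotel, not once per page
                    price=hotel["price"],
                    rate_type=hotel["rate_type"],
                )
            )
    return rows


# Placeholder data only: these codes, hotels, and values are invented.
sample = {
    "CODE_A": [
        {"name": "Hotel One", "distance": "1.2", "price": "150", "rate_type": "Standard"},
        {"name": "Hotel Two", "distance": "3.4", "price": "120", "rate_type": "Standard"},
    ],
    "CODE_B": [
        {"name": "Hotel One", "distance": "1.2", "price": "135", "rate_type": "Discounted"},
    ],
}

rows = build_rows(sample)
for row in rows:
    print(row)
```

The point of the sketch is that data is collected inside every pass of the outer loop over codes, and the distance is read inside the inner loop over hotels, so neither is captured only once.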
I would also like to know if it is possible to have Octoparse wait for user input during the initial search. I am trying to get Octoparse to open the search page, but then wait for the user to enter the location, dates, etc., before clicking Submit. Otherwise, every time we perform a new search, we have to edit the Octoparse task to adjust those variables before continuing, which is time-consuming and error-prone. Another option would be to put those variables into a text or CSV file that Octoparse would know where to find, if that's possible.
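As an illustration of the CSV idea, here is a minimal Python sketch of what such a file of search criteria could look like and how it could be read. Whether Octoparse can consume a file like this is exactly the open question, so this only shows the data layout; the column names, the `load_searches` helper, and the sample row are assumptions made for illustration.

```python
import csv
import io

# Hypothetical file contents: one search per row. Column names are assumed.
SAMPLE_CSV = """location,check_in,check_out,radius,rooms,people
Example City,2025-01-10,2025-01-12,10,1,2
"""


def load_searches(text_stream):
    """Read search criteria rows and convert the numeric columns to int."""
    searches = []
    for row in csv.DictReader(text_stream):
        searches.append(
            {
                "location": row["location"].strip(),
                "check_in": row["check_in"].strip(),
                "check_out": row["check_out"].strip(),
                "radius": int(row["radius"]),
                "rooms": int(row["rooms"]),
                "people": int(row["people"]),
            }
        )
    return searches


# An in-memory stream stands in for a real file opened with open(path, newline="").
searches = load_searches(io.StringIO(SAMPLE_CSV))
print(searches)
```

Keeping the criteria in a file like this means a new search only requires editing one row instead of editing the task itself.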
I have exported my task but don't see any way to upload it to this forum.
Official comment
Hi gurs,
Thank you for reaching out and giving us such a detailed background of the case.
Could you please click here to submit a ticket with the task attached, so we can look into the problem and revise the task for you?
Cheers,