Scraping full review text from TripAdvisor
I couldn't find a way to get a full text reviews from specific hotel on TripAdvisor (for example https://www.tripadvisor.com/Hotel_Review-g295371-d559522-Reviews-Hilton_Imperial_Dubrovnik-Dubrovnik_Dubrovnik_Neretva_County_Dalmatia.html). Scraped text after 'Read more' button is missing. Can anyone help me?
-
Hello All,
Did anyone ever find a solution for this? I'm currently working to collect TripAdvisor reviews from individual hotel properties and want every review for a given property (ex. every TripAdvisor review from The Peninsula Hong Kong). I'm trying to collect the review text, review date, the bubble rating, reviewer ID, trip type, response from hotel, etc. - about 20 fields in total.). My scraping task gets all the fields I need, but only gets about 75%-80% of the reviews. Some are skipped and I haven't yet figured out why.
I've already lengthened timeouts for page loading and for AJAX, which made no difference in the number of reviews collected. I currently believe that there's something about specific reviews (or about each page worth of reviews) that keeps them from being collected by the scraper. Any advice or insights would be appreciated.
Please sign in to leave a comment.
Comments
3 comments