Can't extract image URL
I've had a lot of problems trying to extract product info from a site. I couldn't get anything to work until I set it to wait 30 seconds before each extraction execution. It's finally working, with the exception of the product image. Octoparse is able to see the first image, but it's not able to extract its URL or any image URLs after that.
Here's an example of what it pulls during setup...
During the 30-second delay, the image is fully loaded, so that doesn't seem to be the problem. Also, when I put the real version of this URL into a browser, the image displays correctly.
Any ideas? Thanks in advance for your help.
-
Hi John,
Sorry for the late reply.
Can you see the image URL being extracted in the workflow?
If the image loads but the URL is not extracted, it is possible that the XPath of the image data field does not locate the image correctly.
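One rough way to check this outside Octoparse is to look at which attribute of the image tag actually holds the URL. If the real address sits in an attribute other than src, an XPath that targets src may return a placeholder or nothing at all. The snippet below is only a minimal sketch using Python's standard library: the sample HTML, the example.com URLs, and the attribute names in URL_ATTRS are assumptions for illustration, not taken from your site, so the page you are scraping may look different.

```python
from html.parser import HTMLParser

# Attribute names that sometimes carry an image address.
# This list is an assumption for illustration; adjust it to the real page.
URL_ATTRS = ("src", "data-src", "data-original", "srcset")


class ImgAttrCollector(HTMLParser):
    """Record the URL-bearing attributes of every <img> tag, in page order."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        # Keep only non-empty attributes from URL_ATTRS.
        found = {name: value for name, value in attrs
                 if name in URL_ATTRS and value}
        self.images.append(found)


def find_image_urls(html):
    """Return one dict per <img> tag, mapping attribute name to its value."""
    collector = ImgAttrCollector()
    collector.feed(html)
    return collector.images


# Hypothetical page fragment: the first image keeps its real URL in data-src.
SAMPLE_HTML = """
<div class="product">
  <img class="main" src="placeholder.gif"
       data-src="https://example.com/img/shoe-front.jpg">
  <img class="hover" src="https://example.com/img/shoe-back.jpg">
</div>
"""

if __name__ == "__main__":
    for index, attrs in enumerate(find_image_urls(SAMPLE_HTML), start=1):
        print(index, attrs)
```

If the output shows the real URL under an attribute other than src, the XPath for the image field would need to point at that attribute instead.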
If the issue persists, you can submit a ticket with more details:
-
If you're talking about seeing the image URL within the URL field when setting up the task in Workflow mode, yes, I can see it.
I also found another URL for the same image in the source code (probably due to the hover effect), so I experimented with changing the XPath to that image. Again, it was found in the Workflow, but when it came time to extract... blank.
I'll submit a ticket. Thanks for your help on this.
-
Hi ktperez,
Thank you for reaching out.
Could you please submit a ticket with your task attached so that we can help debug and revise it? Here are the ticket form and a tutorial on exporting a task:
https://helpcenter.octoparse.com/hc/en-us/requests/new
https://helpcenter.octoparse.com/hc/en-us/articles/360020888352-How-to-export-a-task-
Best regards,