Unable to extract data from tabbed JavaScript table
Hi, I'm trying to extract data from this website: https://www.oddsportal.com/soccer/italy/serie-a/results/
The list of football matches is no problem; I can extract those. The issue comes when I open a single match to extract all the data in its tabbed table: https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/ . I can't get Octoparse to recognize each button (1X2, AH, O/U, DNB, etc.).
It seems to work in workflow mode, but when I launch the extraction, all data is skipped except that of the first tab (1X2).
Is there a way to overcome this?
Thanks
-
Official comment
Hi FatTails,
Thank you for reaching out.
There are two options for this case; choose whichever you find more convenient:
1) Clicking a button changes the page URL, and the new URL follows a fixed pattern. So we can use the batch generate feature in Octoparse (or just Excel) to build all the URLs, then loop through them to scrape the data:
https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/#1X2;2
https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/#home-away;3
https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/#ah;2
https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/#over-under;2
https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/#dnb;2
2) We can add click actions to the workflow to click the buttons. For example, to click "Home/Away", change the XPath of the Click Item to: //a[@title="Home/Away"]
If you need more assistance with this project, please submit a ticket via https://helpcenter.octoparse.com/hc/en-us/requests/new with the task attached so we can help you revise it.
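If you would rather build the URL list for option 1 outside Octoparse, a minimal Python sketch could look like the one below. The fragments are copied from the five URLs listed above; the function name build_tab_urls is hypothetical, and the sketch assumes other matches use the same fragments, which is worth checking on a few pages first.

```python
# Tab fragments taken from the example URLs above (assumed to be the same for every match).
TAB_FRAGMENTS = ["1X2;2", "home-away;3", "ah;2", "over-under;2", "dnb;2"]

EXAMPLE_MATCH_URL = (
    "https://www.oddsportal.com/soccer/italy/serie-a/sassuolo-brescia-j1imlODd/"
)


def build_tab_urls(match_url, fragments=TAB_FRAGMENTS):
    """Return one URL per tab for a single match page (hypothetical helper)."""
    # Drop any fragment already present, then make sure the path ends with "/".
    base = match_url.split("#", 1)[0]
    if not base.endswith("/"):
        base += "/"
    return [f"{base}#{fragment}" for fragment in fragments]


if __name__ == "__main__":
    # Print the list so it can be pasted into Octoparse as a URL list.
    for url in build_tab_urls(EXAMPLE_MATCH_URL):
        print(url)
```

Running it on every match URL from the results page gives a flat list that can be pasted into a task that loops through URLs.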
Please follow the instructions in this tutorial to export the task: How to export a task?
Cheers,