You are browsing a tutorial guide for the latest Octoparse version. If you are running an older version of Octoparse, we strongly recommend you upgrade because it is faster, easier, and more robust! Download and upgrade here if you haven't already done so!

LinkedIn is a rich source of valuable job information. In this tutorial, we will show how to scrape job listings from LinkedIn.com.

To follow along, you can use the URL from this tutorial:

https://www.linkedin.com/jobs/search/?currentJobId=2011756127&geoId=105080838&keywords=accountant&location=New%20York%2C%20United%20States
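The search URL itself encodes the query: the `keywords` parameter holds the job title and `location` holds the search region, so changing them is one way to target a different search. As a quick sanity check, the tutorial URL can be decomposed with Python's standard library:

```python
# Parse the tutorial's LinkedIn job-search URL to see how the query
# parameters encode the search terms.
from urllib.parse import urlparse, parse_qs

url = ("https://www.linkedin.com/jobs/search/?currentJobId=2011756127"
       "&geoId=105080838&keywords=accountant"
       "&location=New%20York%2C%20United%20States")

# parse_qs returns a dict mapping each parameter to a list of values,
# decoding percent-escapes like %20 (space) and %2C (comma) on the way.
params = parse_qs(urlparse(url).query)
print(params["keywords"][0])   # accountant
print(params["location"][0])   # New York, United States
```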

We will scrape data such as job titles, companies, levels, types, functions, and industries in Octoparse.

The website uses infinite scroll combined with a "Show More" button to load more jobs. After we scroll the page to the bottom about 6 times, a "Show More" button appears, and to continue loading jobs, we have to click it.

Here are the main steps in this tutorial. [Download the demo task here]

  1. "Go To Web Page" - to open the targeted web page

  2. Set up scroll settings - to scroll down the page

  3. Auto-detect web page - to create a workflow

  4. Click on each link - to get more detailed information

  5. Extract data - to select the data for extraction

  6. Modify the XPath of the Loop Item - to locate the show more jobs button

  7. Start extraction - to run the task and get data


1. "Go To Web Page" - to open the targeted web page

  • Enter the URL on the home page and click Start

mceclip0.png

2. Set up scroll settings - to scroll down the page

Since the page needs to be scrolled down 6 times before the Show More button appears, you need to configure scroll settings for the Go to Web Page action.

scroll_settings.jpg

3. Auto-detect web page - to create a workflow

You can use auto-detection to scrape the list of jobs.

  • Choose Auto-detect web page data

auto-detection.jpg
  • Wait for the detection to complete

  • Check the data fields in the Data Preview and delete the unwanted fields or rename fields if needed

rename.jpg
  • Uncheck Add a page scroll from the Tips panel

  • Click Create workflow

create_workflow.jpg

4. Click on each link - to get more detailed information

If you want to scrape job details from each job post, you need to click on each job URL to load the details page.

  • Choose Click on link(s) to scrape the linked page(s) on the Tips panel

  • Select Click on an extracted data field and select the basecard__fulllink_URL from the drop-down menu (you can confirm if it's the correct link on the Data Preview)

  • Click Confirm

mceclip1.gif
  • Go to the settings of Click URLs in the list

  • Click Options tab

  • Uncheck the Open in a new tab option

  • Tick Load with AJAX and set the AJAX timeout to 5-7s

  • Click Apply to confirm

click_URLs.jpg

5. Extract data - to select the data for extraction

  • Click on any text information you want to extract from the page

  • Select Extract the text of the selected element on the Tips panel

  • Repeat the steps until you get all the data needed to be scraped

Extract_data.jpg
  • Edit the name of the data fields if needed

rename_fields.jpg
  • Uncheck the Extract data in the loop option

extract_loop.jpg
  • Set the wait time to 7s

wait_time.jpg

6. Modify the XPath of the Loop Item - to locate the show more jobs button

  • Click on Loop Item

  • Replace the Matching XPath with //button[@aria-label="Load more results"]

  • Click Apply to save

Load_more_button.jpg
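The XPath `//button[@aria-label="Load more results"]` selects any button whose `aria-label` attribute equals exactly "Load more results". The sketch below demonstrates this matching against a simplified, hypothetical stand-in for LinkedIn's actual markup (Python's ElementTree spells the document-wide search as `.//`, but the predicate works the same way as in Octoparse):

```python
# Demonstrate how the Loop Item XPath matches the "Show More" button.
# The HTML snippet is a hypothetical simplification, not LinkedIn's
# real page source.
import xml.etree.ElementTree as ET

html = """
<body>
  <ul class="jobs-list">
    <li>Accountant - Example Co</li>
  </ul>
  <button aria-label="Load more results">See more jobs</button>
</body>
"""

root = ET.fromstring(html)
# [@aria-label='Load more results'] keeps the match to the one button
# with that exact attribute value, so the loop clicks the right element.
button = root.find(".//button[@aria-label='Load more results']")
print(button.text)  # See more jobs
```

If the site's markup changes (e.g. the button's `aria-label` text is updated), the XPath stops matching and the task stops loading more jobs, so this attribute value is worth re-checking when a run returns fewer results than expected.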

7. Start extraction - to run the task and get data

  • Click Save, and click Run on the upper right side

  • Select Run on your device to run the task on your computer

TIP: Please don't run the task in the Cloud since LinkedIn requires login when it detects suspicious IPs.

Here is the sample output.

mceclip0.png