https://github.com/boringppl/linkedin-profiles-scraping

Automatically scrape the web data of people profiles on Linkedin based on a specific search query
https://github.com/boringppl/linkedin-profiles-scraping

beautifulsoup beautifulsoup4 python python3 selenium selenium-webdriver webscraper webscraping webscraping-data webscrapper webscrapping

Last synced: 5 months ago
JSON representation

Automatically scrape the web data of people profiles on Linkedin based on a specific search query

Host: GitHub
URL: https://github.com/boringppl/linkedin-profiles-scraping
Owner: boringPpl
Created: 2020-11-22T12:58:03.000Z (over 5 years ago)
Default Branch: main
Last Pushed: 2023-12-04T12:06:27.000Z (over 2 years ago)
Last Synced: 2025-07-29T12:56:20.057Z (12 months ago)
Topics: beautifulsoup, beautifulsoup4, python, python3, selenium, selenium-webdriver, webscraper, webscraping, webscraping-data, webscrapper, webscrapping
Language: Jupyter Notebook
Homepage:
Size: 25.4 KB
Stars: 66
Watchers: 3
Forks: 32
Open Issues: 2
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Web Scraping: Crawling Linkedin Profiles
---

Automatically scrape the web data of people profiles on Linkedin based on a specific search query

### Tutorial:
- English: https://youtu.be/zkfLAY2OrtI
- Vietnamese: https://youtu.be/hfnBswCe4QE

### Problem:
It takes 10s on average to skim through Linkedin profiles and copy that information into an excel sheet. To collect a large enough amount of data for analysis purposes, it will take time if done manually.

### Approach to the problem:
To efficiently collect and cluster Linkedin profiles data, this scrip helps automatically scrape the web data of people profiles on Linkedin based on a specific search query and store the output in a CSV file

---
### Sponsor
[](https://nubela.co/proxycurl?utm_campaign=influencer_marketing&utm_source=github&utm_medium=social&utm_content=daphne_linkedin_profiles_scraping
)

Scrape public LinkedIn profile data at scale with [Proxycurl APIs](https://nubela.co/proxycurl?utm_campaign=influencer_marketing&utm_source=github&utm_medium=social&utm_content=daphne_linkedin_profiles_scraping
).
- Scraping Public profiles are battle tested in court in HiQ VS LinkedIn case.
- GDPR, CCPA, SOC2 compliant
- High rate Limit - 300 requests/minute
- Fast APIs respond in ~2s
- Fresh data - 88% of data is scraped real-time, other 12% are not older than 29 days
- High accuracy
- Tons of data points returned per profile

Built for developers, by developers.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/boringppl/linkedin-profiles-scraping

Awesome Lists containing this project

README