Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/boringppl/linkedin-profiles-scraping
Automatically scrape the web data of people profiles on Linkedin based on a specific search query
https://github.com/boringppl/linkedin-profiles-scraping
beautifulsoup beautifulsoup4 python python3 selenium selenium-webdriver webscraper webscraping webscraping-data webscrapper webscrapping
Last synced: 2 months ago
JSON representation
Automatically scrape the web data of people profiles on Linkedin based on a specific search query
- Host: GitHub
- URL: https://github.com/boringppl/linkedin-profiles-scraping
- Owner: boringPpl
- Created: 2020-11-22T12:58:03.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2023-12-04T09:39:04.000Z (about 1 year ago)
- Last Synced: 2023-12-04T10:27:55.825Z (about 1 year ago)
- Topics: beautifulsoup, beautifulsoup4, python, python3, selenium, selenium-webdriver, webscraper, webscraping, webscraping-data, webscrapper, webscrapping
- Language: Jupyter Notebook
- Homepage:
- Size: 24.4 KB
- Stars: 51
- Watchers: 4
- Forks: 25
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Web Scraping: Crawling Linkedin Profiles
---Automatically scrape the web data of people profiles on Linkedin based on a specific search query
### Tutorial:
- English: https://youtu.be/zkfLAY2OrtI
- Vietnamese: https://youtu.be/hfnBswCe4QE### Problem:
It takes 10s on average to skim through Linkedin profiles and copy that information into an excel sheet. To collect a large enough amount of data for analysis purposes, it will take time if done manually.### Approach to the problem:
To efficiently collect and cluster Linkedin profiles data, this scrip helps automatically scrape the web data of people profiles on Linkedin based on a specific search query and store the output in a CSV file---
### Sponsor
[](https://nubela.co/proxycurl?utm_campaign=influencer_marketing&utm_source=github&utm_medium=social&utm_content=daphne_linkedin_profiles_scraping
)Scrape public LinkedIn profile data at scale with [Proxycurl APIs](https://nubela.co/proxycurl?utm_campaign=influencer_marketing&utm_source=github&utm_medium=social&utm_content=daphne_linkedin_profiles_scraping
).
- Scraping Public profiles are battle tested in court in HiQ VS LinkedIn case.
- GDPR, CCPA, SOC2 compliant
- High rate Limit - 300 requests/minute
- Fast APIs respond in ~2s
- Fresh data - 88% of data is scraped real-time, other 12% are not older than 29 days
- High accuracy
- Tons of data points returned per profileBuilt for developers, by developers.