https://github.com/carloocchiena/linkedin_orgchart_scraper
A Python automation that allows to extract company names and roles from an input company
https://github.com/carloocchiena/linkedin_orgchart_scraper
linkedin-scraper python scraper selenium-python
Last synced: about 2 months ago
JSON representation
A Python automation that allows to extract company names and roles from an input company
- Host: GitHub
- URL: https://github.com/carloocchiena/linkedin_orgchart_scraper
- Owner: carloocchiena
- License: gpl-3.0
- Created: 2020-12-16T10:43:12.000Z (over 5 years ago)
- Default Branch: main
- Last Pushed: 2022-02-21T13:02:14.000Z (over 4 years ago)
- Last Synced: 2025-07-19T18:40:35.270Z (11 months ago)
- Topics: linkedin-scraper, python, scraper, selenium-python
- Language: Python
- Homepage:
- Size: 28.3 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# linkedin_orgchart_scraper
A Python automation that allows to extract company names and roles from an input compan, using Selenium and BSoup.
It opens many ways to increase its functionality such as getting emails, persona urls, numbers of people in the company, doing multiple searches and so on.
### Release notes:
15/10/2021 removed dependencies from lxml module that may cause trouble in conda venv and switched to built-in html.parser
15/10/2021 added "company_list.py" to scrape thru a list of given companies. This is tested and working with latest class and div tags
15/10/2021 noticed that main.py is not working anymore since the class tags in Linkedin changed. This is quite frequent and atm i have no reason to maintain this repo in real time. But in the case I'll be glad to help if someone may need assistance.