Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jesseokeya/linkedin-scraper

Selenium webDriver used to get information from linkedIn
https://github.com/jesseokeya/linkedin-scraper

chromedriver crawler linkedin os python scraper selenium-webdriver

Last synced: 17 days ago
JSON representation

Selenium webDriver used to get information from linkedIn

Awesome Lists containing this project

README

        

# linkedin-scraper
Simple selenium web crawler used to fetch information (url to images and videos) from linkedIn and saves it to a file `data.json`

## Guide
To run this application without using docker you need to download a [`chromedriver`](https://chromedriver.storage.googleapis.com/index.html?path=78.0.3904.105/) for the the system your on and name it `chromedriver` in project directory. Also in app.py you will have to enter your linkedin [`username`](https://github.com/jesseokeya/linkedin-scraper/blob/bc6a187bfc05b5b4b6ea873cec304ce918b8ec80/app.py#L7) and [`password`](https://github.com/jesseokeya/linkedin-scraper/blob/bc6a187bfc05b5b4b6ea873cec304ce918b8ec80/app.py#L8) in order to log into linkedin

## Running application
* Make sure to have python 3 installed
* Create a virtual environment by running `virtualenv venv -p python3`
* Activate the virtual environment by running `source venv/bin/activate`
* Install required dependencies by running `pip install -r requirements.txt`
* Run the app by running `python app.py`

## Running application in docker
* Build `docker build -t linkedin-scraper:latest .`
* Run `docker run -it -v $(pwd):/app linkedin-scraper:latest`
* SSH into running container (optional) `docker exec -it sh`