{"id":13580006,"url":"https://github.com/khaleddallah/LinkedinScraper","last_synced_at":"2025-04-06T00:30:37.286Z","repository":{"id":217509468,"uuid":"162272911","full_name":"khaleddallah/LinkedinScraper","owner":"khaleddallah","description":"Python Scrapy project parse people profiles of Linkedin Search and arrange result content in Excel and Json file","archived":false,"fork":false,"pushed_at":"2022-09-07T14:12:23.000Z","size":7743,"stargazers_count":6,"open_issues_count":3,"forks_count":2,"subscribers_count":3,"default_branch":"master","last_synced_at":"2024-11-05T18:51:26.357Z","etag":null,"topics":["crawler","excel","json","linkedin","python","scraper","scrapy","spider"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"lgpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/khaleddallah.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-12-18T10:45:18.000Z","updated_at":"2024-04-25T19:28:00.000Z","dependencies_parsed_at":null,"dependency_job_id":"0cc604a2-e00b-4ebc-9b04-8ae2f8a42061","html_url":"https://github.com/khaleddallah/LinkedinScraper","commit_stats":null,"previous_names":["khaleddallah/linkedinscraper"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/khaleddallah%2FLinkedinScraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/khaleddallah%2FLinkedinScraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/khaleddallah%2FLinkedinScraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/khaleddallah%2FLinkedinScraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/khaleddallah","download_url":"https://codeload.github.com/khaleddallah/LinkedinScraper/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247419597,"owners_count":20936009,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crawler","excel","json","linkedin","python","scraper","scrapy","spider"],"created_at":"2024-08-01T15:01:45.961Z","updated_at":"2025-04-06T00:30:37.281Z","avatar_url":"https://github.com/khaleddallah.png","language":"Python","funding_links":[],"categories":["Python"],"sub_categories":[],"readme":"# Linkedin Scraper using Scrapy    \n![](https://github.com/khaleddallah/LinkedinScraperProject/blob/master/Readme-Images/E.png)\n* Scrape number of profiles that exist in result of Linkedin searchUrl.    \n* Export the content of profiles to Excel and Json files.    \n    \n    \n## Installation\n\n* Use the package manager [pip](https://pip.pypa.io/en/stable/) to install Scrapy.  \n(Anaconda Recomended)\n```    \ncd LinkedinScraperProject     \npip install -r requirements.txt    \n```\n* clone the project\n```\ngit clone https://github.com/khaleddallah/GoogleImageScrapyDownloader.git\n```\n\n\n\n## Usage\n* get into the directory of the project:\n```\ncd LinkedinScraperProject   \n```   \n* to get help :\n```\npython LinkedinScraper -h\n```\n\u003cpre\u003e\n\u003cb\u003eusage:\u003c/b\u003e \npython LinkedinScraper [-h] [-n NUM] [-o OUTPUT] [-p] [-f format] [-m excelMode] (searchUrl or profilesUrl)\n\n\u003cb\u003epositional arguments:\u003c/b\u003e\n  searchUrl     URL of Linkedin search URL or Profiles URL\n\n\u003cb\u003eoptional arguments:\u003c/b\u003e\n  -h, --help    show this help message and exit\n  -n NUM        num of profiles\n                ** the number must be lower or equal of result number\n                'page' will parse profiles of url page (10 profiles) (Default)\n  -o OUTPUT     Output file\n  -p            Enable Parse Profiles\n  -f FORMAT     json    Json output file\n                excel    Excel file output\n                all    Json and Excel output files\n  -m EXCELMODE  1    to make each profile in Excel file appear in one row\n                m    to make each profile in Excel file appear in multi row\n\n\n\u003c/pre\u003e\n\n## Examples\n\n* Parse \u003cb\u003e(\u003c/b\u003e https://www.linkedin.com/in/khaled-dallah/ and https://www.linkedin.com/in/linustorvalds/ \u003cb\u003e) profiles\u003c/b\u003e and export the result content to \u003cb\u003eABC.xlsx\u003c/b\u003e and \u003cb\u003eABC.json\u003c/b\u003e\n\u003cbr\u003e(\u003cb\u003e-p\u003c/b\u003e) because of parsing single profiles\n```\npython LinkedinScraper -p -o 'ABC' 'https://www.linkedin.com/in/khaled-dallah/' 'https://www.linkedin.com/in/linustorvalds/'\n```\n\n\n* Parse \u003cb\u003e23\u003c/b\u003e profiles of searchUrl [https://www.linkedin.com/.../?keywords=Robotic\u0026...\u0026](https://www.linkedin.com/search/results/all/?keywords=Robotic\u0026origin=GLOBAL_SEARCH_HEADER)\n\u003cbr\u003eif you don't set output name by (-o), Name of result files will be value of keywords (\u003cb\u003eRobotic\u003c/b\u003e)\n```\npython LinkedinScraper -n 23 'https://www.linkedin.com/search/results/all/?keywords=Robotic\u0026origin=GLOBAL_SEARCH_HEADER'\n```\n\n\n* Parse \u003cb\u003e17\u003c/b\u003e profiles of searchUrl [https://www.linkedin.com/.../?keywords=Robotic\u0026...\u0026](https://www.linkedin.com/search/results/all/?keywords=Robotic\u0026origin=GLOBAL_SEARCH_HEADER)\n\u003cbr\u003eand get output as \u003cb\u003eexcel\u003c/b\u003e file and put the information of each profile in \u003cb\u003eone row\u003c/b\u003e \n```\npython LinkedinScraper -n 17 -f excel -m 1 'https://www.linkedin.com/search/results/all/?keywords=Robotic\u0026origin=GLOBAL_SEARCH_HEADER'\n```\n\n## Built with\n* Python 3.7\n* Scrapy\n* openpyxl\n\n\n## Author\n\n* **Khaled Dallah** - *Software Engineer*   |   *Python/c++ Developer*    \nkhaled.dallah0@gmail.com\n\n\n## Issues:   \nReport bugs and feature requests\n[here](https://github.com/khaleddallah/LinkedinScraperProject/issues).    \n\n## Contribute   \nContributions are always welcome! \n\n## License\n\nThis project is licensed under the LGPL-V3.0 License - see the [LICENSE.md](https://github.com/khaleddallah/LinkedinScraperProject/blob/master/LICENSE) file for details\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkhaleddallah%2FLinkedinScraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkhaleddallah%2FLinkedinScraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkhaleddallah%2FLinkedinScraper/lists"}