{"id":14978847,"url":"https://github.com/gurbaaz27/linkedin-comments-scraper","last_synced_at":"2025-07-22T18:33:51.532Z","repository":{"id":50284140,"uuid":"285069067","full_name":"gurbaaz27/linkedin-comments-scraper","owner":"gurbaaz27","description":"Script to scrape comments (including name, profile link, pfp, designation, email(if present), and comment) from a LinkedIn post from the URL of the post.","archived":false,"fork":false,"pushed_at":"2024-06-08T08:20:59.000Z","size":178,"stargazers_count":84,"open_issues_count":3,"forks_count":24,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-08T20:08:30.538Z","etag":null,"topics":["linkedin","linkedin-comments-scraper","linkedin-post","python","python3","scraper","selenium","selenium-python","selenium-webdriver","webscraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gurbaaz27.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-08-04T18:46:54.000Z","updated_at":"2025-04-04T14:09:29.000Z","dependencies_parsed_at":"2024-06-08T09:33:38.020Z","dependency_job_id":"67fcef7a-83de-46c3-ab1d-672263663d95","html_url":"https://github.com/gurbaaz27/linkedin-comments-scraper","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/gurbaaz27/linkedin-comments-scraper","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gurbaaz27%2Flinkedin-comments-scraper","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gurbaaz27%2Flinkedin-comments-scraper/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gurbaaz27%2Flinkedin-comments-scraper/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gurbaaz27%2Flinkedin-comments-scraper/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gurbaaz27","download_url":"https://codeload.github.com/gurbaaz27/linkedin-comments-scraper/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gurbaaz27%2Flinkedin-comments-scraper/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":266552647,"owners_count":23947181,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-07-22T02:00:09.085Z","response_time":66,"last_error":null,"robots_txt_status":null,"robots_txt_updated_at":null,"robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["linkedin","linkedin-comments-scraper","linkedin-post","python","python3","scraper","selenium","selenium-python","selenium-webdriver","webscraping"],"created_at":"2024-09-24T13:58:31.253Z","updated_at":"2025-07-22T18:33:51.484Z","avatar_url":"https://github.com/gurbaaz27.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# LinkedIn Comments Scraper \u003cimg src=\"assets/linkedin_logo.png\" width=\"50\" height=\"50\"\u003e\u003c/img\u003e\n\n## Updates 🚀\n\n- [x] Fix utf-8 encoding error\n- [x] Shift configuration from Config.py to config.json\n- [x] Add support for profile pictures of commentors\n- [ ] Add support for pulling images in comment section\n- [x] Add headless support \n- [ ] Chrome extension\n- [x] Show all replies support\n\n## Brief Overview\nIf you have used LinkedIn, you must have encountered many posts regarding helpful resources, and they would ask email address,to which replies in comments are usually like\n```bash\nInterested!\n\u003cemail-address\u003e\n ``` \nI don't like this thing, like you can already share link in the post itself :angry:. **Nevertheless**, I thought to automate this work of collecting all emails.  \nAll the comments, with columns of \n- Name of the person commented\n- Designation of the person\n- Profile Picture URL\n- Email address (if present in comment)\n- Comment (UTF-8 encoded)\n\nare stored in a csv.\n\n## Requirements\n- [python](https://www.python.org/) (recommended : \u003e= 3.7)\n- [selenium](https://pypi.org/project/selenium/) \n\u003c!-- \u003cimg src=\"assets/selenium_logo.png\" width=\"20\" height=\"20\"\u003e\u003c/img\u003e --\u003e\n- [web-driver](https://pypi.org/project/webdriver-manager/)\n\nInstall the dependencies:\n```bash\npip install -r requirements.txt\n```\n\n## Usage\n\n- In `config.json`, enter the required url of LinkedIn Post in **post_url** variable:\n```python\npost_url = \"\"\n```\n\u003e *__NOTE__*: If you forget to enter here, it will be asked during execution of script itself.\n\n- You can also change csv file name (in which scraped data will be stored) and dir name (in which profile pics will be downloaded) in `config.json` .\n- Help:\n```\nusage: main.py [-h] [--headless] [--show-replies] [--download-pfp]\n\nLinkedin Scraping.\n\noptions:\n  -h, --help      show this help message and exit\n  --headless      Go headless browsing\n  --show-replies  Load all replies to comments\n  --download-pfp  Download profile pictures of commentors\n```\n\u003e *__NOTE__*: Even if the flag `--download-pfp` isn't provided, URLs of image would get stored in the output csv.\n\n- Run the script:\n```bash\npython main.py\n```\n\nLogin email and password for your LinkedIn account will be asked and process would start.\n\n### Configuration\n\n`config.json` contains various fields, containing information about scraping the HTML elements by name or xpath, and other metadata\n\nSuggestions and contributio ns are alwasy welcome!:smile:\n\n## Star History\n\n[![Star History Chart](https://api.star-history.com/svg?repos=gurbaaz27/linkedin-comments-scraper\u0026type=Date)](https://star-history.com/#gurbaaz27/linkedin-comments-scraper\u0026Date)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgurbaaz27%2Flinkedin-comments-scraper","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgurbaaz27%2Flinkedin-comments-scraper","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgurbaaz27%2Flinkedin-comments-scraper/lists"}