{"id":19819077,"url":"https://github.com/dmitriiweb/extract-emails","last_synced_at":"2025-04-07T13:08:30.093Z","repository":{"id":23122619,"uuid":"98173039","full_name":"dmitriiweb/extract-emails","owner":"dmitriiweb","description":"Extract emails from a given website","archived":false,"fork":false,"pushed_at":"2024-06-02T09:48:46.000Z","size":12740,"stargazers_count":98,"open_issues_count":4,"forks_count":36,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-31T11:08:12.140Z","etag":null,"topics":["email","extract-emails","linkedin","parser","parsing","parsing-library","python","scraper"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dmitriiweb.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-07-24T09:23:48.000Z","updated_at":"2025-03-12T06:58:21.000Z","dependencies_parsed_at":"2024-12-03T17:53:08.727Z","dependency_job_id":"f63936f4-4e41-436f-b4f3-c2e0a34f9cef","html_url":"https://github.com/dmitriiweb/extract-emails","commit_stats":{"total_commits":218,"total_committers":6,"mean_commits":"36.333333333333336","dds":"0.44495412844036697","last_synced_commit":"26c99dc459b0c3e1135a96f514b166aa6810adaa"},"previous_names":[],"tags_count":8,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmitriiweb%2Fextract-emails","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmitriiweb%2Fextract-emails/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmitriiweb%2Fextract-emails/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dmitriiweb%2Fextract-emails/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dmitriiweb","download_url":"https://codeload.github.com/dmitriiweb/extract-emails/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247657281,"owners_count":20974345,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["email","extract-emails","linkedin","parser","parsing","parsing-library","python","scraper"],"created_at":"2024-11-12T10:17:45.130Z","updated_at":"2025-04-07T13:08:30.053Z","avatar_url":"https://github.com/dmitriiweb.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Extract Emails\n\n![Image](https://github.com/dmitriiweb/extract-emails/blob/docs_improvements/images/email.png?raw=true)\n\n[![PyPI version](https://badge.fury.io/py/extract-emails.svg)](https://badge.fury.io/py/extract-emails)\n\nExtract emails and linkedins profiles from a given website\n\n**Support the project with BTC**: *bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l*\n\n[Documentation](https://dmitriiweb.github.io/extract-emails/)\n\n## Requirements\n\n- Python \u003e= 3.10\n\n## Installation\n\n```bash\npip install extract_emails[all]\n# or\npip install extract_emails[requests]\n# or\npip install extract_emails[selenium]\n```\n\n## Simple Usage\n\n### As library\n\n```python\nfrom pathlib import Path\n\nfrom extract_emails import DefaultFilterAndEmailFactory as Factory\nfrom extract_emails import DefaultWorker\nfrom extract_emails.browsers.requests_browser import RequestsBrowser as Browser\nfrom extract_emails.data_savers import CsvSaver\n\n\nwebsites = [\n    \"website1.com\",\n    \"website2.com\",\n]\n\nbrowser = Browser()\ndata_saver = CsvSaver(save_mode=\"a\", output_path=Path(\"output.csv\"))\n\nfor website in websites:\n    factory = Factory(\n        website_url=website, browser=browser, depth=5, max_links_from_page=1\n    )\n    worker = DefaultWorker(factory)\n    data = worker.get_data()\n    data_saver.save(data)\n```\n\n### As CLI tool\n\n```bash\n$ extract-emails --help\n\n$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1\n$ cat output.csv\nemail,page,website\nbob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email\n```\n\n### By me a coffee\n\n- **USDT** (TRC20): TXuYegp5L8Zf7wF2YRFjskZwdBxhRpvxBS\n- **BEP20**: 0x4D51Db2B754eA83ce228F7de8EaEB93a88bdC965\n- **TON**: UQA5quJljQz84RwzteN3uuKsdPTDee7a_GF5lgIgezA2oib5\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdmitriiweb%2Fextract-emails","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdmitriiweb%2Fextract-emails","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdmitriiweb%2Fextract-emails/lists"}