https://github.com/lablnet/email-spider
Multi threaded Email crawler.
crawl email multithreading pyhon3 spider
- Host: GitHub
- URL: https://github.com/lablnet/email-spider
- Owner: lablnet
- License: mit
- Created: 2020-07-19T14:46:22.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2024-05-06T15:47:01.000Z (8 months ago)
- Last Synced: 2024-05-06T17:15:53.085Z (8 months ago)
- Topics: crawl, email, multithreading, pyhon3, spider
- Language: Python
- Homepage:
- Size: 11.7 KB
- Stars: 9
- Watchers: 3
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: readme.md
- License: LICENSE
# Overview
A multi-threaded email crawler written in Python.
***For now, the purpose of this tool is to gather emails only. See the Todo section for planned features.***
To run the crawler, please type `python3 index.py` and enter a URL to crawl.
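As a rough illustration of the email-gathering step, here is a minimal sketch that pulls addresses out of a page's HTML with a regular expression. It is not taken from this repository: `EMAIL_RE` and `extract_emails` are hypothetical names, and the pattern is deliberately simple, so it will miss some valid addresses and may accept some invalid ones.

```python
from __future__ import annotations

import re

# Assumption: a simple pattern, not a full RFC 5322 matcher.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html: str) -> set[str]:
    """Return the unique, lower-cased email addresses found in a page."""
    return {match.lower() for match in EMAIL_RE.findall(html)}


if __name__ == "__main__":
    page = '<a href="mailto:Info@Example.com">Mail us</a> or sales@example.org.'
    print(sorted(extract_emails(page)))
```

Lower-casing before collecting into a set means the same address written with different capitalisation is reported once.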
**This project is inspired by [hunter.io](https://hunter.io/)**
## Todo
- [x] Gather pages from home page
- [x] Gather Emails
- [x] Multi-threaded
- [x] Configuration support
- [ ] Benchmarking
- [ ] Make the package flexible and easy to use without touching any core files
- [ ] Components to extend project
- [ ] Database layer
- [ ] Verification of email
- [ ] Get reference link of emails
- [ ] Searching algorithms
- [ ] Add more tests
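
The three completed items at the top of the list (gathering pages from the home page, gathering emails, and multi-threading) could fit together roughly as in the sketch below. It is an illustration under assumptions, not this project's code: `LinkCollector`, `same_site_links`, `crawl`, and the caller-supplied `fetch` callable are all hypothetical names, and only pages linked directly from the home page are visited.

```python
from concurrent.futures import ThreadPoolExecutor
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
import re

# Assumption: a simple pattern, not a full RFC 5322 matcher.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def same_site_links(base_url, html):
    """Return absolute http(s) links that stay on the host of base_url."""
    parser = LinkCollector()
    parser.feed(html)
    host = urlparse(base_url).netloc
    links = set()
    for href in parser.hrefs:
        absolute = urljoin(base_url, href).split("#")[0]
        parsed = urlparse(absolute)
        if parsed.scheme in ("http", "https") and parsed.netloc == host:
            links.add(absolute)
    return links


def crawl(home_url, fetch, max_workers=8):
    """Fetch the home page, then its same-site links in parallel threads.

    `fetch` is any callable mapping a URL to the page's HTML as a string.
    An exception raised by `fetch` propagates to the caller.
    """
    home_html = fetch(home_url)
    pages = same_site_links(home_url, home_html) - {home_url}
    emails = set(EMAIL_RE.findall(home_html))
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for html in pool.map(fetch, sorted(pages)):
            emails.update(EMAIL_RE.findall(html))
    return {email.lower() for email in emails}
```

Passing `fetch` in keeps the sketch testable without network access. For a real run, one option is a small wrapper around the standard library, for example `lambda url: urllib.request.urlopen(url, timeout=10).read().decode("utf-8", "replace")`. Threads suit this workload because most of the time is spent waiting on network responses.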
## Contributions
There is still a lot of work to do, so feel free to contribute by opening a `PR`.
## License
MIT
## Disclaimer
**I do not accept responsibility for any illegal usage**