https://github.com/m98/email-extractor-crawler
A minimal Node crawler to find emails used inside a website content, this crawler follows links in the website and tries to find an email in the content of the page
https://github.com/m98/email-extractor-crawler
crawler email javascript lowdb node-crawler nodejs scraper
Last synced: 7 months ago
JSON representation
A minimal Node crawler to find emails used inside a website content, this crawler follows links in the website and tries to find an email in the content of the page
- Host: GitHub
- URL: https://github.com/m98/email-extractor-crawler
- Owner: m98
- License: mit
- Created: 2019-07-03T21:30:08.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2022-12-09T16:55:00.000Z (about 3 years ago)
- Last Synced: 2025-06-05T09:09:39.731Z (7 months ago)
- Topics: crawler, email, javascript, lowdb, node-crawler, nodejs, scraper
- Language: JavaScript
- Size: 51.8 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# email-extractor-crawler
A minimal Node crawler to find emails used inside a website content, this crawler follows links in the website and tries to find an email in the content of the page. You just need to pass a domain name and wait until the crawler search to find email in pages.
You can find list of extracted emails in `email-extractor-crawler-db.json` in the root directory.
## How to use?
Update the first line of `src/index.js` (domain) to your preferred domain name (the full path of domain is necessary, ex: `https://google.com/`). Then just in terminal run `node src/index.js`