https://github.com/m98/email-extractor-crawler

A minimal Node crawler that finds email addresses in a website's content. It follows links within the site and looks for email addresses in the content of each page.

crawler email javascript lowdb node-crawler nodejs scraper

Last synced: 8 months ago

Host: GitHub
URL: https://github.com/m98/email-extractor-crawler
Owner: m98
License: mit
Created: 2019-07-03T21:30:08.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2022-12-09T16:55:00.000Z (about 3 years ago)
Last Synced: 2025-06-05T09:09:39.731Z (9 months ago)
Topics: crawler, email, javascript, lowdb, node-crawler, nodejs, scraper
Language: JavaScript
Size: 51.8 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# email-extractor-crawler
A minimal Node crawler that finds email addresses in a website's content. It follows links within the site and looks for email addresses in the content of each page. You only need to pass a domain name and wait while the crawler searches the pages for emails.
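
The per-page step described above (find emails in the content, then collect links to follow) can be sketched roughly as follows. This is an illustration only, not the code in this repository: the helper names `extractEmails` and `extractSameSiteLinks`, the regular expressions, and the same-host rule are all assumptions.

```javascript
// Hypothetical helpers; none of these names come from the repository.

// A deliberately simple pattern for email-like strings (assumption).
const EMAIL_PATTERN = /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g;

// Return the unique, lowercased email-like strings found in page content.
function extractEmails(content) {
  const matches = content.match(EMAIL_PATTERN) || [];
  return [...new Set(matches.map((email) => email.toLowerCase()))];
}

// Return absolute http(s) links on the same host as pageUrl,
// taken from href attributes in the HTML.
function extractSameSiteLinks(html, pageUrl) {
  const base = new URL(pageUrl);
  const links = new Set();
  for (const match of html.matchAll(/href\s*=\s*["']([^"']+)["']/gi)) {
    let target;
    try {
      target = new URL(match[1], base); // resolves relative links
    } catch {
      continue; // skip malformed URLs
    }
    if (target.protocol !== 'http:' && target.protocol !== 'https:') continue;
    if (target.host !== base.host) continue; // stay on the same site
    target.hash = ''; // ignore in-page anchors
    links.add(target.href);
  }
  return [...links];
}
```

A crawler built on these helpers would fetch a page, call both functions on the response body, store the emails, and queue any links it has not visited yet.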

You can find the list of extracted emails in `email-extractor-crawler-db.json` in the root directory.
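
To inspect the results from a script, something like the sketch below may help. The README does not document the layout of the JSON file, so this example assumes nothing about it beyond being valid JSON: it walks the whole structure and collects every string that looks like an email address. The function names are hypothetical.

```javascript
const fs = require('fs');

// Recursively collect every string value that looks like an email address.
function collectEmails(value, found = new Set()) {
  if (typeof value === 'string') {
    if (/^[^\s@]+@[^\s@]+\.[^\s@]+$/.test(value)) found.add(value);
  } else if (Array.isArray(value)) {
    value.forEach((item) => collectEmails(item, found));
  } else if (value !== null && typeof value === 'object') {
    Object.values(value).forEach((item) => collectEmails(item, found));
  }
  return found;
}

// Read a JSON file and return the sorted, unique emails found anywhere in it.
function readExtractedEmails(dbPath) {
  const data = JSON.parse(fs.readFileSync(dbPath, 'utf8'));
  return [...collectEmails(data)].sort();
}

// Example call, run from the repository root after a crawl:
// console.log(readExtractedEmails('email-extractor-crawler-db.json'));
```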

## How to use?
Update the first line of `src/index.js` (domain) to your preferred domain name. The full URL of the domain is required, e.g. `https://google.com/`. Then run `node src/index.js` in a terminal.
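
Because the domain has to be a full URL, a small check like the one below can catch a bare host name such as `google.com` before a crawl starts. This is a sketch and not part of the repository; `normalizeDomain` is a hypothetical name.

```javascript
// Hypothetical helper: verifies that the configured domain is a full
// http(s) URL and returns it in normalized form.
function normalizeDomain(domain) {
  let url;
  try {
    url = new URL(domain); // throws for values without a scheme, e.g. "google.com"
  } catch {
    throw new Error(`Expected a full URL such as https://google.com/, got "${domain}"`);
  }
  if (url.protocol !== 'http:' && url.protocol !== 'https:') {
    throw new Error(`Expected an http(s) URL, got "${domain}"`);
  }
  return url.href;
}
```

For example, `normalizeDomain('https://google.com')` returns `https://google.com/`, while `normalizeDomain('google.com')` throws.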
