https://github.com/tb0hdan/domains
World’s single largest Internet domains dataset
- Host: GitHub
- URL: https://github.com/tb0hdan/domains
- Owner: tb0hdan
- License: BSD-3-Clause
- Created: 2020-01-12T22:39:04.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2025-03-29T22:21:28.000Z (3 months ago)
- Last Synced: 2025-03-29T23:22:13.633Z (3 months ago)
- Topics: colly, dataset, internet-domains, scrapy, search-engines, yacy
- Language: HTML
- Homepage: https://domainsproject.org
- Size: 1.68 GB
- Stars: 753
- Watchers: 31
- Forks: 122
- Open Issues: 7
Metadata Files:
- Readme: README.md
- Changelog: NEWS.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
- my-awesome - tb0hdan/domains - domains,scrapy,search-engines,yacy pushed_at:2025-05 star:0.8k fork:0.1k World’s single largest Internet domains dataset (HTML)
- awesome-hacking-lists - tb0hdan/domains - World’s single largest Internet domains dataset (HTML)