Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cjbarker/top-domains
Top 1 Million ranked websites and top-level domains (TLD)
- Host: GitHub
- URL: https://github.com/cjbarker/top-domains
- Owner: cjbarker
- License: apache-2.0
- Created: 2018-08-10T16:05:32.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2023-01-31T21:39:14.000Z (almost 2 years ago)
- Last Synced: 2024-10-15T12:26:21.627Z (about 1 month ago)
- Language: Shell
- Size: 247 MB
- Stars: 9
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG
- License: LICENSE
# TOP DOMAINS
[![GitLab license](https://img.shields.io/badge/license-Apache2.0-brightgreen.svg)](https://gitlab.com/cjbarker/top-domains/blob/master/LICENSE)
## About
The repo contains the top-ranked top-level domains (TLDs) and websites tracked via [Cisco's Umbrella Popularity List](https://s3-us-west-1.amazonaws.com/umbrella-static/index.html). Potential future enhancements may include additional record sources for merging (e.g., Alexa 1 Million).

The repo's goal is to provide simple, static comma-separated files that are easy to ingest and use.
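
As a rough illustration of that ingestion goal, the sketch below reads one of the CSV files line by line in plain POSIX shell. It assumes the two-column layout seen in the example output in the Usage section (a rank, a comma, then a domain). The function name `print_top` is hypothetical and is not part of the repository.

```bash
# print_top FILE N: print the first N records of a rank,domain CSV
# as "domain (rank R)". Hypothetical helper, not provided by the repo.
print_top() {
    file=$1
    limit=$2
    count=0
    # IFS=, splits each line at the comma into the two fields.
    while IFS=, read -r rank domain; do
        [ "$count" -ge "$limit" ] && break
        printf '%s (rank %s)\n' "$domain" "$rank"
        count=$((count + 1))
    done < "$file"
}
```

Once a list is available locally, a call such as `print_top top-recs/top-sites-100.csv 3` would print its first three entries.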
## File Downloads
The files can be downloaded in several ways:

1. [Download archive release](https://gitlab.com/cjbarker/top-domains/uploads/top-recs-20200521.zip) in 7zip format that includes all the files
```bash
wget https://gitlab.com/cjbarker/top-domains/uploads/top-recs-20200521.zip
```

2. Download all files by cloning the repository
```bash
git clone git@gitlab.com:cjbarker/top-domains.git
cd top-domains/top-recs
```

3. Download an individual file as a raw file from the top-recs directory in the repository
```bash
wget https://gitlab.com/cjbarker/top-domains/raw/master/top-recs/top-sites-1000000.csv
```

4. Run the program directly via wget piped to sh (see usage below)
```bash
wget -qO- https://gitlab.com/cjbarker/top-domains/raw/master/create-lists.sh | sh
```

## Usage

The files can be downloaded directly from the top-recs directory in the repo, or generated locally by running the script.

If you choose to run the script yourself locally, the following commands will execute it:
```bash
# Downloads and splits records accordingly
wget -qO- https://gitlab.com/cjbarker/top-domains/raw/master/create-lists.sh | sh

# Available files separated by TLD and websites
# Format: rank,domain
ls top-recs/
top-TLD-100.csv top-TLD-4121.csv top-sites-1000.csv top-sites-100000.csv
top-TLD-1000.csv top-sites-100.csv top-sites-10000.csv top-sites-1000000.csv

# Example output of ranked top-level domains (TLD)
head top-recs/top-TLD-100.csv
1,com
2,net
3,googleapis.com
4,org
5,io
6,cn
7,goog
8,co
9,vn
10,tv

# Example output of ranked top websites
head top-recs/top-sites-100.csv
1,google.com
2,microsoft.com
3,www.google.com
4,windowsupdate.com
5,ctldl.windowsupdate.com
6,data.microsoft.com
7,facebook.com
8,netflix.com
9,safebrowsing.googleapis.com
10,live.com
```
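
Because each record is a rank followed by a name, lookups are straightforward with standard tools. The following is a minimal sketch, assuming a list has already been generated or downloaded into `top-recs/`. `rank_of` is a hypothetical helper name, not something the repository provides.

```bash
# rank_of FILE DOMAIN: print the rank of DOMAIN in a rank,domain CSV.
# Returns non-zero when the domain is not listed.
rank_of() {
    awk -F, -v d="$2" '
        $2 == d { print $1; found = 1; exit }   # exact match on column 2
        END     { exit !found }                 # status 1 if never matched
    ' "$1"
}
```

With the sample output shown above, `rank_of top-recs/top-sites-100.csv netflix.com` would print `8`. The match is exact, so `google.com` and `www.google.com` are treated as separate entries, as they are in the list itself.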