Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/damcraft/cf_percentage
How many of the top 100k domains use cloudflare? (Spoiler: it's a lot more than you probably think)
https://github.com/damcraft/cf_percentage
Last synced: 19 days ago
JSON representation
How many of the top 100k domains use cloudflare? (Spoiler: it's a lot more than you probably think)
- Host: GitHub
- URL: https://github.com/damcraft/cf_percentage
- Owner: DAMcraft
- License: apache-2.0
- Created: 2024-02-25T12:04:14.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-02-25T12:18:47.000Z (10 months ago)
- Last Synced: 2024-10-19T14:59:25.856Z (2 months ago)
- Language: Python
- Size: 558 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# How many of the top 100'000 websites use Cloudflare?
This program uses a dataset of the top 100'000 websites to determine how many of them use Cloudflare.
Attention: It is highly recommended to not run this program at home,
as it will generate a lot of traffic and your Internet access might stop working for a while.
It is recommended to run this program on a server or a cloud instance.
This program uses 100 threads to analyze the data.
The dataset uses the top 100'000 requested domains on Cloudflare's DNS servers from `2024-01-22` until `2024-01-29`.
Updating this data is simply possible by replacing the `100k.csv` with newer data from Cloudflare Radar.## Requirements
- Python 3.8 or higher
- No additional packages## Usage
Just run `python3 main.py`## Results
After running the program on `2024-02-25` on my server, I got the following results:
- Total Cloudflare domains: 48257
- Total non-Cloudflare domains: 33842
- Total unresolved domains: 17902
So 82'099 domains were able to be resolved, of which 48'257 used Cloudflare.
This means that about **58,8%** of the top 100'000 domains use Cloudflare, **more than half**!## Data Attribution
The dataset used in this subproject, "100k.csv", was obtained from Cloudflare Radar and is licensed under a [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/).
**Attribution:**
- Original Data Source: [Cloudflare Radar](https://radar.cloudflare.com/charts/LargerTopDomainsTable/attachment?id=954&top=100000&startDate=2024-01-22&endDate=2024-01-29)
- Author: [Cloudflare](https://www.cloudflare.com/)## License
Copyright 2024, DAMcraft
This project is licensed under the Apache License, Version 2.0. See [LICENSE](LICENSE) for the full license text.