https://github.com/commoncrawl/cc-webgraph-statistics
Statistics of Common Crawl monthly Web Graphs
https://github.com/commoncrawl/cc-webgraph-statistics
Last synced: 9 months ago
JSON representation
Statistics of Common Crawl monthly Web Graphs
- Host: GitHub
- URL: https://github.com/commoncrawl/cc-webgraph-statistics
- Owner: commoncrawl
- License: apache-2.0
- Created: 2024-12-09T18:20:03.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-07-25T15:59:01.000Z (11 months ago)
- Last Synced: 2025-07-25T22:47:22.895Z (11 months ago)
- Language: Python
- Homepage: https://commoncrawl.github.io/cc-webgraph-statistics/
- Size: 132 MB
- Stars: 5
- Watchers: 7
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# cc-webgraph-statistics

Web page showing statistics and plots derived from Common Crawl's monthly Web Graphs, and generation tools.
## Setup
```
cd src
make
```
## Updating
```
cd src
make update
```
## Modifications
Sometimes you may want to generate the HTML without generating the plots. You can do this with:
```
cd src
make noplot
```