https://github.com/catchpoint/webpagetest.crawler
https://github.com/catchpoint/webpagetest.crawler
Last synced: 11 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/catchpoint/webpagetest.crawler
- Owner: catchpoint
- License: apache-2.0
- Created: 2021-06-14T17:15:46.000Z (about 5 years ago)
- Default Branch: main
- Last Pushed: 2025-01-17T18:38:49.000Z (over 1 year ago)
- Last Synced: 2025-07-24T09:06:27.151Z (11 months ago)
- Language: TypeScript
- Size: 160 KB
- Stars: 1
- Watchers: 3
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README

Learn about more WebPageTest API Integrations in our docs
# WebPageTest Crawler
The WebPageTest Crawler, crawls through the website to fetch URLs and then runs test on them. Level and URL limit can be given.

Requires node, npm.
### 1. Installing Packages
Once you have cloned the project run `npm install` to install dependencies. requires node v22.13.0 minumum
```bash
npm install
```
### 2. Updating config values
There are 3 main config values : -
1. wpt_api_key - Check [here](https://docs.webpagetest.org/api/keys/) the API documentation
2. level - integer value, specifies maximum depth the crawler should crawl.
3. limit - integer value, specifies maximum limit of URLs need to tested.
Note : - Crawling stops if either of them reaches a limit.
### 3. Adding a initial URLs txt file
You can add your initial set of URLs to the startingUrls.txt file by seperating them using a comma.

### 4. Lets fire it up
```bash
npm run build & node build/index.js -k [YOUR_API_KEY] -f ./startingUrls.txt
```
Booyah, once the crawl-testing is complete you'll have a report.csv file which includes performance details of the URLs crawled.