https://github.com/catchpoint/webpagetest.crawler

Host: GitHub
URL: https://github.com/catchpoint/webpagetest.crawler
Owner: catchpoint
License: apache-2.0
Created: 2021-06-14T17:15:46.000Z (about 5 years ago)
Default Branch: main
Last Pushed: 2025-01-17T18:38:49.000Z (over 1 year ago)
Last Synced: 2025-07-24T09:06:27.151Z (11 months ago)
Language: TypeScript
Size: 160 KB
Stars: 1
Watchers: 3
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Learn more about WebPageTest API integrations in our docs.

# WebPageTest Crawler

The WebPageTest Crawler crawls a website to collect URLs and then runs WebPageTest tests on them. You can set a maximum crawl depth (level) and a maximum number of URLs (limit).

![image](https://user-images.githubusercontent.com/31168643/122060468-40ad6680-ce0b-11eb-9f25-ea51eaac22f9.png)

Requires Node.js and npm.

### 1. Installing Packages

Once you have cloned the project, run `npm install` to install the dependencies. Node.js v22.13.0 or later is required.

```bash
npm install
```

### 2. Updating config values

There are three main config values:

1. wpt_api_key - your WebPageTest API key. See the [API key documentation](https://docs.webpagetest.org/api/keys/).
2. level - integer value; the maximum depth the crawler should crawl.
3. limit - integer value; the maximum number of URLs to test.

Note: Crawling stops as soon as either limit is reached.
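
To illustrate how the two caps interact, here is a minimal sketch of a breadth-first crawl that stops when either the depth or the URL count is reached. It is not the project's actual implementation: `collectUrls` and the `getLinks` callback are hypothetical names, link discovery is abstracted away and kept synchronous for brevity, and the sketch assumes the starting URLs count as depth 0.

```typescript
// Hypothetical sketch, not this project's code.
// `getLinks` stands in for whatever fetches a page and extracts its links.
type GetLinks = (url: string) => string[];

function collectUrls(
  startUrls: string[],
  level: number, // maximum crawl depth (assumption: starting URLs are depth 0)
  limit: number, // maximum number of URLs to collect
  getLinks: GetLinks,
): string[] {
  const seen = new Set<string>();
  const result: string[] = [];
  let frontier: string[] = startUrls;

  for (let depth = 0; depth <= level && frontier.length > 0; depth++) {
    const next: string[] = [];
    for (const url of frontier) {
      if (seen.has(url)) continue; // skip URLs already collected
      if (result.length >= limit) return result; // URL cap reached
      seen.add(url);
      result.push(url);
      // Only follow links if one more level is still allowed.
      if (depth < level) next.push(...getLinks(url));
    }
    frontier = next;
  }
  return result; // depth cap reached or no more links to follow
}
```

Going level by level keeps the depth of every URL implicit in the loop counter, so a shallow page is always collected before a deeper one when the URL cap is tight.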

### 3. Adding an initial URLs text file

You can add your initial set of URLs to the startingUrls.txt file, separating them with commas.

![image](https://user-images.githubusercontent.com/31168643/122050545-2a021200-ce01-11eb-9400-31e7716791c0.png)
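
As a rough illustration of the comma-separated format, the sketch below splits the file contents into a clean list of URLs. `parseStartingUrls` is a hypothetical helper, not a function exported by this project, and it assumes that surrounding whitespace and empty entries (for example after a trailing comma) should be ignored.

```typescript
// Hypothetical helper, not part of this project's code.
// Splits comma-separated file contents into trimmed, non-empty URL strings.
function parseStartingUrls(contents: string): string[] {
  return contents
    .split(",")
    .map((entry) => entry.trim()) // drop spaces and newlines around each URL
    .filter((entry) => entry.length > 0); // ignore blanks such as a trailing comma
}
```

For example, passing the string `"https://example.com, https://example.org/blog,\n"` yields two entries, `https://example.com` and `https://example.org/blog`. The sketch does not check that each entry is a well-formed URL.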

### 4. Let's fire it up

```bash
npm run build && node build/index.js -k [YOUR_API_KEY] -f ./startingUrls.txt
```

Booyah! Once the crawl and tests are complete, you'll have a report.csv file with performance details for the URLs crawled.
