Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/danhje/dead-link-crawler

An efficient, asynchronous crawler that identifies broken links on a given domain.
https://github.com/danhje/dead-link-crawler

async broken-links crawler dead-links python python3

Last synced: 5 days ago
JSON representation

An efficient, asynchronous crawler that identifies broken links on a given domain.

Awesome Lists containing this project

README

        

# Dead Link Crawler
An efficient, asynchronous crawler that identifies broken links on a given domain.

## Installation
```shell
git clone https://github.com/danhje/dead-link-crawler.git
cd dead-link-crawler
pipenv install
```

## Usage
To start Python from within the virtual environment:
```shell
pipenv run python
```
To start the crawl and print the results:
```python
from deadLinkCrawler import DeadLinkCrawler

crawler = DeadLinkCrawler()
crawler.startCrawl('http://danielhjertholm.me/prosjekter.htm', verbose=True)
crawler.printDeadLinks()
checkedLinks = crawler.checkedLinks
deadLinks = list(crawler.deadLinks)
```