An open API service indexing awesome lists of open source software.

https://github.com/springload/site-url-scraper

Multithreaded Python application to search a string pattern or find all internal links from a url or CSV list of urls
https://github.com/springload/site-url-scraper

python scraper

Last synced: 4 months ago
JSON representation

Multithreaded Python application to search a string pattern or find all internal links from a url or CSV list of urls

Awesome Lists containing this project

README

          

# Site url scraper

Multithreaded Python application to search a string pattern or find all internal links from a url or CSV list of urls

**/!\ This project isn't actively maintained. Check alternatives like [pylinkchecker](https://github.com/mtlevolio/pylinkchecker) or [broken-link-checker](https://github.com/stevenvachon/broken-link-checker) /!\\**

## Installation

```bash
git clone REPOSITORY_URL
cd site-url-scraper
pip install -e .
```

## Usage

```bash
python GLSapp/Main.py --help
```