https://github.com/nirantak/scraper

# Scraper

> _Python web scrapers built using Selenium, BS4 and Playwright_

## Table of Contents

- [Scraper](#scraper)
  - [Table of Contents](#table-of-contents)
  - [Installation](#installation)
  - [Usage](#usage)
  - [Requirements](#requirements)

## Installation

Clone the git repository:

```bash
git clone https://github.com/nirantak/scraper.git && cd scraper
cp -nv .env.sample .env # copy and update the env variables
```

Install the necessary dependencies:

```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -U pip wheel setuptools
pip install -U -r requirements.txt
playwright install
```
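
To confirm that the install step worked, a quick check like the one below can be run inside the activated virtual environment. It is only a sketch: the module names `bs4`, `selenium` and `playwright` are assumed from the project tagline, and `requirements.txt` remains the authoritative list. The helper `missing_modules` is a hypothetical name, not part of this repository.

```python
from importlib.util import find_spec

# Assumed import names, inferred from the project tagline (BS4, Selenium, Playwright).
# The authoritative dependency list lives in requirements.txt.
ASSUMED_MODULES = ("bs4", "selenium", "playwright")


def missing_modules(names=ASSUMED_MODULES):
    """Return the names from `names` that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All assumed modules are importable.")
```

If any module is reported missing, re-running `pip install -U -r requirements.txt` with the virtual environment activated is the first thing to try.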

## Usage

See [scrapers/README.md](scrapers/) for usage instructions.

Samples are available in [demo/](demo/).

## Requirements

1. [Python 3.10](https://www.python.org/downloads/)
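
The interpreter version can be checked before creating the virtual environment. The sketch below assumes that Python 3.10 is a minimum rather than an exact pin, which this README does not state. `is_supported` and `MIN_VERSION` are hypothetical names used for illustration only.

```python
import sys

# Assumption: 3.10 is treated as the minimum supported version, not an exact pin.
MIN_VERSION = (3, 10)


def is_supported(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) part of `version_info` is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    if is_supported():
        print(f"Python {current} meets the assumed minimum.")
    else:
        print(f"Python {current} is older than the assumed minimum 3.10.")
```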