An open API service indexing awesome lists of open source software.

https://github.com/akkasayaz/quickscrape

QuickScrape is a lightweight, high-level web scraping library for Python that makes extracting structured data from websites simple and intuitive. Unlike more complex scraping frameworks, QuickScrape focuses on providing straightforward APIs for common extraction tasks.
https://github.com/akkasayaz/quickscrape

collaborate github github-copilot

Last synced: 2 days ago
JSON representation

QuickScrape is a lightweight, high-level web scraping library for Python that makes extracting structured data from websites simple and intuitive. Unlike more complex scraping frameworks, QuickScrape focuses on providing straightforward APIs for common extraction tasks.

Awesome Lists containing this project

README

          

# QuickScrape

A simple Python library for extracting common data types from web pages.

## Installation

```bash
pip install quickscrape
```

## Usage

```python-repl
import quickscrape

# Extract emails from a webpage
emails = quickscrape.extract("email", "https://example.com/contact")

# Extract tables
tables = quickscrape.extract("table", "https://example.com/data")
# Get tables as pandas DataFrames
tables_df = quickscrape.extract("table", "https://example.com/data", output_format="dataframe")

# Extract multiple data types at once
results = quickscrape.extract(["email", "table"], "https://example.com")email", "https://example.com/contact")

# Extract tables
tables = quickscrape.extract("table", "https://example.com/data")
```

## License

MIT

### Development Workflow

1. **Local Development**: Install your package in development mode:

```bash
pip install -e .
```
2. **Testing** : Use pytest for writing and running tests.

```bash
pip install pytest
pytest
```