An open API service indexing awesome lists of open source software.

https://github.com/spaames/babelio-scraping-api

[Flask] Scraping API - Retrieves book information from the babelio website
https://github.com/spaames/babelio-scraping-api

flask flask-api python scraper scraping

Last synced: about 2 months ago
JSON representation

[Flask] Scraping API - Retrieves book information from the babelio website

Awesome Lists containing this project

README

          

# Book Scraping API

An open-source Flask API that retrieves book information from the [Babelio](https://www.babelio.com) website. This API is used to power a personal book library management application ([Panthota]()).

## Features

- ***Babelio scraping***: Retrieves detailed book information based on the book title / author or ISBN.
- ***Data Return***: Returns data like title, author, cover image, summary.
- ***Frontend Communication***: This API is used by a Next.js application to display the results and store books in a MongoDB database.

## Installation

### Prerequisites

- **Python** version 3.13 or above
- **Poetry** to install dependencies

### Installation and Running Steps

```
git clone https://github.com/Spaames/babelio-scraping-api.git
cd babelio-scraping-api
poetry update
python3.13 app.py
```

By default, the app serves on localhost and on basic flask port (5000).
Edit the app.py to change this.

## Endpoints

- **/search** : GET -- params : query= --> return the result of the search field as a list of book (title, author, cover and book_url)
- **/book** : GET -- params : query= --> return data for a specific book url,