https://github.com/spaames/babelio-scraping-api
[Flask] Scraping API - Retrieves book information from the babelio website
https://github.com/spaames/babelio-scraping-api
flask flask-api python scraper scraping
Last synced: about 2 months ago
JSON representation
[Flask] Scraping API - Retrieves book information from the babelio website
- Host: GitHub
- URL: https://github.com/spaames/babelio-scraping-api
- Owner: Spaames
- License: mit
- Created: 2025-02-27T09:48:41.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-28T15:31:54.000Z (over 1 year ago)
- Last Synced: 2025-02-28T18:13:38.771Z (over 1 year ago)
- Topics: flask, flask-api, python, scraper, scraping
- Language: Python
- Homepage:
- Size: 17.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Book Scraping API
An open-source Flask API that retrieves book information from the [Babelio](https://www.babelio.com) website. This API is used to power a personal book library management application ([Panthota]()).
## Features
- ***Babelio scraping***: Retrieves detailed book information based on the book title / author or ISBN.
- ***Data Return***: Returns data like title, author, cover image, summary.
- ***Frontend Communication***: This API is used by a Next.js application to display the results and store books in a MongoDB database.
## Installation
### Prerequisites
- **Python** version 3.13 or above
- **Poetry** to install dependencies
### Installation and Running Steps
```
git clone https://github.com/Spaames/babelio-scraping-api.git
cd babelio-scraping-api
poetry update
python3.13 app.py
```
By default, the app serves on localhost and on basic flask port (5000).
Edit the app.py to change this.
## Endpoints
- **/search** : GET -- params : query= --> return the result of the search field as a list of book (title, author, cover and book_url)
- **/book** : GET -- params : query= --> return data for a specific book url,