https://github.com/islamhafez0/web-crawler

books-toscrap flask json pagination python restful-api scrapy web-crawling web-scraping

Last synced: about 2 months ago
JSON representation

Host: GitHub
URL: https://github.com/islamhafez0/web-crawler
Owner: islamhafez0
Created: 2024-11-03T01:55:25.000Z (over 1 year ago)
Default Branch: master
Last Pushed: 2024-11-05T17:55:46.000Z (over 1 year ago)
Last Synced: 2025-04-12T14:46:56.064Z (about 1 year ago)
Topics: books-toscrap, flask, json, pagination, python, restful-api, scrapy, web-crawling, web-scraping
Language: Python
Homepage:
Size: 1.11 MB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Web Crawler API

### This app provides a straightforward way to crawl book data from the "Books to Scrape" website and transform it into a structured, paginated API. This setup allows easy access to book information, enabling developers to integrate this data into web applications, mobile apps, or other platforms.

## usage
- `pip install scrapy`
- customize your app as needed
- `scrapy crawl mycrawler(identifier) -o output.json`

## Routes
- `/api/books?page=&per_page=`
- (Single Book): `/api/books/`

`Eslam Hafez`

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/islamhafez0/web-crawler

Awesome Lists containing this project

README