https://github.com/islamhafez0/web-crawler
- Host: GitHub
- URL: https://github.com/islamhafez0/web-crawler
- Owner: islamhafez0
- Created: 2024-11-03T01:55:25.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2024-11-05T17:55:46.000Z (over 1 year ago)
- Last Synced: 2025-04-12T14:46:56.064Z (about 1 year ago)
- Topics: books-toscrap, flask, json, pagination, python, restful-api, scrapy, web-crawling, web-scraping
- Language: Python
- Homepage:
- Size: 1.11 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Web Crawler API
This app provides a straightforward way to crawl book data from the "Books to Scrape" website and transform it into a structured, paginated API. This setup allows easy access to book information, enabling developers to integrate this data into web applications, mobile apps, or other platforms.
## Usage
- `pip install scrapy`
- Customize your app as needed
- `scrapy crawl mycrawler -o output.json` (here `mycrawler` is the spider's identifier)
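The crawl step above writes the scraped items to `output.json`; Scrapy's JSON feed export stores them as a single JSON array. As a minimal sketch, assuming that file layout, the items could be loaded like this. The helper name `load_books` is hypothetical and not part of this repository.

```python
import json
from pathlib import Path


def load_books(path="output.json"):
    """Load items exported by `scrapy crawl ... -o output.json`.

    Hypothetical helper: assumes the file holds one JSON array of
    item objects, which is what Scrapy's JSON feed export produces.
    """
    text = Path(path).read_text(encoding="utf-8")
    books = json.loads(text)
    if not isinstance(books, list):
        raise ValueError(
            f"expected a JSON array in {path}, got {type(books).__name__}"
        )
    return books
```

Note that in recent Scrapy versions `-o` appends to an existing file, which can leave a `.json` file invalid after a second run; deleting the old file first (or using `-O` to overwrite, where available) avoids that.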
## Routes
- (Paginated List): `/api/books?page=&per_page=`
- (Single Book): `/api/books/`
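To illustrate how the `page` and `per_page` query parameters might map onto the crawled data, here is a minimal sketch of a pagination helper. It is not the repository's actual implementation: the function name `paginate`, the 1-based page numbering, the default of 10 items per page, and the response fields (`total`, `total_pages`, `items`) are all assumptions.

```python
import math


def paginate(items, page=1, per_page=10):
    """Return one page of `items` plus pagination metadata.

    Assumed conventions (not confirmed by the source): `page` is
    1-based, and an out-of-range page yields an empty `items` list.
    """
    if page < 1 or per_page < 1:
        raise ValueError("page and per_page must be positive integers")
    start = (page - 1) * per_page
    total = len(items)
    return {
        "page": page,
        "per_page": per_page,
        "total": total,
        "total_pages": math.ceil(total / per_page),
        "items": items[start:start + per_page],
    }


# Example with 25 made-up records: the third page of 10 holds the last 5.
books = [{"id": i, "title": f"Book {i}"} for i in range(1, 26)]
result = paginate(books, page=3, per_page=10)
print(result["total_pages"], [b["id"] for b in result["items"]])
```

In a Flask view, the two values would typically come from the query string, for example via `request.args.get("page", 1, type=int)`, before being passed to a helper like this one.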
`Eslam Hafez`