https://github.com/deadbeatnoble/scraperest

Web scraping API made with ktor
https://github.com/deadbeatnoble/scraperest

ktor manga-reader manhua-reader manhwa-reader rest-api scraper

Last synced: 28 days ago
JSON representation

Web scraping API made with ktor

Host: GitHub
URL: https://github.com/deadbeatnoble/scraperest
Owner: deadbeatnoble
Created: 2024-04-16T17:08:21.000Z (about 2 years ago)
Default Branch: master
Last Pushed: 2024-04-20T19:49:29.000Z (about 2 years ago)
Last Synced: 2025-01-13T17:24:16.957Z (over 1 year ago)
Topics: ktor, manga-reader, manhua-reader, manhwa-reader, rest-api, scraper
Language: Kotlin
Homepage: https://scrape-v1-0.onrender.com
Size: 189 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

          # ScrapeREST

## Introduction

ScrapeREST is a RESTful API designed for web scraping purposes. It provides a simple and efficient way to extract data from websites using HTTP requests and parsing the HTML responses.

## Project Features

- Fetch web pages using HTTP requests

- Extract data by iterating and searching for specific selectors or tags

- Support for various scraping techniques (XPath, CSS selectors, regular expressions)

- Customizable scraping configurations

- Error handling

## API Base URL

The base URL for the ScrapeREST API is `https://scrape-v1-0.onrender.com`.

## API Endpoints

| HTTP Verbs | Endpoints | Action |

| --- | --- | --- |

| GET | `/feed?type=latest&page={page?}` | Retrieves the latest feed |

| GET | `/feed?type=topview&page={page?}` | Retrieves the popular feed |

| GET | `/feed?type=newest&page={page?}` | Retrieves the new feed |

| GET | `/collection/genre?genre_id={genre_id?}&page={page?}` | Performs a genre-based search |

| GET | `/collection/author?author_id={author_id?}&page={page?}` | Performs an author-based search |

| GET | `/search?title={title?}&page={page?}` | Performs a simple search |

| GET | `/advanced_search?type={type?}&title={title?}&s={s?}&g_i={g_i?}&g_e={g_e?}&stat={stat?}&orby={orby?}&page={page?}` | Performs an advanced search |

| GET | `/manga?manga_id={manga_id?}` | Retrieves manga details |

| GET | `/chapter?manga_id={manga_id?}&chapter_id={chapter_id?}` | Retrieves chapter pages |

## Technologies Used

- Ktor (Kotlin): A framework for building asynchronous servers and clients in connected systems.

- Kotlin coroutines: A powerful tool for writing asynchronous code in a sequential style.

- HTML parsing libraries: Utilized to extract data from the HTML responses.

- Docker: A containerization platform used for packaging the application and its dependencies into a standardized unit for easy deployment and scalability.

## Author

- [Deadbeatnoble](https://github.com/deadbeatnoble)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/deadbeatnoble/scraperest

Awesome Lists containing this project

README