https://github.com/finki-hub/finki-scraper

Scripts for scraping various FINKI services, and providing them by a Discord webhook and a REST API

Host: GitHub
URL: https://github.com/finki-hub/finki-scraper
Owner: finki-hub
License: mit
Created: 2022-09-07T00:56:30.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2025-05-05T13:05:22.000Z (19 days ago)
Last Synced: 2025-05-05T13:41:12.796Z (18 days ago)
Topics: fcse, finki, ukim
Language: TypeScript
Homepage:
Size: 1.73 MB
Stars: 3
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

# FINKI Scraper

Tooling for scraping and providing publicly available data from FCSE services. The data is provided using a REST API or webhooks. Requires Node.js >= 20.

## Architecture

The scrapers are implemented as classes (called strategies), each of which contains the selectors and methods for fetching the data from each container (post, announcement, etc.). Adding a new service requires creating a new strategy and linking it. See [the example strategy](./src/strategies/ExampleStrategy.ts) for more info.
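
To make the strategy idea concrete, here is a minimal sketch of what such a class could look like. Everything in it is an assumption made for illustration: the `Strategy` interface, the `AnnouncementsStrategy` class, the selectors, the `strategies` registry and the `fakeContainer` helper are hypothetical names, not the repository's actual API. Refer to the example strategy in the repository for the real contract.

```typescript
// Minimal structural stand-in for a DOM element, so the sketch runs without a DOM library.
interface ElementLike {
  textContent: string | null;
  querySelector(selector: string): ElementLike | null;
}

// Hypothetical strategy contract: one selector for the containers, plus per-field extractors.
interface Strategy {
  containerSelector: string;
  getId(container: ElementLike): string | null;
  getTitle(container: ElementLike): string | null;
}

// Hypothetical strategy for an announcements page; all selectors are invented.
class AnnouncementsStrategy implements Strategy {
  containerSelector = "div.announcement";
  private idSelector = "a.permalink";
  private titleSelector = "h2.title";

  getId(container: ElementLike): string | null {
    return this.text(container, this.idSelector);
  }

  getTitle(container: ElementLike): string | null {
    return this.text(container, this.titleSelector);
  }

  // Returns the trimmed text of the first match, or null when nothing matches.
  private text(container: ElementLike, selector: string): string | null {
    const text = container.querySelector(selector)?.textContent;
    return text == null ? null : text.trim();
  }
}

// "Linking" a strategy here means registering it under the scraper's name (an assumption).
const strategies = new Map<string, () => Strategy>([
  ["announcements", () => new AnnouncementsStrategy()],
]);

function createStrategy(name: string): Strategy {
  const factory = strategies.get(name);
  if (factory === undefined) {
    throw new Error(`Unknown strategy: ${name}`);
  }
  return factory();
}

// Fake container for trying a strategy out: maps selectors to text content.
function fakeContainer(fields: Record<string, string>): ElementLike {
  return {
    textContent: null,
    querySelector(selector: string): ElementLike | null {
      const value = fields[selector];
      return value === undefined
        ? null
        : { textContent: value, querySelector: () => null };
    },
  };
}
```

Keeping the selectors as data on the class and the extraction logic in small methods means a new service mostly needs new selectors, which fits the "create a new strategy and link it" workflow described above.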

## Quick Setup (Production)

To run the scraper:

1. Clone the repository: `git clone https://github.com/finki-hub/finki-scraper.git`
2. Prepare configuration by copying `config/config.sample.json` to `config/config.json`
3. Install dependencies: `npm i`
4. Run the scraper: `npm run start`

It's also available as a Docker image:

```sh
docker run -d \
--name finki-scraper \
--restart unless-stopped \
-v ./cache:/app/cache \
-v ./config:/app/config \
-v ./logs:/app/logs \
ghcr.io/finki-hub/finki-scraper:latest
```

Or Docker Compose: `docker compose up -d`

You can select which scrapers to run declaratively (in the configuration with the `enabled` flag) or imperatively: `npm run start scraper_1 scraper_2 ... scraper_n`
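
The two selection modes can be sketched as a single function. This is an illustration under stated assumptions, not the project's implementation: the `Config` shape, the `selectScrapers` name, the rule that names given on the command line take precedence over the `enabled` flag, and the error on unknown names are all hypothetical.

```typescript
// Hypothetical configuration shape: scraper names mapped to their settings.
type ScraperConfig = { enabled?: boolean };
type Config = { scrapers: Record<string, ScraperConfig> };

// Returns the names of the scrapers to run.
// Assumption: names passed on the command line override the `enabled` flags.
function selectScrapers(config: Config, cliNames: string[]): string[] {
  const known = Object.keys(config.scrapers);

  if (cliNames.length > 0) {
    // Imperative selection: run exactly what was asked for, rejecting unknown names.
    const unknown = cliNames.filter((name) => !known.includes(name));
    if (unknown.length > 0) {
      throw new Error(`Unknown scrapers: ${unknown.join(", ")}`);
    }
    return cliNames;
  }

  // Declarative selection: run only the scrapers explicitly marked as enabled.
  return known.filter((name) => config.scrapers[name]?.enabled === true);
}
```

In a Node.js entry point, the second argument would typically come from `process.argv.slice(2)`.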

## Quick Setup (Development)

1. Clone the repository: `git clone https://github.com/finki-hub/finki-scraper.git`
2. Install dependencies (and pre-commit hooks): `npm i`
3. Prepare configuration: `cp config/config.sample.json config/config.json`
4. Build the project: `npm run build`
5. Run it: `npm run start`

## Configuration

There is an example configuration file available at [`config/config.sample.json`](./config/config.sample.json). Copy it to `config/config.json` and edit it to your liking.

## Server Mode

If you would like to consume the data from a REST API, run the app in server mode: `npm run serve`. The data will be scraped on each API call instead of periodically.

- `GET /list` - get all active scrapers
- `GET /get/` followed by a scraper name - get data from that scraper
- `DELETE /delete` - delete the cache of all scrapers
- `DELETE /delete/` followed by a scraper name - delete the cache of that scraper

The scraper name is the one specified in the configuration.
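
A client can wrap the endpoints listed above in a few small helpers. The sketch below only builds the method and URL for each call. The helper names, the `ApiRequest` type and the base URL used in the usage note are assumptions, since this document does not state the server's host, port or response format.

```typescript
// Method and URL for one API call; the response format is not assumed here.
type ApiRequest = { method: "GET" | "DELETE"; url: string };

// GET /list: all active scrapers.
function listScrapers(baseUrl: string): ApiRequest {
  return { method: "GET", url: `${baseUrl}/list` };
}

// GET /get/ followed by the scraper name: data from one scraper.
function getScraperData(baseUrl: string, name: string): ApiRequest {
  return { method: "GET", url: `${baseUrl}/get/${encodeURIComponent(name)}` };
}

// DELETE /delete clears every cache; with a name, only that scraper's cache.
function deleteCache(baseUrl: string, name?: string): ApiRequest {
  const url =
    name === undefined
      ? `${baseUrl}/delete`
      : `${baseUrl}/delete/${encodeURIComponent(name)}`;
  return { method: "DELETE", url };
}
```

Each request can then be sent with the `fetch` built into Node.js 18 and later, for example `fetch(request.url, { method: request.method })`, where `request` might come from `getScraperData("http://localhost:3000", "announcements")`. Both the base URL and the scraper name in that call are placeholders.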

## License

This project is licensed under the terms of the MIT license.
