Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/andrefs/derzis
A path-aware distributed linked data crawler
https://github.com/andrefs/derzis
crawler linked-data
Last synced: 6 days ago
JSON representation
A path-aware distributed linked data crawler
- Host: GitHub
- URL: https://github.com/andrefs/derzis
- Owner: andrefs
- Created: 2021-03-11T11:51:48.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-11-07T23:03:55.000Z (about 1 year ago)
- Last Synced: 2024-04-14T20:18:35.991Z (7 months ago)
- Topics: crawler, linked-data
- Language: TypeScript
- Homepage:
- Size: 78.3 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# derzis [![Node.js CI](https://github.com/andrefs/derzis/actions/workflows/node.js.yml/badge.svg?branch=testing)](https://github.com/andrefs/derzis/actions/workflows/node.js.yml)
A path-aware distributed linked data crawler.
## Run
The easiest way of deploying Derzis is using Docker, but it can also
be executed without using containers.### Docker
#### Dependencies
- docker
- docker-compose#### Running
- Edit `src/config/index.ts` to modify the crawler parameters.
- Run `docker-compose up --build`.
- The web app will be accessible at http://localhost:5432/processes### Locally
#### Install dependencies
- MongoDB
- Redis
- Node.js v20
- Run `npm install` on the root folder#### Run in development mode
- Open a terminal and run `npm run worker:dev`.
- Open another terminal and run `npm run manager:dev`.
- The web app will be accessible at http://localhost:5432/processes#### Build and run
- Run `npm run build` on the root folder
- Make sure you have Redis and MongoDB running.
- Edit `dist/src/config/index.js` to modify the crawler parameters.
- Open a terminal and run `node ./dist/src/manager/bin/manager.js`.
- Open another terminal and run `./dist/src/worker/bin/worker-pool`.
- The web app will be accessible at http://localhost:5432/processes