Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dailydotdev/daily-scraper
Fetches information about every webpage 🤖
https://github.com/dailydotdev/daily-scraper
Last synced: 3 months ago
JSON representation
Fetches information about every webpage 🤖
- Host: GitHub
- URL: https://github.com/dailydotdev/daily-scraper
- Owner: dailydotdev
- License: agpl-3.0
- Created: 2020-06-05T11:04:59.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2024-07-10T14:17:10.000Z (4 months ago)
- Last Synced: 2024-07-10T17:03:52.280Z (4 months ago)
- Language: HTML
- Homepage:
- Size: 1.51 MB
- Stars: 101
- Watchers: 4
- Forks: 24
- Open Issues: 17
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Codeowners: CODEOWNERS
Awesome Lists containing this project
README
Daily Scraper
Fetches information about every webpage 🤖
The service uses [Puppeteer](https://github.com/puppeteer/puppeteer), a headless Chrome, to scrape webpages.
Currently it's only purpose is to provide information when a user suggests a new source.
The scraper can find the icon, rss feed, name, and other relevant information for every page.## Stack
* Node v16.20.0 (a `.nvmrc` is presented for [nvm](https://github.com/nvm-sh/nvm) users).
* NPM for managing dependencies.
* Fastify as the web framework## Project structure
* `__tests__` - There you can find all the tests and fixtures. Tests are written using `jest`.
* `helm` - The home of the service helm chart for easily deploying it to kubernetes.
* `src` - This is obviously the place where you can find the source files.
* `scrape` - Stores many utility functions to scrape information from a webpage.## Local environment
Daily Scraper requires nothing to run. It doesn't need any database or a service.
[.env](.env) is used to set the required environment variables. It is loaded automatically by the project.
Finally run `npm run dev` to run the service and listen to port `5001`.
## Want to Help?
So you want to contribute to Daily Scraper and make an impact, we are glad to hear it. :heart_eyes:
Before you proceed we have a few guidelines for contribution that will make everything much easier.
We would appreciate if you dedicate the time and read them carefully:
https://github.com/dailydotdev/.github/blob/master/CONTRIBUTING.md