https://github.com/vinhlh/frontendmasters-crawler

A demo of a serverless crawler built on AWS Lambda (scheduled tasks) and store results in S3
https://github.com/vinhlh/frontendmasters-crawler

aws crawler lambda s3 serverless

Last synced: 11 months ago
JSON representation

A demo of a serverless crawler built on AWS Lambda (scheduled tasks) and store results in S3

Host: GitHub
URL: https://github.com/vinhlh/frontendmasters-crawler
Owner: vinhlh
License: mit
Created: 2018-05-09T17:12:29.000Z (about 8 years ago)
Default Branch: master
Last Pushed: 2022-06-25T04:19:07.000Z (about 4 years ago)
Last Synced: 2025-06-22T03:35:10.720Z (about 1 year ago)
Topics: aws, crawler, lambda, s3, serverless
Language: JavaScript
Homepage:
Size: 274 KB
Stars: 10
Watchers: 1
Forks: 0
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # FrontendMasters Crawler

A serverless crawler built on AWS Lambda (scheduled tasks) and store results in S3.







## Usage

- Setup your infrastructure on AWS by `make apply`.

- Make your own event by copying `event.example.json` to `event.json` in `functions/fm_crawler/` and updating its content to yours.

- Install dependencies by running `yarn` in `functions/fm_crawler/`.

- Sync course list into S3 by `make upload`. This file works as a dead simple storage for storing course list, download state.

- Deploy by `apex deploy`.

- Invoke lambda function by `make run-lambda`.

## Development

In order to execute the crawler in dev environment, running `make run`.

## Player

You can use [fm-player](https://github.com/vinhlh/fm-player) to play the content stored in S3.

## Note

This project is just a demo for using serverless service. We encourage you guys to buy Frontend Masters's subscription instead.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/vinhlh/frontendmasters-crawler

Awesome Lists containing this project

README