Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/alimghmi/crypto-news-etl

A simple ETL data pipeline using python and sqlite3
https://github.com/alimghmi/crypto-news-etl

beautifulsoup crawling etl-pipeline python scraper sqlite3

Last synced: 9 days ago
JSON representation

A simple ETL data pipeline using python and sqlite3

Awesome Lists containing this project

README

        

# crypto-news-etl

## Description
An attempt, of course simple, to simulate workflow of an ETL data pipline. Extracting news from [cryptonews.com](https://cryptonews.com/), transforming fetched content and loading to sqlite3 database and S3 Bucket.

## Installing

#### Use docker-compose:
```
git clone https://github.com/alimghmi/crypto-news-etl.git
cd crypto-news-etl
docker-compose up --build
```
Current working directory is mounted to container thus logs and sqlite3 database is accessible locally.

#### Or run locally:
```
git clone https://github.com/alimghmi/crypto-news-etl.git
cd crypto-news-etl
pip3 install -r requirements.txt
python3 app.py
```