https://github.com/mneedham/real-time-analytics-book
- Host: GitHub
- URL: https://github.com/mneedham/real-time-analytics-book
- Owner: mneedham
- Created: 2022-08-16T14:05:43.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-11-03T07:00:00.000Z (about 1 year ago)
- Last Synced: 2024-04-14T09:10:05.909Z (9 months ago)
- Language: Java
- Size: 2.78 MB
- Stars: 24
- Watchers: 5
- Forks: 8
- Open Issues: 1
Metadata Files:
- Readme: README.md
# Real-Time Analytics Book Code Repository
The code for Dunith Dhanushka and Mark Needham's upcoming book on Real-Time Analytics.
*Put the architecture diagram here*
## Install components
```bash
docker-compose \
-f docker-compose-base.yml \
-f docker-compose-pinot.yml \
-f docker-compose-dashboard-enriched.yml \
-f docker-compose-dashboard.yml up
```

If you're on an Apple silicon (M1) Mac, use the `docker-compose-pinot-m1.yml` variant:

```bash
docker-compose \
-f docker-compose-base.yml \
-f docker-compose-pinot-m1.yml \
-f docker-compose-dashboard.yml \
-f docker-compose-enrich-streams.yml \
-f docker-compose-dashboard-enriched.yml \
up
```

Once that's run, you can navigate to the following:

* Pinot UI - http://localhost:9000
* Streamlit Dashboard - http://localhost:8501

## Add enriched table

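Before adding the table, it can help to confirm that the services listed above are actually responding, since the containers may take a while to start. The following is a minimal sketch, not part of the repository: the function names `http_probe` and `wait_until_up`, the retry counts, and the "any status below 500 counts as up" rule are all assumptions made for illustration. Only the two URLs come from this README.

```python
import http.client
import time
import urllib.error
import urllib.request

# URLs taken from the list above.
SERVICES = {
    "Pinot UI": "http://localhost:9000",
    "Streamlit Dashboard": "http://localhost:8501",
}


def http_probe(url, timeout=2.0):
    """Return True if the URL answers with an HTTP status below 500 (assumed criterion)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status < 500
    except urllib.error.HTTPError as err:
        # The server answered, just not with a 2xx status.
        return err.code < 500
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, DNS failure, malformed response, and similar.
        return False


def wait_until_up(url, attempts=30, delay=2.0, probe=http_probe, sleep=time.sleep):
    """Poll url until probe succeeds; return False if all attempts fail."""
    for attempt in range(attempts):
        if probe(url):
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False


if __name__ == "__main__":
    for name, url in SERVICES.items():
        status = "up" if wait_until_up(url) else "not responding"
        print(f"{name} ({url}): {status}")
```

The `probe` and `sleep` parameters are injectable so the polling logic can be exercised without a running stack.
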
```bash
docker run -v "$PWD/pinot/config:/config" \
--network rta \
apachepinot/pinot:0.11.0 \
AddTable -schemaFile /config/orders_enriched/schema.json \
-tableConfigFile /config/orders_enriched/table.json \
-controllerHost pinot-controller -exec
```

## (Optional) Downloading products

Set up a Python environment:

```bash
python -m venv venv
source venv/bin/activate
```

Install dependencies:

```bash
pip install -r scripts/requirements.txt
```

Download the product pages:

```bash
python scripts/download_products.py
```

Scrape product data:

```bash
python scripts/scrape_products.py
```

The products will be written to [simulator/data/products.json](simulator/data/products.json).
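
As a quick sanity check after scraping, the output file can be loaded and summarized. This is a hedged sketch: the README does not describe the structure of `products.json`, so the code only assumes the file is valid JSON and reports its top-level type and size. The helper name `summarize_json` is hypothetical.

```python
import json
from pathlib import Path


def summarize_json(path):
    """Load a JSON file and return (top-level type name, number of entries)."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    # Lists and dicts have a natural size; treat any other value as one entry.
    count = len(data) if isinstance(data, (list, dict)) else 1
    return type(data).__name__, count


if __name__ == "__main__":
    # Path taken from the README; run this from the repository root.
    products_path = Path("simulator/data/products.json")
    if products_path.exists():
        kind, count = summarize_json(products_path)
        print(f"{products_path}: top-level {kind} with {count} entries")
    else:
        print(f"{products_path} not found; run the scrape step first")
```
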