https://github.com/mtpatter/bilao
Jupyter notebooks for filtering Kafka data with Spark Streaming.
avro docker jupyter-notebook kafka spark spark-streaming
- Host: GitHub
- URL: https://github.com/mtpatter/bilao
- Owner: mtpatter
- Created: 2017-04-24T23:02:30.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2021-11-15T20:51:49.000Z (about 3 years ago)
- Last Synced: 2024-11-13T04:52:34.738Z (3 months ago)
- Topics: avro, docker, jupyter-notebook, kafka, spark, spark-streaming
- Language: HTML
- Homepage: https://mtpatter.github.io/bilao/
- Size: 123 KB
- Stars: 2
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
[bilao](https://github.com/mtpatter/bilao)
============

Serving up winnowed data.

Usage
-------------------

```
docker build . -t "bilao"
```

Some notebooks assume they are used with the [LSST DM alert_stream testbed](https://github.com/lsst-dm/alert_stream) and attach to that testbed's running Docker network, named "alertstream_default", as in the configuration below.

Follow the instructions to access the Notebook server after running:
```
docker run -it \
--rm \
-v "$PWD":/home/jovyan/work:rw \
-p 8888:8888 -p 4040:4040 \
--network=alertstream_default \
-P \
bilao
```

Examples
-------------------

### Spark Streaming
* [Streaming from Kafka](https://mtpatter.github.io/bilao/notebooks/html/01-spark-streaming-kafka.html) ([ipynb](https://github.com/mtpatter/bilao/blob/master/notebooks/01-spark-streaming-kafka.ipynb))
* [Streaming from Kafka using Avro](https://mtpatter.github.io/bilao/notebooks/html/01-spark-streaming-kafka-avro.html) ([ipynb](https://github.com/mtpatter/bilao/blob/master/notebooks/01-spark-streaming-kafka-avro.ipynb))
### Spark Structured Streaming
* [Structured Streaming from Kafka](https://mtpatter.github.io/bilao/notebooks/html/01-spark-struct-stream-kafka.html) ([ipynb](https://github.com/mtpatter/bilao/blob/master/notebooks/01-spark-struct-stream-kafka.ipynb))
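
As a rough illustration of what filtering a Kafka stream with Structured Streaming can look like, here is a minimal PySpark sketch. It is not taken from the notebooks. The helper names `kafka_source_options` and `filtered_stream` are hypothetical, and the sketch assumes plain-text message values (Avro payloads would need decoding first) and that the `spark-sql-kafka-0-10` connector is available to the Spark session.

```python
from typing import Dict


def kafka_source_options(bootstrap_servers: str, topic: str) -> Dict[str, str]:
    """Build options for Spark's built-in Kafka source.

    The broker address and topic are supplied by the caller; nothing here
    is specific to the alert_stream testbed.
    """
    return {
        "kafka.bootstrap.servers": bootstrap_servers,
        "subscribe": topic,
        # Read the topic from the beginning rather than only new messages.
        "startingOffsets": "earliest",
    }


def filtered_stream(spark, options: Dict[str, str], needle: str):
    """Return a streaming DataFrame of messages whose value contains `needle`.

    `spark` is an existing SparkSession. Assumes message values are UTF-8 text.
    """
    # Imported lazily so kafka_source_options works without Spark installed.
    from pyspark.sql import functions as F

    raw = spark.readStream.format("kafka").options(**options).load()
    # Kafka delivers key and value as binary, so cast the value to a string.
    decoded = raw.select(F.col("value").cast("string").alias("value"))
    return decoded.filter(F.col("value").contains(needle))
```

In a notebook with a running `spark` session, something like `filtered_stream(spark, kafka_source_options("kafka:9092", "my-stream"), "alert").writeStream.format("console").start()` would print matching messages; the broker address, topic name, and search string there are placeholders, not values from this project.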