https://github.com/hrialan/m2ds-data-stream-project
Markets stocks prediction using Kafka and online regression tools
https://github.com/hrialan/m2ds-data-stream-project
data-stream-processing kafka machine-learning online-learning
Last synced: 3 months ago
JSON representation
Markets stocks prediction using Kafka and online regression tools
- Host: GitHub
- URL: https://github.com/hrialan/m2ds-data-stream-project
- Owner: hrialan
- Created: 2022-03-10T07:33:41.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-03-31T15:36:25.000Z (about 4 years ago)
- Last Synced: 2025-01-21T22:43:14.463Z (over 1 year ago)
- Topics: data-stream-processing, kafka, machine-learning, online-learning
- Language: HTML
- Homepage:
- Size: 8.54 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data streams processing project
## Real-time streaming application with Kafka

Authors : __Cyril Nérin - Hugo Rialan - Alexandre Perbet__
---
### Project Description
The objective of the project is to predict markets stocks using online regression tools
The stock exchange rates used are as follows : Google, Facebook, Amazon, Total, Gazprom, Alibaba, BNP Paribas, Ferrari
We first developed a batch regression solution. Then, we developed an online solution using river and Kafka.
---
### Project Architecture
The project was carried out on Jupyter notebooks to improve the visibility of the results obtained.
3 notebooks are available :
- The **first** one contains the data ingestion and an implementation of the ARIMA algorithm for the batch part.
- The **second** one contains the online learning prediction algorithm implemented with River.
- The **third** one allows to display and plot the obtained results.
---
### Results
- [BP](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-BP__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Alibaba](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-alibaba__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Amazon](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-amazon__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [BNP Paribas](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-bnp_paribas__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Facebook](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-facebook__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Ferrari](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-ferrari__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Gazprom](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-gazprom__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Google](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-google__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
- [Total](https://hrialan.github.io/m2ds-data-stream-project/HTML/3-total__NERIN_PERBET_RIALAN__Datastream_sujet_5__Plot.html)
---
### Requirements
It is assumed that Zookeeper is running default on localhost:2181 and Kafka on localhost:9092 before running the notebooks.