Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wazzabeee/pyspark-etl-twitter
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
https://github.com/wazzabeee/pyspark-etl-twitter
delta-lake docker etl etl-pipeline etl-process kafka kafka-consumer kafka-producer kafka-streams mongodb nlp pyspark python sentiment-analysis spark spark-streaming tweet-analysis tweet-classification twitter twitter-sentiment-analysis
Last synced: 27 days ago
JSON representation
Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta Lake
- Host: GitHub
- URL: https://github.com/wazzabeee/pyspark-etl-twitter
- Owner: Wazzabeee
- Created: 2023-01-11T21:13:55.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-05-06T13:50:52.000Z (over 1 year ago)
- Last Synced: 2023-05-14T17:15:47.745Z (over 1 year ago)
- Topics: delta-lake, docker, etl, etl-pipeline, etl-process, kafka, kafka-consumer, kafka-producer, kafka-streams, mongodb, nlp, pyspark, python, sentiment-analysis, spark, spark-streaming, tweet-analysis, tweet-classification, twitter, twitter-sentiment-analysis
- Language: Python
- Homepage: https://medium.com/towards-artificial-intelligence/real-time-sentiment-analysis-with-docker-kafka-and-spark-streaming-952c06549de1
- Size: 3.37 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0