https://github.com/isabeljohnson001/twitter_tweets_data_streaming
- Host: GitHub
- URL: https://github.com/isabeljohnson001/twitter_tweets_data_streaming
- Owner: isabeljohnson001
- Created: 2024-07-06T20:38:08.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-07-06T21:46:28.000Z (almost 2 years ago)
- Last Synced: 2025-03-02T05:23:36.975Z (over 1 year ago)
- Topics: docker, flask, kafka, mongodb, python, reactjs, spark
- Language: Python
- Homepage:
- Size: 74.3 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Twitter Tweets Data Streaming
## Overview
This repository contains a data analysis project for the Britney Spears tweet dataset. It includes scripts that process and analyze two TSV files (~50 MB and ~500 MB) to extract insights about the public discourse around Britney Spears. The analysis showcases data handling, processing capabilities, and exploratory data analysis techniques as part of the application process for the Data Engineer position at IDI.
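As a rough illustration of handling TSV files of this size without loading them fully into memory, the sketch below reads a file in fixed-size batches using only the standard library. It is not taken from the repository: the function names `iter_tsv_batches` and `count_rows`, the batch size, and the assumption that the file starts with a header row are all hypothetical.

```python
import csv
from itertools import islice
from typing import Dict, Iterator, List


def iter_tsv_batches(path: str, batch_size: int = 10_000) -> Iterator[List[Dict[str, str]]]:
    """Yield lists of rows (dicts keyed by the header row) from a TSV file.

    Assumption: the first line of the file is a header row.
    """
    with open(path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE is an assumption: free-text fields such as tweets may
        # contain unbalanced quote characters that should not be interpreted.
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        while True:
            # Pull at most batch_size rows so memory use stays bounded.
            batch = list(islice(reader, batch_size))
            if not batch:
                break
            yield batch


def count_rows(path: str, batch_size: int = 10_000) -> int:
    """Count data rows (excluding the header) by streaming the file in batches."""
    return sum(len(batch) for batch in iter_tsv_batches(path, batch_size))
```

Calling `count_rows("path/to/file.tsv")` with a real path streams the file once; the same generator could feed any per-batch aggregation in place of the row count.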
## Repository Structure
```
Twitter_Streaming
├── src # Contains source code files for the project
├── app # Contains application logic and server-side scripts
├── config # Holds configuration files for the application
├── datasets # Directory for storing data files
├── jobs # Contains scripts for batch jobs and data processing tasks
├── docker-compose.yml # Docker Compose file for defining and running multi-container Docker applications
├── Dockerfile # Dockerfile for building Docker images
├── requirements # Text file listing dependencies to be installed with pip
└── ui-screen # Holds files for the user interface screen
```
## Getting Started
### Prerequisites
1. Ensure you have Python 3.8+ and Docker installed on your machine. You will also need pip to install the dependencies.
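The prerequisites above can be verified with a short script. This is a minimal sketch, not part of the repository: the helper names `python_ok` and `check_prerequisites` are hypothetical, and the check only confirms that `docker` and `pip` (or `pip3`) executables are on the `PATH`, not that the Docker daemon is running.

```python
import shutil
import sys
from typing import Dict, Tuple

MIN_PYTHON = (3, 8)  # minimum version stated in the prerequisites


def python_ok(version: Tuple[int, int] = sys.version_info[:2]) -> bool:
    """Return True if the given (major, minor) version meets the minimum."""
    return tuple(version) >= MIN_PYTHON


def check_prerequisites() -> Dict[str, bool]:
    """Report whether each prerequisite appears to be available."""
    return {
        "python>=3.8": python_ok(),
        # shutil.which returns None when the executable is not on PATH.
        "docker": shutil.which("docker") is not None,
        "pip": shutil.which("pip") is not None or shutil.which("pip3") is not None,
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```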
### Installation
1. Read the  for detailed design and installation setup.