Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/computingvictor/thread_app_dataset
Analyses and models based on the "Thread app dataset: 37000 entities" Kaggle dataset
data-science dataset eda kaggle
- Host: GitHub
- URL: https://github.com/computingvictor/thread_app_dataset
- Owner: ComputingVictor
- License: apache-2.0
- Created: 2023-08-21T08:58:41.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-09-18T18:57:29.000Z (about 1 year ago)
- Last Synced: 2023-09-19T00:33:00.528Z (about 1 year ago)
- Topics: data-science, dataset, eda, kaggle
- Language: Jupyter Notebook
- Homepage:
- Size: 1.03 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Thread App Dataset Analysis
![image](https://github.com/ComputingVictor/Thread_app_dataset/assets/115224707/77df9d78-2f1e-4911-8e05-81499a8cfd50)
## Description
This repository contains analyses and models based on the "Thread app dataset: 37000 entities" Kaggle dataset, which comprises over 37,000 reviews of the new Thread mobile application collected from the Google Play Store and Apple App Store.
## Repository Contents
- 1_ExploratoryDataAnalysis.ipynb: Exploratory Data Analysis (EDA) of the dataset, including visualizations and descriptive statistics.
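As a rough illustration of the kind of descriptive statistics an EDA pass might compute, here is a minimal sketch using only the Python standard library. It is not taken from the notebook: the inline sample rows and the column names (`source`, `review_description`, `rating`) are assumptions made for the example and may not match the actual Kaggle CSV.

```python
import csv
import io
from collections import Counter
from statistics import mean

# Hypothetical sample standing in for the Kaggle CSV; the column names
# ("source", "review_description", "rating") are assumptions.
SAMPLE_CSV = """source,review_description,rating
Google Play,Great app and easy to use,5
Google Play,Keeps crashing on startup,1
App Store,Decent but missing features,3
App Store,Love the clean design,5
Google Play,Too many bugs,2
"""


def load_reviews(handle):
    """Read review rows from a CSV handle and convert ratings to int."""
    rows = []
    for row in csv.DictReader(handle):
        row["rating"] = int(row["rating"])
        rows.append(row)
    return rows


def describe(rows):
    """Return basic descriptive statistics for a list of review rows."""
    ratings = [r["rating"] for r in rows]
    return {
        "count": len(rows),
        "mean_rating": mean(ratings),
        "rating_counts": Counter(ratings),
        "reviews_per_source": Counter(r["source"] for r in rows),
        "mean_review_words": mean(
            len(r["review_description"].split()) for r in rows
        ),
    }


reviews = load_reviews(io.StringIO(SAMPLE_CSV))
summary = describe(reviews)
print(summary)
```

To run the same summary on a real file, `load_reviews` could be given an open file handle instead of the in-memory sample, after adjusting the column names to whatever the downloaded CSV actually uses.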
## Upcoming Work
- NLP Transformation: Transformation and preprocessing of the review text is planned, in order to extract features and prepare the data for modeling.
- Ratings Predictive Analysis: In the upcoming weeks, Machine Learning models, RNNs, and potentially LLMs will be implemented to predict user ratings based on the content of their reviews.
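To make the planned pipeline concrete, the sketch below shows one possible baseline for predicting a rating from review text: simple tokenization followed by a multinomial naive Bayes classifier with add-one smoothing, written with the standard library only. This is a stand-in technique chosen for brevity, not the RNN or LLM models mentioned above, and the class name `NaiveBayesRatings` and the training reviews are invented for illustration.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


class NaiveBayesRatings:
    """Multinomial naive Bayes with add-one smoothing over word counts."""

    def fit(self, texts, ratings):
        self.word_counts = defaultdict(Counter)  # rating -> word frequencies
        self.class_counts = Counter(ratings)     # rating -> number of reviews
        self.vocab = set()
        for text, rating in zip(texts, ratings):
            tokens = tokenize(text)
            self.word_counts[rating].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        best_rating, best_score = None, -math.inf
        for rating, n in self.class_counts.items():
            counts = self.word_counts[rating]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n / total)  # log prior of this rating
            for token in tokenize(text):
                if token in self.vocab:  # ignore words never seen in training
                    score += math.log((counts[token] + 1) / denom)
            if score > best_score:
                best_rating, best_score = rating, score
        return best_rating


# Made-up training reviews, for illustration only.
train_texts = [
    "love this app, works great",
    "great design and easy to use",
    "keeps crashing, terrible experience",
    "terrible bugs, app keeps freezing",
]
train_ratings = [5, 5, 1, 1]

model = NaiveBayesRatings().fit(train_texts, train_ratings)
print(model.predict("great app, love it"))     # → 5
print(model.predict("crashing and terrible"))  # → 1
```

A baseline like this mainly gives later models a reference point to beat; on the real dataset it would need a held-out test split and a proper evaluation metric before any conclusions could be drawn.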
## Data Source
The original dataset can be found on Kaggle: [Thread app dataset: 37000 entities](https://www.kaggle.com/datasets/shuvammandal121/37000-reviews-of-thread-app-dataset).
## Contributions
Contributions are welcome! If you have suggestions or wish to collaborate on upcoming phases of the project, feel free to open an issue or submit a pull request.
## License
This project is licensed under the Apache License 2.0. Refer to the LICENSE file for details.