Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/computingvictor/thread_app_dataset

Analyses and models based on the "Thread app dataset: 37000 entities" Kaggle dataset
https://github.com/computingvictor/thread_app_dataset

data-science dataset eda kaggle

Last synced: 1 day ago
JSON representation

Analyses and models based on the "Thread app dataset: 37000 entities" Kaggle dataset

Awesome Lists containing this project

README

        

# Thread App Dataset Analysis

![image](https://github.com/ComputingVictor/Thread_app_dataset/assets/115224707/77df9d78-2f1e-4911-8e05-81499a8cfd50)

## Description

This repository contains analyses and models based on the "Thread app dataset: 37000 entities," encompassing over 37,000 reviews from the Google Play Store and Apple App Store about the New Thread mobile application.

## Repository Contents
- 1_ExploratoryDataAnalysis.ipynb: Exploratory Data Analysis (EDA) of the dataset, including visualizations and descriptive statistics.
Upcoming Work
- NLP Transformation: Planned transformation and preprocessing of the text contained within the reviews are underway to extract features and prepare the data for modeling.

- Ratings Predictive Analysis: In the upcoming weeks, Machine Learning models, RNNs, and potentially LLM will be implemented to predict user ratings based on the content of their reviews.

## Data Source
The original dataset can be found on Kaggle: [Thread app dataset: 37000 entities](https://www.kaggle.com/datasets/shuvammandal121/37000-reviews-of-thread-app-dataset).

## Contributions
Contributions are welcome! If you have suggestions or wish to collaborate on upcoming phases of the project, feel free to open an issue or submit a pull request.

## License
This project is licensed under the Apache License 2.0. Refer to the LICENSE.md file for details.