An open API service indexing awesome lists of open source software.

https://github.com/amatofrancesco99/dsbda2022

The goal of this project is to study the Steam platform gaming community behavior using both standard python, and Big Data tools.
https://github.com/amatofrancesco99/dsbda2022

big-data data-science steam-community

Last synced: 3 months ago
JSON representation

The goal of this project is to study the Steam platform gaming community behavior using both standard python, and Big Data tools.

Awesome Lists containing this project

README

        

# Data Science & Big Data Analytics (dsbda) project - 2022 (UniPV)

## Objective
The goal of the project is to study the **Steam platform gaming community** using both standard python, and Big Data tools.

More specifically, considering the users' reviews the objective is to understand, with the statement of some hypotheses, relevant correlation about the *players behavior* (e.g., identify factors such that a player like a video game, if the user is loyal to the same genre, ...).

## Procedure
The stated hypotheses have firstly been studied using a small portion of the overall Big Data source (obtained using snippet of code written with Big Data tools).

Once seen if the selected hypotheses are statistically significant using the small portion of data, the same hypotheses were replicated on the whole data source (using Spark/HDFS/...) and checked if still relevant.

## Datasets
The used datasets (Kaggle) are the following ones:
- *Reviews data*: https://www.kaggle.com/datasets/forgemaster/steam-reviews-dataset
- *Games data*: https://www.kaggle.com/datasets/nikdavis/steam-store-games