https://github.com/kayannr/sportstats

Historical data analysis using SQL, Databricks, Python, PandaSQL, Pandas, and SQL window functions.

pandasql scala spark-sql sql

# sportstats
This project analyzes a historical Olympics dataset using SQL.

Tasks performed:

* Data collection and aggregation
* Data cleaning and deduplication
* Data quality and validity assessment
* Exploratory data analysis using SQL
* Hypothesis development and testing using Python and Pandas
* Recommendation development based on results obtained
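
As a rough illustration of the deduplication and window-function steps listed above, here is a minimal sketch. It is not the project's actual code: the `results` table, its columns, the sample rows, and the `medal_totals` helper are all hypothetical, and it uses Python's built-in `sqlite3` module as a stand-in for PandaSQL or Databricks SQL. It assumes the bundled SQLite is version 3.25 or newer, which is required for window functions.

```python
import sqlite3

# Hypothetical sample rows; the column names are assumptions, not the project's schema.
ROWS = [
    ("Alice", "USA", 2012, "Gold"),
    ("Alice", "USA", 2012, "Gold"),  # exact duplicate, removed by SELECT DISTINCT
    ("Bob", "USA", 2012, "Silver"),
    ("Chen", "CHN", 2012, "Gold"),
    ("Alice", "USA", 2016, "Gold"),
    ("Dana", "CHN", 2016, "Bronze"),
    ("Chen", "CHN", 2016, "Gold"),
]


def medal_totals(rows):
    """Return (country, year, medals, running_total) tuples for the given rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE results (athlete TEXT, country TEXT, year INTEGER, medal TEXT)"
    )
    conn.executemany("INSERT INTO results VALUES (?, ?, ?, ?)", rows)
    query = """
        WITH deduped AS (
            -- Data cleaning: drop exact duplicate rows
            SELECT DISTINCT athlete, country, year, medal FROM results
        ),
        per_games AS (
            -- Aggregation: medals per country per Games
            SELECT country, year, COUNT(*) AS medals
            FROM deduped
            GROUP BY country, year
        )
        -- Window function: cumulative medal count per country over time
        SELECT country, year, medals,
               SUM(medals) OVER (PARTITION BY country ORDER BY year) AS running_total
        FROM per_games
        ORDER BY country, year
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


if __name__ == "__main__":
    for row in medal_totals(ROWS):
        print(row)
```

The same query text should run largely unchanged through PandaSQL against a Pandas DataFrame, since PandaSQL uses SQLite by default. Spark SQL on Databricks supports the same `SUM(...) OVER (PARTITION BY ... ORDER BY ...)` form.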