https://github.com/francescocoding/data-vis-datasets

Just a list of datasets to make for an easier import to Google Colab

# CM4125 - Data Visualization Coursework Datasets

This repository hosts a collection of diverse datasets specifically chosen for the CM4125 - Data Visualization coursework.
These datasets range from horror games to solar eclipses to coffee consumption and are used to look for [spurious correlations](https://www.tylervigen.com/spurious-correlations) between seemingly unrelated sources.
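
As a rough illustration of what looking for a spurious correlation could involve, the sketch below aligns two yearly series on the years they share and computes Pearson's r. This README does not say which statistic the coursework uses, so Pearson's r, the function names `align_on_years` and `pearson_r`, and the toy numbers are all assumptions made for illustration; none of the values come from the datasets.

```python
import math

def align_on_years(series_a, series_b):
    """Return paired values for the years present in both {year: value} dicts."""
    shared = sorted(set(series_a) & set(series_b))
    return [series_a[y] for y in shared], [series_b[y] for y in shared]

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)

# Toy, made-up yearly values (NOT taken from the datasets), only to show the call shape.
series_a = {2011: 10, 2012: 12, 2013: 15, 2014: 19}
series_b = {2012: 2, 2013: 2, 2014: 3, 2015: 2}

xs, ys = align_on_years(series_a, series_b)
print(f"shared years: {len(xs)}, r = {pearson_r(xs, ys):.3f}")
```

A high r between two unrelated series, especially over a handful of shared years, is exactly the kind of result that should be read as coincidence and not as evidence of a relationship.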

## Datasets Overview

### 1. Horror Games Data (1972 - 2024)
- **Description**: A scraped list of horror games, used to explore trends in horror game development and popularity.
- **Scraped Source**: [Wikipedia List of Horror Games](https://en.wikipedia.org/wiki/List_of_horror_games)
- **Scraping Tool**: [Horror Games List Scraper](https://github.com/FrancescoCoding/Harvest-Time/blob/main/Horror-Games-List-Scraper.js)
- **Dataset**: [Horror Games Dataset](https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/Horror_games_list.json)
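
A minimal sketch of pulling the JSON file above into a notebook using only the standard library. The URL is the one from the Dataset link; the helper names, the UTF-8 decoding, and the handling of a single top-level object are assumptions, since the file's structure is not described here.

```python
import json
from urllib.request import urlopen

# URL copied from the Dataset link above.
HORROR_GAMES_URL = (
    "https://raw.githubusercontent.com/FrancescoCoding/"
    "Data-Vis-Datasets/main/Horror_games_list.json"
)

def fetch_text(url, timeout=30):
    """Download a URL and decode the body as UTF-8 text (encoding is assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")

def parse_records(json_text):
    """Parse JSON text; wrap a single top-level object in a list."""
    data = json.loads(json_text)
    return data if isinstance(data, list) else [data]

def load_horror_games():
    """Fetch and parse the horror games file (needs network access)."""
    return parse_records(fetch_text(HORROR_GAMES_URL))
```

In a Colab cell, `records = load_horror_games()` followed by a look at `records[0]` is a quick way to check which fields the scraper produced before plotting anything. If pandas is preferred, `pd.read_json(HORROR_GAMES_URL)` may work too, depending on how the file is structured.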

### 2 & 3. Solar Eclipses (1901 - 2000) & Solar Eclipses (2001 - 2100)
- **Description**: These datasets provide detailed information on solar eclipses from 1901 to 2100, ideal for chronological and astronomical analyses.
- **Common Source**: [NASA Five Millennium Catalog of Solar Eclipses](https://data.world/nasa/five-millennium-catalog-of-solar-eclipses-detailed)
- **Datasets**:
  - [Solar Eclipses 1901-2000](https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/1901-2000.csv)
  - [Solar Eclipses 2001-2100](https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/2001-2100.csv)
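
Because the eclipse data is split across two files, an analysis spanning 1901 to 2100 needs them stacked into one table. The sketch below shows one way to do that with the standard library. It assumes both CSVs share the same header row, which is not confirmed here, and the helper names are hypothetical.

```python
import csv
import io

BASE_URL = "https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/"
ECLIPSE_FILES = ["1901-2000.csv", "2001-2100.csv"]

def eclipse_urls():
    """Full raw URLs for the two eclipse files listed above."""
    return [BASE_URL + name for name in ECLIPSE_FILES]

def parse_csv(csv_text):
    """Return (header, rows), where rows are dicts keyed by the header names."""
    reader = csv.DictReader(io.StringIO(csv_text))
    rows = list(reader)
    return reader.fieldnames, rows

def combine(csv_texts):
    """Concatenate rows from several CSV texts, refusing mismatched headers."""
    header, combined = None, []
    for text in csv_texts:
        fields, rows = parse_csv(text)
        if header is None:
            header = fields
        elif fields != header:
            raise ValueError(f"header mismatch: {fields} != {header}")
        combined.extend(rows)
    return header, combined
```

With pandas available, as it is by default in Colab, the same idea is commonly written as `pd.concat([pd.read_csv(u) for u in eclipse_urls()], ignore_index=True)`. The explicit header check in `combine` is a design choice that fails loudly if the two files turn out to differ.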

### 4. Metacritic Game Ratings (2011 - 2019)
- **Description**: Comprising Metacritic scores for various games, this dataset is useful for analyzing game rating trends and industry reception.
- **Source**: [Metacritic Games Stats 2011-2019](https://www.kaggle.com/datasets/skateddu/metacritic-games-stats-20112019)
- **Dataset**: [Metacritic Game Ratings Dataset](https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/metacritic_games.csv)

### 5. Coffee Consumption Data (1990 - 2018)
- **Description**: This dataset records coffee consumption across various countries and can be used to examine global consumption trends.
- **Source**: [ICO Coffee Dataset Worldwide](https://www.kaggle.com/datasets/yamaerenay/ico-coffee-dataset-worldwide?select=disappearance.csv)
- **Dataset**: [Coffee Consumption (disappearance)](https://raw.githubusercontent.com/FrancescoCoding/Data-Vis-Datasets/main/Coffee_consumption.csv)