An open API service indexing awesome lists of open source software.

https://github.com/skrub-data/datasets

skrub (previously dirty-cat) related dataset files. Includes script, raw datasets, etc.
https://github.com/skrub-data/datasets

Last synced: 8 months ago
JSON representation

skrub (previously dirty-cat) related dataset files. Includes script, raw datasets, etc.

Awesome Lists containing this project

README

          

# Datasets
Download and denormalization scripts for skrub datasets.

Contains also:
- Correspondence table between KEN Embeddings and their figshare download ID[[1]](#1).
- Happiness score dataset from the World Happiness Report 2022[[2]](#2).
- Bike sharing dataset from the UCI Machine Learning Repository[[3]](#3).

## References
[1]
https://soda-inria.github.io/ken_embeddings/

[2]
Helliwell, J. F., Layard, R., Sachs, J. D., De Neve, J.-E., Aknin, L. B., & Wang, S. (Eds.). (2022).
[World Happiness Report 2022](https://worldhappiness.report/ed/2022/). New York: Sustainable Development Solutions Network.

[2]
Fanaee-T,Hadi. (2013). Bike Sharing. UCI Machine Learning Repository. https://doi.org/10.24432/C5W894.