https://github.com/mlampros/datasets

Data sets in .zip or .geojson format






Data sets in .zip format



To download the files on a Linux OS from within R, use:

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/mnist.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/cifar_10.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/africa_soil_train_data.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/sift_10k.txt")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/BSR_bsds500.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/fastText_data.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/wili-2018.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/declaration_human_rights.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/lid.176.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/grid30.zip")

system("wget https://raw.githubusercontent.com/mlampros/DataSets/master/grid90.zip")


Otherwise, use the Clone or download button on the repository page to download the dataset(s).
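For systems without wget, the same downloads could be sketched with base R alone. This is a minimal sketch, not the author's method: the helper names `dataset_url` and `fetch_dataset` are hypothetical, and only the base URL and file names come from the commands above.

```r
# Base URL taken from the wget commands above.
base_url <- "https://raw.githubusercontent.com/mlampros/DataSets/master"

# Hypothetical helper: build the raw URL for a file name in the repository.
dataset_url <- function(file_name) {
  paste(base_url, file_name, sep = "/")
}

# Hypothetical helper: download one file and, if it is a .zip archive,
# extract it into the destination directory.
fetch_dataset <- function(file_name, dest_dir = ".") {
  dest_file <- file.path(dest_dir, file_name)
  # mode = "wb" keeps binary files intact on Windows
  utils::download.file(dataset_url(file_name), destfile = dest_file, mode = "wb")
  if (grepl("\\.zip$", file_name)) {
    utils::unzip(dest_file, exdir = dest_dir)
  }
  invisible(dest_file)
}

# Example (requires network access):
# fetch_dataset("mnist.zip")
```

Because `download.file` and `unzip` ship with R's utils package, this variant should not depend on the operating system, at the cost of being slightly more verbose than a single `system()` call.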



| data | rows | columns | notes |
|---|---|---|---|
| mnist | 70000 | 785 (including the class) | |
| cifar 10 | 60000 | 1025 (including the class) | the data were converted from RGB to gray, normalized and rounded to 2 decimal places (to reduce the storage size) |
| africa soil data train | 1157 | 3600 | https://www.kaggle.com/c/afsis-soil-properties/data |
| sift_10k | 10000 | 128 | https://github.com/searchivarius/nmslib/blob/master/sample_data/sift_10k.txt |
| Berkeley Segmentation Data Set and Benchmarks 500 (BSDS500) | 500 images (train, test, validation) | - | https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html |
| fastText data | - | - | The .zip file includes files / folders for the fastText R package |
| wili-2018 | - | - | WiLI-2018, the Wikipedia language identification benchmark dataset, contains 235000 paragraphs of 235 languages. The dataset is balanced and a train-test split is provided. Download weblink: https://zenodo.org/record/841984#.YHwCvOpRUV0 |
| Declaration of Human rights | - | - | The .zip file includes the declaration of human rights (https://www.un.org/en/about-us/universal-declaration-of-human-rights) in Chinese, English, Spanish (first 3 by number of speakers worldwide, https://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers) |
| fastText language identification | - | - | The .zip file includes a pre-trained model for language identification (https://fasttext.cc/docs/en/language-identification.html) |
| grid30.zip | - | - | The .zip file includes the 30 meter elevation .shp files required to reproduce the .RDS files of the CopernicusDEM R package |
| grid90.zip | - | - | The .zip file includes the 90 meter elevation .shp files required to reproduce the .RDS files of the CopernicusDEM R package |



To download the .Rd files from the man folder, use the Raw button, then right-click and choose Save as…






Data in .geojson format




To display or download the data, use the raw format:


https://raw.githubusercontent.com/mlampros/DataSets/master/california.geojson
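The raw URL above can also be read directly from R. The following is a minimal sketch under two assumptions: the jsonlite package is installed, and a plain nested list is an acceptable representation of the GeoJSON content. The helper name `read_geojson` is hypothetical and not part of the repository.

```r
geojson_url <- "https://raw.githubusercontent.com/mlampros/DataSets/master/california.geojson"

# Hypothetical helper: parse a GeoJSON file (local path or URL) into a nested R list.
# Assumes the 'jsonlite' package is installed.
read_geojson <- function(path) {
  if (!requireNamespace("jsonlite", quietly = TRUE)) {
    stop("Package 'jsonlite' is required: install.packages('jsonlite')")
  }
  # simplifyVector = FALSE keeps the original nesting of the JSON document
  jsonlite::fromJSON(path, simplifyVector = FALSE)
}

# Example (requires network access):
# california <- read_geojson(geojson_url)
# names(california)
```

For spatial work, a dedicated package that returns geometry objects would likely be a better fit than a plain list; the list form is only meant for inspecting the file.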

