https://github.com/mlampros/datasets
Data sets in .zip or .geojson format
https://github.com/mlampros/datasets
data geojson zip
Last synced: 7 months ago
JSON representation
Data sets in .zip or .geojson format
- Host: GitHub
- URL: https://github.com/mlampros/datasets
- Owner: mlampros
- Created: 2016-06-13T18:24:23.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2021-05-19T09:36:43.000Z (over 4 years ago)
- Last Synced: 2025-02-15T00:29:24.511Z (12 months ago)
- Topics: data, geojson, zip
- Homepage:
- Size: 211 MB
- Stars: 6
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.html
Awesome Lists containing this project
README
README
h1 {font-size: 34px;}
h1.title {font-size: 38px;}
h2 {font-size: 30px;}
h3 {font-size: 24px;}
h4 {font-size: 18px;}
h5 {font-size: 16px;}
h6 {font-size: 12px;}
code {color: inherit; background-color: rgba(0, 0, 0, 0.04);}
pre:not([class]) { background-color: white }
code{white-space: pre-wrap;}
span.smallcaps{font-variant: small-caps;}
span.underline{text-decoration: underline;}
div.column{display: inline-block; vertical-align: top; width: 50%;}
div.hanging-indent{margin-left: 1.5em; text-indent: -1.5em;}
ul.task-list{list-style: none;}
code{white-space: pre;}
if (window.hljs) {
hljs.configure({languages: []});
hljs.initHighlightingOnLoad();
if (document.readyState && document.readyState === "complete") {
window.setTimeout(function() { hljs.initHighlighting(); }, 0);
}
}
.main-container {
max-width: 940px;
margin-left: auto;
margin-right: auto;
}
img {
max-width:100%;
}
.tabbed-pane {
padding-top: 12px;
}
.html-widget {
margin-bottom: 20px;
}
button.code-folding-btn:focus {
outline: none;
}
summary {
display: list-item;
}
pre code {
padding: 0;
}
.tabset-dropdown > .nav-tabs {
display: inline-table;
max-height: 500px;
min-height: 44px;
overflow-y: auto;
border: 1px solid #ddd;
border-radius: 4px;
}
.tabset-dropdown > .nav-tabs > li.active:before {
content: "";
font-family: 'Glyphicons Halflings';
display: inline-block;
padding: 10px;
border-right: 1px solid #ddd;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open > li.active:before {
content: "";
border: none;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open:before {
content: "";
font-family: 'Glyphicons Halflings';
display: inline-block;
padding: 10px;
border-right: 1px solid #ddd;
}
.tabset-dropdown > .nav-tabs > li.active {
display: block;
}
.tabset-dropdown > .nav-tabs > li > a,
.tabset-dropdown > .nav-tabs > li > a:focus,
.tabset-dropdown > .nav-tabs > li > a:hover {
border: none;
display: inline-block;
border-radius: 4px;
background-color: transparent;
}
.tabset-dropdown > .nav-tabs.nav-tabs-open > li {
display: block;
float: none;
}
.tabset-dropdown > .nav-tabs > li {
display: none;
}
Data sets in .zip format
To download the .zip files on a linux OS from within R use :
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/mnist.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/cifar_10.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/africa_soil_train_data.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/sift_10k.txt”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/BSR_bsds500.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/fastText_data.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/wili-2018.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/declaration_human_rights.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/lid.176.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/grid30.zip”)
system(“wget https://raw.githubusercontent.com/mlampros/DataSets/master/grid90.zip”)
In any other case use the Clone or download button in this repository page to download the dataset(s).
data
rows
columns
notes
mnist
70000
785 (including the class
–
cifar 10
60000
1025 (including the class)
the data were converted from RGB to gray, normalized and rounded to 2 decimal places (to reduce the storage size)
africa soil data train
1157
3600
https://www.kaggle.com/c/afsis-soil-properties/data
sift_10k
10000
128
https://github.com/searchivarius/nmslib/blob/master/sample_data/sift_10k.txt
Berkeley Segmentation Data Set and Benchmarks 500 (BSDS500)
500 images (train, test, validation
-
https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/grouping/resources.html
fastText data
-
-
The .zip file includes files / folders for the fastText R package
wili-2018
-
-
WiLI-2018, the Wikipedia language identification benchmark dataset, contains 235000 paragraphs of 235 languages. The dataset is balanced and a train-test split is provided. Download weblink: https://zenodo.org/record/841984#.YHwCvOpRUV0
Declaration of Human rights
-
-
The .zip file includes the decalaration of human rights (https://www.un.org/en/about-us/universal-declaration-of-human-rights) in Chinese, English, Spanish (first 3 by number of speakers worldwide, https://en.wikipedia.org/wiki/List_of_languages_by_number_of_native_speakers)
fastText language identification
-
-
The .zip file includes a pre-trained model for language identification (https://fasttext.cc/docs/en/language-identification.html)
grid30.zip
-
-
The .zip file includes the 30 meter elevation .shp files required to reproduce the .RDS files of the CopernicusDEM R package
grid90.zip
-
-
The .zip file includes the 90 meter elevation .shp files required to reproduce the .RDS files of the CopernicusDEM R package
To download the .Rd files from the man folder use the Raw button, then right click and Save as…
Data in .geojson format
To display / download the data use the raw format,
https://raw.githubusercontent.com/mlampros/DataSets/master/california.geojson
// add bootstrap table styles to pandoc tables
function bootstrapStylePandocTables() {
$('tr.odd').parent('tbody').parent('table').addClass('table table-condensed');
}
$(document).ready(function () {
bootstrapStylePandocTables();
});
$(document).ready(function () {
window.buildTabsets("TOC");
});
$(document).ready(function () {
$('.tabset-dropdown > .nav-tabs > li').click(function () {
$(this).parent().toggleClass('nav-tabs-open');
});
});
(function () {
var script = document.createElement("script");
script.type = "text/javascript";
script.src = "https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-AMS-MML_HTMLorMML";
document.getElementsByTagName("head")[0].appendChild(script);
})();