{"id":16572152,"url":"https://github.com/awhstin/dataset-list","last_synced_at":"2025-10-18T02:58:00.484Z","repository":{"id":48055690,"uuid":"55236311","full_name":"awhstin/Dataset-List","owner":"awhstin","description":"Storing datasets","archived":false,"fork":false,"pushed_at":"2021-08-16T15:16:44.000Z","size":14134,"stargazers_count":9,"open_issues_count":0,"forks_count":10,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-04T22:14:07.495Z","etag":null,"topics":["airport","data-mining","database","dataset"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/awhstin.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2016-04-01T13:57:51.000Z","updated_at":"2023-12-28T19:40:20.000Z","dependencies_parsed_at":"2022-08-12T17:40:19.430Z","dependency_job_id":null,"html_url":"https://github.com/awhstin/Dataset-List","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/awhstin/Dataset-List","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/awhstin%2FDataset-List","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/awhstin%2FDataset-List/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/awhstin%2FDataset-List/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/awhstin%2FDataset-List/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/awhstin","download_url":"https://codeload.github.com/awhstin/Dataset-List/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.m
s/api/v1/hosts/GitHub/repositories/awhstin%2FDataset-List/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263931997,"owners_count":23531705,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["airport","data-mining","database","dataset"],"created_at":"2024-10-11T21:26:33.673Z","updated_at":"2025-10-18T02:57:55.460Z","avatar_url":"https://github.com/awhstin.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# Dataset-List\nI store some of my favorite data sets here as well as those I want to use later. \n\n## Datasets\n+ US streams database found [here](http://nationalmap.gov/small_scale/atlasftp-1m.html?openChapters=chpwater#chpwater)\n+ World roads shapefile, found [here](http://www.naturalearthdata.com/downloads/10m-cultural-vectors/roads/)\n+ [Ceta Base](http://www.cetabase.org/), Captive Cetacean Database \n+ 2018 EPL Fixtures list\n+ U.S. Census Bureau [FactFinder](https://factfinder.census.gov/faces/nav/jsf/pages/searchresults.xhtml?refresh=t#acsST)\n+ Stack Exchange [Data Dump](https://archive.org/details/stackexchange)\n+ Centers for Disease Control - [Wonder Database](https://wonder.cdc.gov/)\n  + Mortality rates, foodborne illness data, etc. \n+ SimpleMaps US Cities [Database](http://simplemaps.com/data/us-cities)\n+ I created a list of airports and shortcodes from the TSA XML [file](https://www.tsa.gov/data/apcp.xml). The airport CSV is in this repo, and here is a map of some of the airports. 
\\\n![Alt text](https://github.com/awhstin/Dataset-List/blob/master/airportsv2.png \"Airports by category\")\n\n+ Bike share data (Divvy, Citibike etc) via ROpenSci/[bikedata](https://github.com/ropensci/bikedata)\n+ Bureau of Labor Statistics [Databases, Tables \u0026 Calculators](https://www.bls.gov/data/)\n+ NYT [Cost of Hurricane Harvey](https://www.nytimes.com/interactive/2017/09/01/upshot/cost-of-hurricane-harvey-only-one-storm-comes-close.html?mcubz=1) viz background [data](https://static01.nyt.com/newsgraphics/2017/08/29/expensive-storms/79088630ae1af934d7840e104a0e3f1e8a6c7bf1/data-2.tsv)\n+ Observatory of Economic Complexity [API](http://atlas.media.mit.edu/api/)\n  + Tutorial on calling the API and visualizing using streamgraphs [here](http://austinwehrwein.com/tutorials/streams/)\n+ Confederate monuments data from [Southern Poverty Law Center](https://splcenter.carto.com/tables/confederate_symbols/public) found in this repo. \\\n![Alt text](https://github.com/awhstin/Dataset-List/blob/master/states.png \"States with Confederate monuments\")\n+ Bechdel test data and [API documentation](http://bechdeltest.com/api/v1/doc) and example of use from this article, [Men, women and films](https://www.1843magazine.com/data-graphic/what-the-numbers-say/men-women-and-films)\n+ FEMA National Flood Insurance Program [data](https://www.fema.gov/statistics-calendar-year0) by Calendar Year\n+ Debt in America data from [Debt Interactive Map](https://apps.urban.org/features/debt-interactive-map/) located in this repo.\n+ [Gun Violence Reports](http://www.gunviolencearchive.org/reports) from the Gun Violence Archive\n+ Firefighter fatalities in the US, [data](https://www.kaggle.com/fema/firefighter-fatalities) from Kaggle.\n+ Top 500 passwords from [Information is Beautiful](https://informationisbeautiful.net/visualizations/top-500-passwords-visualized/), [data](http://bit.ly/KIB_PopularPasswords)\n+ Cook County Medical Examiner Case Archive - [Data 
Lens](https://datacatalog.cookcountyil.gov/Public-Safety/Medical-Examiner-Case-Archive/cjeq-bs86)\n+ Homeless arrests [github](https://github.com/datadesk/homeless-arrests-analysis)\n+ R Socrata Chicago Open Data [API](https://github.com/Chicago/RSocrata)\n+ Zillow Home Values [Data](https://www.zillow.com/research/data/)\n+ Global video games sales [data](https://data.world/julienf/video-games-global-sales-in-volume-1983-2017)\n+ Institute for Health Metrics and Evaluation [data](http://ghdx.healthdata.org/us-data)\n+ Sampled speeches from The History Place - [Great Speeches](http://www.historyplace.com/speeches/previous.htm)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fawhstin%2Fdataset-list","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fawhstin%2Fdataset-list","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fawhstin%2Fdataset-list/lists"}