Projects in Awesome Lists by datasets
A curated list of projects in awesome lists by datasets.
https://github.com/datasets/covid-19
Novel Coronavirus 2019 time series data on cases
coronavirus coronavirus-disease covid covid-19 covid19-data data-package datapackage dataset
Last synced: 14 May 2025
https://github.com/datasets/commons
DataHub commons. Wiki catalog of interesting and important datasets
data datasets datasets-csv open-data open-datasets opendata
Last synced: 03 Feb 2026
https://github.com/datasets/country-codes
Comprehensive country code information, including ISO 3166 codes, ITU dialing codes, ISO 4217 currency codes, and many others
Last synced: 13 Apr 2025
https://github.com/datasets/s-and-p-500-companies
List of companies in the S&P 500 together with associated financials
Last synced: 25 Jan 2026
https://github.com/datasets/geo-countries
Country polygons as GeoJSON in a datapackage
Last synced: 15 May 2025
https://github.com/datasets/edgar
Securities and Exchange Commission (SEC) EDGAR database which contains regulatory filings from publicly-traded US corporations.
Last synced: 27 Jan 2026
https://github.com/datasets/airport-codes
List of Airport codes, locations and other information around the world
Last synced: 16 May 2025
https://github.com/datasets/s-and-p-500
S&P 500 index data (aka Standard and Poor's index of 500 major US stocks)
Last synced: 16 May 2025
https://github.com/datasets/world-cities
List of major cities of the world as a datapackage
Last synced: 08 Apr 2025
https://github.com/datasets/country-list
List of all countries in the world with their ISO 2 digit codes (ISO 3166-1) as CSV and JSON
Last synced: 28 Jan 2026
https://github.com/datasets/currency-codes
ISO 4217 List of Currencies and Currency Codes
Last synced: 15 May 2025
https://github.com/datasets/un-locode
United Nations Codes for Trade and Transport Locations (UN/LOCODE) and Country Codes
Last synced: 04 Apr 2025
https://github.com/datasets/language-codes
ISO Language Codes (639-1 and 639-2)
Last synced: 04 Apr 2025
https://github.com/datasets/population
Population figures for countries, regions (e.g. Asia) and the world.
data-package datapackage dataset population population-figures
Last synced: 12 Apr 2025
https://github.com/datasets/oil-prices
Brent crude and WTI oil prices from US EIA
Last synced: 04 Apr 2025
https://github.com/datasets/gdp
Country, regional and world GDP in current US Dollars ($)
Last synced: 05 Apr 2025
https://github.com/datasets/s-and-p-500-companies-financials
List of companies in the S&P 500 (Standard and Poor's 500).
Last synced: 08 Feb 2026
https://github.com/datasets/publicbodies
A database of public bodies such as government departments, ministries etc.
department fire ministries open-data open-government open-knowledge-international police
Last synced: 12 Apr 2025
https://github.com/datasets/geoip2-ipv4
GeoIP2 - free IP geolocation database.
Last synced: 07 Apr 2025
https://github.com/datasets/finance-vix
CBOE Volatility Index (VIX) time-series dataset including daily open, close, high and low.
Last synced: 27 Jan 2026
https://github.com/datasets/nasdaq-listings
Data package for Nasdaq listings
Last synced: 05 Apr 2025
https://github.com/datasets/core-datasets
DataHub.io awesome datasets - curated collections of high-quality datasets organized by topic
Last synced: 07 Apr 2025
https://github.com/datasets/top-level-domain-names
The delegation details of top-level domains
Last synced: 13 Jul 2025
https://github.com/datasets/imf-weo
IMF World Economic Outlook Database Data
Last synced: 28 Feb 2026
https://github.com/datasets/iso-container-codes
Coded list of ISO 6346 shipping containers, used in international trade and electronic shipping messages.
Last synced: 01 Feb 2026
https://github.com/datasets/five-thirty-eight-datasets
Over 100 datasets scraped from FiveThirtyEight
Last synced: 10 Apr 2025
https://github.com/datasets/media-types
List of MIME types, subtypes, and file name extensions.
Last synced: 01 Jul 2025
https://github.com/datasets/clinical-trials-us
Official US clinical trial outcomes from the FDA
Last synced: 18 Jun 2025
https://github.com/datasets/investor-flow-of-funds-us
Monthly net new cash flow into various mutual fund investment classes (equities, bonds etc).
Last synced: 22 Aug 2025
https://github.com/datasets/nyse-other-listings
Data package for NYSE listings
Last synced: 12 Apr 2025
https://github.com/datasets/commodity-prices
Monthly Prices of 53 commodities and 10 indexes from 1980 to 2016.
Last synced: 06 Oct 2025
https://github.com/datasets/population-city
City population yearly timeseries for female and male, and for both sexes, collected by the United Nations Statistics Division and published by UNData.
Last synced: 05 Oct 2025
https://github.com/datasets/natural-gas
Natural Gas Prices including Henry Hub
Last synced: 12 Apr 2025
https://github.com/datasets/house-prices-us
US House Price Indices (Case-Shiller)
Last synced: 12 Apr 2025
https://github.com/datasets/exchange-rates
Foreign exchange rates from US Federal Reserve.
Last synced: 12 Apr 2025
https://github.com/datasets/inflation
Annual Inflation, GDP deflator and consumer prices
Last synced: 12 Apr 2025
https://github.com/datasets/geo-boundaries-world-110m
DEPRECATED - replaced by https://github.com/datasets/geo-countries (Map of the world's countries - vector data at 1:110m scale)
Last synced: 15 Feb 2026
https://github.com/datasets/co2-ppm
CO2 PPM - Trends in Atmospheric Carbon Dioxide
Last synced: 12 Apr 2025
https://github.com/datasets/continent-codes
List of continents with two letter code
Last synced: 08 Mar 2026
https://github.com/datasets/corruption-perceptions-index
Corruption Perceptions Index - CPI
Last synced: 12 Apr 2025
https://github.com/datasets/gini-index
Official repository for the GINI index.
Last synced: 11 Mar 2026
https://github.com/datasets/bond-yields-us-10y
10 year nominal yields on US government bonds from the Federal Reserve
Last synced: 12 Apr 2025
https://github.com/datasets/cpi
Annual consumer price index datapackage for most countries in the world
Last synced: 12 Apr 2025
https://github.com/datasets/cpi-us
US Consumer Price Index (DataHub Data Package)
Last synced: 12 Apr 2025
https://github.com/datasets/employment-us
US Employment and Unemployment rates since 1940 from Bureau of Labor Statistics
Last synced: 06 Oct 2025
https://github.com/datasets/eu-emissions-trading-system
Data about the EU emission trading system (ETS)
Last synced: 12 Apr 2025
https://github.com/datasets/world-religion-projections
World Religion Projections (2010-2050)
Last synced: 02 Feb 2026
https://github.com/datasets/co2-fossil-by-nation
Annual data on CO2 emissions per nation
Last synced: 12 Apr 2025
https://github.com/datasets/co2-ppm-daily
Carbon Dioxide levels in the atmosphere (ppm on a daily basis)
Last synced: 29 Jul 2025
https://github.com/datasets/unece-units-of-measure
Standardised codes from Recommendation 20, maintained by UNECE.
Last synced: 12 Apr 2025
https://github.com/datasets/dac-and-crs-code-lists
Machine readable DAC CRS codelists
Last synced: 07 Mar 2026
https://github.com/datasets/imo-imdg-codes
Official IMDG Codes for use in transport of dangerous goods as described by the IMO
Last synced: 09 Mar 2026
https://github.com/datasets/openml-datasets
Group of most downloaded datasets extracted from https://www.openml.org
Last synced: 09 Feb 2026
https://github.com/datasets/glwd
Global Lakes and Wetlands Database Levels 1 and 2 Polygons as GeoJSON (.geojson/.topojson) with original format (.shp)
Last synced: 04 Mar 2025
https://github.com/datasets/geo-nuts-administrative-boundaries
Datapackage for NUTS admin levels 1, 2 and 3 edition 2010
Last synced: 12 Apr 2025
https://github.com/datasets/cpi-gb
Consumer Price Index (and hence inflation) for the UK from 1850 to the present (monthly since June 1947).
Last synced: 06 Mar 2026
https://github.com/datasets/fips-10-4
List of FIPS (Federal Information Processing Standards) region codes
Last synced: 29 Aug 2025
https://github.com/datasets/world-wealth-and-income-database
World Wealth and Income Database (formerly World Top Incomes Database). Database of income shares of top end of population for long time periods (e.g. 1875-present) for a variety of countries around the world.
Last synced: 02 Feb 2026
https://github.com/datasets/gdp-us
Gross Domestic Product of the United States (US GDP)
Last synced: 10 Sep 2025
https://github.com/datasets/cofog
Classifications of Functions of Government
Last synced: 05 Mar 2026
https://github.com/datasets/glacier-mass-balance
Average cumulative mass balance of "reference" Glaciers worldwide
Last synced: 15 Oct 2025
https://github.com/datasets/opented
Tenders Electronic Daily (TED) - OpenTED
Last synced: 12 Apr 2025
https://github.com/datasets/pharmaceutical-drug-spending
Pharmaceutical Drug Spending by countries
Last synced: 12 Apr 2025
https://github.com/datasets/unece-package-codes
Coded representations of the package type names used in International Trade (UNECE/CEFACT Trade Facilitation Recommendation No.21)
Last synced: 29 Jan 2026
https://github.com/datasets/co2-fossil-global
Annual global CO2 emissions from fossil fuels, 1751 to 2014.
Last synced: 06 Mar 2026
https://github.com/datasets/speed-dating
Data gathered from participants in experimental speed dating events from 2002 to 2004
Last synced: 20 Jul 2025
https://github.com/datasets/geo-boundaries-us-110m
Internal, first-order administrative boundaries and polygons for the United States in .shp, .geojson, and .topojson.
Last synced: 09 Mar 2026
https://github.com/datasets/population-growth-estimates-and-projections
Total Population
Last synced: 09 Sep 2025
https://github.com/datasets/geo-ne-admin1
Test of a datapackage for Natural Earth admin1
Last synced: 27 Jun 2025
https://github.com/datasets/smdg-master-terminal-facilities-list
List maintained by the SMDG Secretariat to specify the port terminal facilities in UN/EDIFACT messages.
Last synced: 09 Mar 2026
https://github.com/datasets/population-global-historical
Global historical population data
Last synced: 19 Jul 2025
https://github.com/datasets/lme-large-marine-ecosystems
LME (Large Marine Ecosystems) global dataset; originally .kml (.kmz), and .shp formats, converted to .geojson/.topojson
Last synced: 05 Mar 2026
https://github.com/datasets/icc-incoterms
International Commercial Terms (‘Incoterms’) are internationally recognised standard trade terms used in sales contracts.
Last synced: 05 Mar 2026
https://github.com/datasets/household-income-us-historical
Income Limits for Each Fifth and Top 5 Percent of All Households: 1967 to 2016
Last synced: 05 Jul 2025
https://github.com/datasets/eeg-eye-state
EEG measurements where the output is whether the eye was open or not
Last synced: 18 Jul 2025
https://github.com/datasets/zopa
Data on interest rate and risk (default rates) at ZOPA, the peer-to-peer marketplace for money.
Last synced: 14 Jul 2025
https://github.com/datasets/cash-surplus-deficit
Cash Surplus/Deficit (% of GDP), from 1990 to 2013
Last synced: 24 Aug 2025
https://github.com/datasets/bond-yields-gov-long-term
Long term government bond yields
Last synced: 04 Mar 2025
https://github.com/datasets/genome-sequencing-costs
Costs associated with DNA sequencing since 2001
Last synced: 19 Oct 2025
https://github.com/datasets/global-temp-anomalies
Data about global annual temperature anomalies
Last synced: 12 Apr 2025
https://github.com/datasets/house-prices-global
Residential property price statistics from different countries (from bis.org)
Last synced: 12 Apr 2025
https://github.com/datasets/dermatology
Patients with dermatology illnesses.
Last synced: 12 Apr 2025