Open Data
Open data is data that can be freely used, re-used, and redistributed by anyone - subject only, at most, to the requirement to attribute and sharealike.
- GitHub: https://github.com/topics/open-data
- Wikipedia: https://en.wikipedia.org/wiki/Open_data
- Related Topics: data, dataset, linked-open-data, open-access, open-science, openstreetmap, wikidata,
- Aliases: opendata,
- Last updated: 2026-01-16 00:23:19 UTC
- JSON Representation
https://github.com/ckan/ckan
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data.humdata.org among many other sites.
api catalog ckan ckanext data digitalpublicgoods dpg open-data python sdg16
Last synced: 12 May 2025
https://github.com/okfn-brasil/serenata-de-amor
🕵 Artificial Intelligence for social control of public administration | **This repository does not receive frequent updates. Check out the README**
artificial-intelligence civic-tech data-science machine-learning open-data politics
Last synced: 14 May 2025
https://github.com/common-voice/common-voice
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
crowdsourcing internet-freedom open-data voice
Last synced: 13 May 2025
https://github.com/statsbomb/open-data
Free football data from StatsBomb
football football-data open-data soccer sports-data sports-stats
Last synced: 16 Jan 2026
https://github.com/codyogden/killedbygoogle
Part guillotine, part graveyard for Google's doomed apps, services, and hardware.
front-end google hacktoberfest json open-data product react
Last synced: 14 May 2025
https://github.com/mdeff/fma
FMA: A Dataset For Music Analysis
dataset deep-learning music-analysis music-information-retrieval open-data open-science reproducible-research
Last synced: 15 May 2025
https://github.com/github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
bert cnn data data-science datasets deep-learning machine-learning machine-learning-on-source-code ml natural-language-processing neural-networks nlp nlp-machine-learning open-data programming-language-theory python representation-learning rnn self-attention tensorflow
Last synced: 13 Mar 2025
https://github.com/github/codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
bert cnn data data-science datasets deep-learning machine-learning machine-learning-on-source-code ml natural-language-processing neural-networks nlp nlp-machine-learning open-data programming-language-theory python representation-learning rnn self-attention tensorflow
Last synced: 29 Sep 2025
https://github.com/gsa/datagov-wptheme
Data.gov WordPress Theme (obsolete)
government open-data wordpress
Last synced: 29 Sep 2025
https://github.com/GSA/datagov-wptheme
Data.gov WordPress Theme (obsolete)
government open-data wordpress
Last synced: 27 Jul 2025
https://github.com/open-thoughts/open-thoughts
Fully open data curation for reasoning models
Last synced: 03 Apr 2025
https://github.com/okfn-brasil/querido-diario
📰 Diários oficiais brasileiros acessíveis a todos | 📰 Brazilian government gazettes, accessible to everyone.
civic-tech data-science digital-public-goods dpg governments-gazettes govtech hacktoberfest open-data politics scraping sdg-16 spider
Last synced: 12 Apr 2025
https://github.com/juancarlospaco/faster-than-requests
Faster requests on Python 3
curl cython download-file faster-than-requests high-performance http-requests ndjson open-data python python-library python-requests python3 requests-toolbelt requests3 scrapy speed urllib urllib3 web-scraper web-scraping
Last synced: 14 May 2025
https://github.com/sentinelsat/sentinelsat
Search and download Copernicus Sentinel satellite images
copernicus esa geographic-data hacktoberfest open-data remote-sensing satellite-imagery sentinel
Last synced: 21 Oct 2025
https://github.com/datasets/commons
DataHub commons. Wiki catalog of interesting and important datasets
data datasets datasets-csv open-data open-datasets opendata
Last synced: 17 Jan 2026
https://github.com/MobilityData/gbfs
Documentation for the General Bikeshare Feed Specification, a standardized data feed for shared mobility system availability. Maintained by MobilityData
bike-share bike-sharing bikesharing carshare carsharing civic-tech gbfs gbfs-documentation mobility mobility-as-a-service mobilitydata open-data scooter-sharing shared-mobility urban-mobility
Last synced: 03 Apr 2025
https://github.com/kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
admin-boundaries data data-integration data-science dbt elt google-trends jupyter kuwala no-code open-data open-source population postgres pyspark python react react-flow scraping spatial-analysis
Last synced: 30 Mar 2025
https://github.com/cernopendata/opendata.cern.ch
Source code for the CERN Open Data portal
big-data digital-library digital-repository flask invenio inveniosoftware json-schema open-data open-research-data open-science python research-data research-data-management research-data-repository
Last synced: 15 May 2025
https://github.com/blaylockbk/Herbie
Download numerical weather prediction datasets (HRRR, RAP, GFS, IFS, etc.) from NOMADS, NODD partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.
big-data-program cfgrib download ecmwf-data gfs grib grib2 hrrr noaa-data nomads numerical-weather-prediction open-data python rap xarray
Last synced: 20 Jul 2025
https://github.com/blaylockbk/herbie
Download numerical weather prediction datasets (HRRR, RAP, GFS, IFS, etc.) from NOMADS, NODD partners (Amazon, Google, Microsoft), ECMWF open data, and the University of Utah Pando Archive System.
big-data-program cfgrib download ecmwf-data gfs grib grib2 hrrr noaa-data nomads numerical-weather-prediction open-data python rap xarray
Last synced: 14 May 2025
https://github.com/siznax/wptools
Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
api-client commons data-science glam linked-open-data mediawiki mediawiki-api open-data python restbase wikidata wikimedia-commons wikipedia wikipedia-api
Last synced: 15 May 2025
https://github.com/etalab/dvf-app
Exploration des données DVF
cadastre cartography dvf open-data ventes-immobilieres visualisation
Last synced: 10 Apr 2025
https://github.com/etalab/DVF-app
Exploration des données DVF
cadastre cartography dvf open-data ventes-immobilieres visualisation
Last synced: 01 Aug 2025
https://github.com/catalyst-cooperative/pudl
The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
cems climate coal ddj eia eia860 eia923 electricity emissions energy epa etl ferc ghg natural-gas open-data pudl python sqlite utility
Last synced: 14 May 2025
https://github.com/meteostat/meteostat-python
Access and analyze historical weather and climate data with Python.
climate climate-change climate-data data-science meteostat open-data statistics weather weather-data weather-station
Last synced: 20 Jul 2025
https://github.com/github/covid-19-repo-data
Data archive of identifiable COVID-19 related public projects on GitHub
covid-19 dataset extracts open-data
Last synced: 04 Oct 2025
https://github.com/magda-io/magda
A federated, open-source data catalog for all your big data and small data
elasticsearch kubernetes nodejs open-data postgresql scala
Last synced: 24 Dec 2025
https://github.com/geonetwork/core-geonetwork
GeoNetwork is a catalog application to manage spatially referenced resources. It provides powerful metadata editing and search functions as well as an interactive web map viewer. It is currently used in numerous Spatial Data Infrastructure initiatives across the world.
api catalog csw dcat geospatial inspire iso19110 iso19115 iso19119 iso19139 metadata metadata-management ogc ogcapi open-data opendata opensearch
Last synced: 14 May 2025
https://github.com/anahitasocial/anahita
Anahita is a platform and framework for developing open science and knowledge sharing applications on a social networking foundation.
actors anahita anahita-apps framework graph-architecture hashtags knowledge-sharing location-graph locations mentions notifications open-data open-science platform social-apps social-graph social-network-graph social-networking
Last synced: 11 Jan 2026
https://github.com/Chicago/food-inspections-evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
cdph chicago data-science food-poisoning open-data open-science public-health
Last synced: 27 Mar 2025
https://github.com/basedosdados/sdk
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/sdk/
bigquery dados-abertos data-science govtech hacktoberfest hacktoberfest2022 open-data python r sql transparencia
Last synced: 14 May 2025
https://fraud-detection-handbook.github.io/fraud-detection-handbook/
Reproducible Machine Learning for Credit Card Fraud Detection - Practical Handbook
credit-card credit-card-fraud data-mining data-science fraud-detection machine-learning open-data
Last synced: 19 Nov 2025
https://github.com/earthobservations/wetterdienst
Open weather data for humans.
canada data deutscher-wetterdienst dwd eccc germany historical-data hydrology meteorology open-data open-source radar time-series uk united-states weather weather-api weather-forecast weather-station weatherservice
Last synced: 14 May 2025
https://github.com/UCF-SST-Lab/UCF-SST-CitySim1-Dataset
Official github page of UCF SST CitySim Dataset
carla-driving-simulator carla-simulator computer-vision dataset digitaltwin drone open open-data simulation sumo trajectory
Last synced: 20 Mar 2025
https://github.com/jdemaeyer/brightsky
JSON API for DWD's open weather data.
Last synced: 20 Jul 2025
https://github.com/openlists/ElectrophysiologyData
A list of openly available datasets in (mostly human) electrophysiology.
data ecog eeg electrophysiological-data electrophysiology lfp meg open-data open-science research
Last synced: 09 May 2025
https://github.com/City-Bureau/city-scrapers
Scrape, standardize and share public meetings from local government websites
city-scrapers open-data python scrapy web-scraping
Last synced: 07 Apr 2025
https://github.com/TrumpTracker/trumptracker.github.io
Open source for http://trumptracker.github.io/
jekyll open-data open-source policy politics
Last synced: 10 May 2025
https://github.com/wri/global-power-plant-database
A comprehensive, global, open source database of power plants
climate climate-data energy energy-data free-datasets open-data open-datasets
Last synced: 07 May 2025
https://github.com/upgini/upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
automated-feature-engineering automl automl-pipeline chatgpt data-enrichment data-science feature-engineering feature-extraction feature-selection features kaggle kaggle-solution large-language-models llm machine-learning open-data open-datasets public-data python-library scikit-learn
Last synced: 15 May 2025
https://github.com/kamu-data/kamu-cli
Next-generation decentralized data lakehouse and a multi-party stream processing network
blockchain data-as-code data-management data-science datafusion flink jupyter kamu open-data open-data-fabric spark sql
Last synced: 15 May 2025
https://github.com/greenelab/scihub
Source code and data analyses for the Sci-Hub Coverage Study
crossref data-science doi journals libgen open-data sci-hub scimag scopus
Last synced: 09 Apr 2025
https://github.com/opengeos/aws-open-data-geo
A list of open geospatial datasets on AWS
aws environment geospatial mapping open-data satellite-imagery sustainability
Last synced: 12 Apr 2025
https://github.com/qcif/data-curator
Data Curator - share usable open data
csv csv-desktop-editor data-curator data-publication electron-app nodejs open-data tsv
Last synced: 06 Jul 2025
https://github.com/dadosgovbr/catalogos-dados-brasil
Mapeamento de iniciativas (e catálogos) de dados abertos governamentais no Brasil.
brasil brazil dados-abertos open-data portais-de-dados transparencia transparency
Last synced: 14 Mar 2025
https://github.com/Chicago/RSocrata
Provides easier interaction with Socrata open data portals http://dev.socrata.com. Users can provide a 'Socrata' data set resource URL, or a 'Socrata' Open Data API (SoDA) web query, or a 'Socrata' "human-friendly" URL, returns an R data frame. Converts dates to 'POSIX' format. Manages throttling by 'Socrata'.
chicago government open-data r socrata soda
Last synced: 27 Jul 2025
https://github.com/matheusrocha89/graphql-camara-deputados
API GraphQL com os dados da câmara de deputados do Brasil
brasil congresso dados-abertos deputados graphql javascript nodejs open-data political-science public-data
Last synced: 14 Mar 2025
https://github.com/nycdb/nycdb
Database of NYC Housing Data
civic-data data database housing nyc open-data psql python3
Last synced: 15 May 2025
https://github.com/buds-lab/building-data-genome-project-2
Whole building non-residential hourly energy meter data from the Great Energy Predictor III competition
building-automation building-energy electricity-consumption electricity-meter energy-consumption energy-efficiency open-data open-data-science open-source smart-city smart-meter
Last synced: 07 May 2025
https://github.com/Hack23/cia
Citizen Intelligence Agency. Open-source intelligence platform analyzing Swedish political activities using AI and data visualization. Tracks politicians, government institutions, and parliamentary data, offering detailed insights, performance metrics, and advanced analytics.
ai civic-tech css data-analysis data-visualization goverment government-data java ministries open-data osint parliament-charts parliamentary-monitoring political-analysis political-parties politics riksdagen sverigesriksdag sweden sweden-data
Last synced: 17 Jan 2026
https://github.com/blaylockbk/goes2go
Download and process GOES-16 and GOES-17 data from NOAA's archive on AWS using Python.
big-data-program download glm goes goes-16 goes-17 goes-satellite netcdf noaa-satellite open-data python satellite satellite-data satellite-imagery xarray
Last synced: 08 Apr 2025
https://github.com/JustFixNYC/who-owns-what
Who owns what in nyc?
civic-tech civictech mapbox-gl mapbox-gl-js open-data opendata postgresql reactjs
Last synced: 18 Jul 2025
https://github.com/CamaraDosDeputados/dados-abertos
Repositório do serviço de Dados Abertos da Câmara. Consulte as "Issues" para atendimento a dúvidas e sugestões.
api-service congresso dados-abertos deputados open-data opendata political-science politics public-data
Last synced: 14 Mar 2025
https://github.com/neurodatawithoutborders/pynwb
A Python API for working with Neurodata stored in the NWB Format
cross-platform data-format hdf hdf5 neurodata-without-borders neuroscience nwb nwb-n open-data open-science open-source pynwb python reproducible-research
Last synced: 15 May 2025
https://github.com/buds-lab/the-building-data-genome-project
A collection of non-residential buildings for performance analysis and algorithm benchmarking
commercial-building electrical-meters electricity-meter energy-efficiency feature-engineering feature-extraction jupyter-notebook open-data smart-meter temporal-data
Last synced: 04 Apr 2025
https://github.com/hrecht/censusapi
R package to retrieve U.S. Census data and metadata via API
census census-api census-data demographics open-data r rstats
Last synced: 15 May 2025
https://github.com/ropensci/osmextract
Download and import OpenStreetMap data from Geofabrik and other providers
geo geofabrik-zone open-data osm osm-pbf r rstats
Last synced: 16 May 2025
https://github.com/robert-koch-institut/COVID-19-Impfungen_in_Deutschland
Die COVID-19-Impfung kann einen Wendepunkt in der Kontrolle der COVID-19-Pandemie darstellen und erfährt daher hohes Maß an öffentlicher Aufmerksamkeit. Einführung und Umsetzung der COVID-19-Impfung gehen mit besonderen Herausforderungen einher, die bei der Impfdatenerfassung zu berücksichtigen sind. In diesem Kontext ist es Ziel des Projekts 'D...
covid-19 deutschland germany impfschutz-deckungsgrad mass-vaccination massenimpfung offene-daten open-data rki sars-cov-2 vaccination vaccination-coverage vakzination
Last synced: 17 Apr 2025
https://github.com/GoogleCloudPlatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
airflow bigquery cloud-composer cloud-native cloud-storage data-architecture data-engineering data-pipelines datasets google-cloud open-data
Last synced: 23 Apr 2025
https://github.com/googlecloudplatform/public-datasets-pipelines
Cloud-native, data onboarding architecture for Google Cloud Datasets
airflow bigquery cloud-composer cloud-native cloud-storage data-architecture data-engineering data-pipelines datasets google-cloud open-data
Last synced: 12 Apr 2025
https://github.com/kensho-technologies/qwikidata
Python tools for interacting with Wikidata
knowledge-graph open-data package python python3 wikidata wikimedia wikipedia
Last synced: 06 Apr 2025
https://github.com/BaseAdresseNationale/adresse.data.gouv.fr
Le site officiel de l'Adresse
Last synced: 23 Aug 2025
https://github.com/common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
asr dataset open-data open-datasets speech-recognition voice
Last synced: 06 Jul 2025
https://github.com/transitland/transitland-atlas
an open directory of mobility feeds and operators — powers both Transitland v1 and v2
gbfs gtfs gtfs-realtime gtfs-rt mds mobility open-data transit transitland transportation
Last synced: 16 May 2025
https://japan-opendata.github.io/awesome-japan-opendata/
Awesome Japan Open Data - 日本のオープンデータ情報一覧・まとめ
Last synced: 29 Aug 2025
https://github.com/okfn/dataportals.org
Open Data Portals and Sites around the world
csv json metadata open-data open-datasets open-knowledge-international
Last synced: 10 Jan 2026
https://github.com/Chicago/osd-bike-routes
Open source release of bike routes in Chicago.
bike-routes chicago geojson-data government open-data
Last synced: 22 Jul 2025
https://github.com/opengeos/open-buildings
Tools for working with open building datasets
buildings geoparquet geopython geospatial open-buildings open-data
Last synced: 24 Jul 2025
https://github.com/public-transport/friendly-public-transport-format
A format for APIs, libraries and datasets containing and working with public transport data.
fptf gtfs open-data public-transportation spec transport
Last synced: 23 Aug 2025
https://github.com/BIMCV-CSUSP/BIMCV-COVID-19
Valencia Region Image Bank (BIMCV) that combines data from the PadChest dataset with future datasets based on COVID-19 pathology to provide the open scientific community with data of clinical-scientific value that helps early detection of COVID-19
ai bimcv coronavirus-dataset covid deep-learning detection open-data padchest-dataset pneumonia rx scientific-community
Last synced: 20 Nov 2025
https://github.com/regardscitoyens/nosdeputes.fr
Repository of NosDéputés.fr : the french parliamentary monitoring website
civic-tech democracy open-data parliament parliamentary-data parliamentary-monitoring politics
Last synced: 30 Aug 2025
https://github.com/daq-tools/kotori
A flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple.
daq data-historization grafana historian internet-of-things iot-platform kotori-daq m2m mosquitto mqtt multi-channel multi-protocol open-data python scada sensor-network telemetry time-series visualization
Last synced: 12 Apr 2025
https://github.com/Chicago/osd-street-center-line
Open source release of street center lines in Chicago.
chicago geojson-data government open-data
Last synced: 15 May 2025
https://github.com/transitland/transitland-datastore
Transitland v1 core components. Deprecated and only maintained occasionally. See Transitland v2.
datasets geo gtfs open-data transit transportation
Last synced: 03 Apr 2025
https://github.com/opendatamcp/opendatamcp
Connect any Open Data to any LLM with Model Context Protocol.
Last synced: 09 Apr 2025
https://github.com/opentraffic/otv2-platform
An overview of the entire Open Traffic v2 platform and its components
open-data traffic-map traffic-statistics transportation
Last synced: 19 Jul 2025
https://github.com/datadotworld/data.world-py
Python package for data.world
api-client datasets dwstruct-t01-dist open-data reference-implementation
Last synced: 06 Apr 2025
https://github.com/ropensci/ckanr
R client for the CKAN API
api-wrapper ckan ckan-api open-data r r-package rstats
Last synced: 18 Dec 2025
https://github.com/openml/openml-r
R package to interface with OpenML
arff benchmarking benchmarking-suite classification cran data-science database dataset datasets machine-learning machine-learning-algorithms open-data open-science opendata openml openscience r regression reproducible-research statistics
Last synced: 01 Jul 2025
https://github.com/ropensci/fingertipsR
R package to interact with Public Health England’s Fingertips data tool
api-wrapper cran fingertips health open-data peer-reviewed public-health public-health-england r r-package rstats
Last synced: 29 Jul 2025
https://github.com/ruyut/taiwancalendar
紀錄中華民國政府行政機關辦公日曆表的 JSON 資料,內容包含日期、星期、是否放假、說明。
Last synced: 09 Apr 2025
https://github.com/datagouv/data.gouv.fr
Ce dépôt rassemble les tickets techniques qui portent sur data.gouv.fr.
france government-data open-data open-government support ticketing
Last synced: 12 Aug 2025
https://github.com/freecodecamp/open-api
freeCodeCamp's open-api Intiative
freecodecamp graphql lambda open-api open-data serverless
Last synced: 06 Oct 2025
https://github.com/okfn/opendataday
Open Data Day website
event events hackathon open-data open-data-day opendata opendataday translation
Last synced: 18 Jul 2025
https://github.com/openpotato/openholidaysapi.website
Website for the OpenHolidays API project
Last synced: 09 Apr 2025
https://github.com/technologiestiftung/giessdenkiez-de
The consequences of climate change, especially the dry and hot summers, are putting a strain on Berlin's ecosystem. Our urban trees are drying out and suffering long-term damage. Gieß den Kiez is made to enable coordinated citizen participation in the irrigation of urban trees.
berlin citylab-berlin community map open-data rain trees watering
Last synced: 05 Apr 2025
https://github.com/isaacus-dev/open-australian-legal-corpus-creator
The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.
australia corpus dataset datasets isaacus law legal open-data scraping web-scraping
Last synced: 11 Jul 2025
https://github.com/tdt/core
Transform any dataset into an HTTP API with The DataTank
datatank government-data open-data php
Last synced: 30 Dec 2025
https://github.com/okfn/okfn.github.com
Open Knowledge Labs website (and general issue tracker).
lab open-data open-knowledge-international
Last synced: 17 Dec 2025
https://github.com/InseeFrLab/pynsee
pynsee package contains tools to easily search and download french data from INSEE and IGN APIs
Last synced: 08 Apr 2025
https://github.com/OpenDataMCP/OpenDataMCP
Connect any Open Data to any LLM with Model Context Protocol.
Last synced: 22 Mar 2025
https://github.com/inseefrlab/pynsee
pynsee package contains tools to easily search and download french data from INSEE and IGN APIs
Last synced: 04 Apr 2025
https://github.com/lgervasoni/urbansprawl
Open framework for calculating spatial urban sprawl indices and performing disaggregated population estimates using open data
demographics dispersion geospatial gis land-use land-use-mix open-data openstreetmap overpass-api population-density python spatial-analysis sustainability sustainable-development-goals transportation urban urban-accessibility urban-dispersion urban-planning urban-sprawl
Last synced: 05 May 2025