Projects in Awesome Lists tagged with public-data
A curated list of projects in awesome lists tagged with public-data.
https://github.com/caserec/Datasets-for-Recommender-Systems
A repository of high-quality, topic-centric public data sources for Recommender Systems (RS)
data-science database datasets public-data recommender-systems
Last synced: 20 Jul 2025
https://github.com/leftmove/wallstreetlocal
Free and open-source stock tracking website for America's biggest money managers.
13f celery collaborate docker fastapi investment javascript meilisearch mongodb nextjs nginx-proxy-manager public-data python redis redux sec
Last synced: 08 Apr 2025
https://github.com/upgini/upgini
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & commercial LLMs
automated-feature-engineering automl automl-pipeline chatgpt data-enrichment data-science feature-engineering feature-extraction feature-selection features kaggle kaggle-solution large-language-models llm machine-learning open-data open-datasets public-data python-library scikit-learn
Last synced: 15 May 2025
https://github.com/matheusrocha89/graphql-camara-deputados
GraphQL API with data from Brazil's Chamber of Deputies
brasil congresso dados-abertos deputados graphql javascript nodejs open-data political-science public-data
Last synced: 01 Feb 2026
https://github.com/CamaraDosDeputados/dados-abertos
Repository of the Chamber's Open Data (Dados Abertos) service. See the "Issues" for help with questions and suggestions.
api-service congresso dados-abertos deputados open-data opendata political-science politics public-data
Last synced: 14 Mar 2025
https://github.com/apis-is/apis
Making data readily available to anyone interested
api data iceland javascript node public-data
Last synced: 13 May 2025
https://github.com/c-3lab/dim
📦 dim: Manage the open data in your project like a package manager.
cli commads command-line-tool data dataops dim gpt gpt-3 llm opendata package-manager public-data public-dataset
Last synced: 17 Jan 2026
https://github.com/ajdamico/lodown
locally download and prepare publicly-available microdata
officialstatistics public-data rstats survey
Last synced: 18 Feb 2026
https://github.com/legalize-kr/precedent-kr
Manages South Korean court precedent data as a Git repository. Each precedent is a Markdown file whose Git commit date is its date of judgment.
case-law git-history korean-law korean-precedent legal-documents legalize-kr markdown public-data
Last synced: 28 May 2026
https://github.com/gadenbuie/mynorfolk-dash
MyNorfolk Dashboard: A Quarto Dashboards Demo
city-data open-data public-data quarto quarto-dashboard quarto-pub
Last synced: 07 Mar 2026
https://github.com/ondata/ckan-mcp-server
MCP server for querying CKAN open data portals (package search, DataStore SQL, organizations, groups, tags)
ai-tools api-client civic-tech ckan ckan-api claude cloudflare-workers data-discovery data-portal datastore government-data mcp model-context-protocol nodejs open-data public-data solr typescript
Last synced: 14 Apr 2026
https://github.com/andreluizbvs/PLAD
STN PLAD: A Dataset for Multi-Size Power Line Assets Detection in High-Resolution UAV Images
damper dataset deep-learning high-resolution insulator insulator-dataset object-detection power-delivery power-line-assets power-supply power-transmission-lines powerlines public public-data stockbridge tower uav uav-images unmanned-aerial-vehicle
Last synced: 21 Nov 2025
https://github.com/vhoulbreque/dafter
📥 Command-line downloader for public datasets
brew-style command-line data database dataset download fetcher linux osx public-data unix
Last synced: 22 Mar 2025
https://github.com/cre-dev/pub-data-visualization
The objective of this repository is to share, under an MIT license, the visualization tools used with public data and developed by the Wholesale Markets Surveillance Directorate (DSMG) of the Regulatory Commission of Energy (CRE). It can be used by end users such as developers and energy analysts.
cre electricity energy open-source public-data visualization
Last synced: 07 Mar 2026
https://github.com/albert221/mpra
Medicinal Products Registry API, from dane.gov.pl, with GraphQL.
medicine public-data public-data-api
Last synced: 11 Apr 2025
https://github.com/brannondorsey/chirp-files
A collection of notable radio frequencies near Philadelphia PA and beyond
amateur-radio baofeng chirp chirpradio ham-radio philadelphia public-data radio
Last synced: 04 Mar 2026
https://github.com/evancarroll/db-texas-ethics-commission
A schema loader for the Texas Ethics Commission
campaign-contributions campaign-finance campaign-finance-data data-import government-data lobbying open-data postgresql public-data texas
Last synced: 14 Sep 2025
https://github.com/stritti/thermal-solar-plant-dataset
Realtime Thermal Solar Plant Dataset for Machine Learning
dataset examples iot machine-learning opendata public-data research smarthome training-data
Last synced: 27 Jan 2026
https://github.com/fairfield-programming/openlist
🦋 The definitive list of open-source, source-available, and commercial licenses.
api api-rest awesome awesome-list json license license-management licenses list open-data open-source openapi opensource public public-api public-data xml yaml
Last synced: 30 Dec 2025
https://github.com/defgsus/office-schedule-scraper
Scraper for free appointment dates at (German) public offices
appointment-scheduling historical-data public-data scraper social-data timeseries web-scraping webscraping
Last synced: 06 May 2025
https://github.com/groovy-sky/azure-ip-ranges
Stores Azure DC IP addresses
automation azure cloud github-actions ip microsoft microsoft-azure network public-data
Last synced: 25 Dec 2025
https://github.com/markusvalo/HSLtraffic
Scripts to create a PostgreSQL database for HSL GTFS-data
avoindata finland gtfs gtfs-files gtfs-transit-feed helsinki hsl joukkoliikenne open-data postgis postgresql public-data public-transport public-transportation qgis sql timemanager timeseries
Last synced: 01 Aug 2025
https://github.com/etalab-ia/mediatech
Collection of public datasets from the French administration, vectorized and ready to use in AI projects.
datasets embeddings huggingface public-data
Last synced: 04 Apr 2026
https://github.com/navchandar/civic-media-scout
Civic Media Scout compiles contact information from government websites for user-friendly public access
accessibility civic-engagement civic-tech contact-information data-compilation opendata public-data user-friendly
Last synced: 14 Jul 2025
https://github.com/brannondorsey/spectrum-wrangler-docker
A Dockerized version of Spectrum Wrangler that downloads and geo indexes public FCC license data
amateur-radio database docker docker-compose fcc fcc-data fcc-license postgis postgresql public-data radio
Last synced: 03 May 2025
https://github.com/auroramaurizio/surfr
An R package to identify cell membrane marker genes from bulk RNA sequencing data
dge enrichment-analysis metaanalysis plots proteins public-data rnaseq rpackage surface surfaceome
Last synced: 19 Feb 2026
https://github.com/legalize-kr/ordinance-kr
Manages South Korean local government ordinances as a Git repository. Each ordinance is a Markdown file, and each amendment is a Git commit dated with its actual promulgation date.
git-history korean-law korean-ordinances legal-documents legalize-kr local-government markdown public-data
Last synced: 28 May 2026
https://github.com/krabina/open-spending-austria
Open Spending Austria
austria finance municipalities openspending public public-data spending
Last synced: 06 Feb 2026
https://github.com/jjchern/meps.hc
MEPS Household Component Annual Consolidated Data Files
ahrq data-package meps public-data survey-data
Last synced: 19 Feb 2026
https://github.com/legalize-kr/admrule-kr
Manages South Korean administrative rules as a Git repository. Each administrative rule is a Markdown file, and each amendment is a Git commit dated with its actual issuance date.
administrative-rules git-history korean-administrative-rules korean-law legal-documents legalize-kr markdown public-data
Last synced: 28 May 2026
https://github.com/harvard-lil/duckdb-warc
DuckDB extension for reading web archive files in WARC format
Last synced: 31 May 2026
https://github.com/danlessa/cetesb-air-quality
Repository for scraping and analyzing air pollution data from the State of São Paulo by using the CETESB website
air-quality cetesb data-mining public-data sao-paulo
Last synced: 02 Apr 2025
https://github.com/numerique-gouv/statistiques-impact
Website of impact statistics for the services of the DINUM Operator
Last synced: 05 Oct 2025
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/superchordate/data-viz-talk
Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.
data-visualization eda exploratory-data-analysis public-data
Last synced: 29 Jul 2025
https://github.com/danlessa/mapa-interativo-onibus-rodoviario
A map that shows all destinations and origins for a selected city and/or bus company. Great for finding out how far you can go on a bike trip or for picking a random destination!
brazil bus geospatial-visualization interactive-visualizations public-data
Last synced: 12 Aug 2025
https://github.com/luka-j/upisscraper
Crawls upin.mpn.gov.rs for data.
data-mining data-scraper java public-data
Last synced: 26 Apr 2026
https://github.com/taihei-05/siglume-personal-apis
Open-source Siglume Agent API projects, focused on turning starter APIs into practical agent-ready tools.
agent-api ai-agent api-store fastapi japanese open-source public-data python siglume subsidy
Last synced: 13 May 2026
https://github.com/civicdatalab/assam-tenders-data
Data mining repo for Open Contracting - Assam
ocds open-data public-data public-finance tenders
Last synced: 10 Apr 2025
https://github.com/sbauwow/atlas-tx
Texas county intelligence for water, permits, public evidence, MCP, and field verification.
agent-skill civic-tech mcp nextjs public-data texas water-quality
Last synced: 08 Jun 2026
https://github.com/jjchern/meps.panel
MEPS Two-Year Longitudinal Files
ahrq data-package meps public-data survey-data
Last synced: 19 Feb 2026
https://github.com/eehwan/courtauctioncrawler
A Python-based crawler that automatically collects auction listing information from the South Korean Supreme Court's real estate auction system.
auction korea public-data python real-estate scrapy web-crawler
Last synced: 07 Jul 2025
https://github.com/lucalullo/monitoring-healthcare-waiting-times-puglia
Monitoring and analysis of public healthcare waiting times in Puglia (Italy), 2024 — based on official open data
data-analysis healthcare italy jupyter-notebook kaggle open-data pandas public-data puglia time-series waiting-times
Last synced: 08 Jan 2026
https://github.com/apancoast/healthcare-deserts-and-public-transit
This dbt-based project aims to analyze the intersection of healthcare accessibility and public transit coverage in Mecklenburg County, NC.
analysis analytics-engineering data-engineer dbt healthcare hpsa hrsa public-data public-transit
Last synced: 27 Oct 2025
https://github.com/joacosnchz/rosariocomollegoios
Mobile application for checking the status of public transport in the city of Rosario. Developed with Angular.
angular comollego public-apis public-data public-transport rosario
Last synced: 09 Apr 2025
https://github.com/mboula/mboula.github.io
GitHub portfolio + interactive resume | Showcasing data projects in civil rights (housing), cannabis, and analytics
cannabis case-study civil-rights compliance dashboards data-analysis data-cleaning data-vizualization excel google-data-analytics housing open-data pattern-analysis portfolio pro-se public-data r sql tableau
Last synced: 10 Jul 2025
https://github.com/jacobpstein/georgetowncrimeanalysis
This is a repo for the Mean Streets Georgetown Data Science capstone project group.
crime public-data service-delivery
Last synced: 19 Jul 2025
https://github.com/vendethiel/bigcity
Random Objective-C project...
geolocation government-data ios objective-c public-data
Last synced: 22 Mar 2025
https://github.com/jjchern/meps.prpl
MEPS Person Round Plan Public Use Files
ahrq data-package meps public-data survey-data
Last synced: 19 Feb 2026
https://github.com/av1m/datagovuk-scraper
Scrape public data from data.gov.uk without an API key
public-data python-scraper python3 scraper
Last synced: 25 May 2026
https://github.com/fieldcure/fieldcure-mcp-publicdata
MCP server for Korean public data APIs (data.go.kr) — discover, inspect, and call 80,000+ government APIs
ai csharp data-go-kr dotnet korea mcp mcp-server open-api public-data
Last synced: 25 May 2026
https://github.com/vivshaw/4-19-public-data-talk
4/19 talk on public datasets given at Burlington Data Science
data-science folium geography javascript leaflet mashup public-data python talks
Last synced: 10 Apr 2026
https://github.com/fsouza99/american-baby-names
An exploration of public data about names given to American newborns, seeking to answer some questions through PostgreSQL's capabilities.
data-science plpgsql postgresql public-data sql
Last synced: 31 Jan 2026
https://github.com/civicdatalab/datadialogues-assam
Slide decks for the Data Dialogues event in Guwahati, Assam [March 21-22, 2022].
laws opendata public-data workshop-materials
Last synced: 28 May 2026
https://github.com/nathadriele/cnpj-data-pipeline
The CNPJ Data ETL Pipeline is designed to automate the download, processing, and storage of public CNPJ data from the Brazilian Federal Revenue. The pipeline is built with Mage.ai and AWS S3 to ensure efficient data management and scalability.
amazon-s3 cloud-computing cnpj-data data-engineering data-pipeline data-processing data-storage etl-pipeline mage-ai public-data script-automation
Last synced: 24 Mar 2025
https://github.com/danlessa/sao-paulo-theft-dataset
Crime bulletins for thefts in the State of São Paulo, January 2001 to January 2021
brazil crime crime-data public-data sao-paulo theft
Last synced: 30 Oct 2025
https://github.com/frne/public-datasets
This is just a collection of public data I collected / processed. Feel free to use it...
csv data-science ehealth opendata public-data
Last synced: 16 Feb 2026
https://github.com/cherukuri-thanu/ufo-sightings-analysis
This repository contains configuration files for finding patterns in UFO sightings.
dashboard geospatial-analysis power-bi public-data time-series-analysis
Last synced: 02 Mar 2026
https://github.com/yeongseon/kpubdata-studio
Korea Public Data Studio — Visual interface for authoring and running KPubData builder workflows
dashboard korea kpubdata nextjs public-data typescript
Last synced: 07 Jun 2026
https://github.com/yeongseon/kpubdata-builder
Korea Public Data Builder — Dataset artifact pipeline that turns normalized records into publishable outputs
data-pipeline dataset-builder korea kpubdata public-data python
Last synced: 07 Jun 2026
https://github.com/chiefy/quickdraw-playground
SPA and simple API to visualize and investigate New York State Quick Draw data
public-data public-dataset quasar spa vuejs
Last synced: 06 May 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026