An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/spatialcurrent/go-flat

Recursively flatten a slice of slices.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/spatialcurrent/go-counter

Simple library and command line program for generating frequency distributions.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/nasa-pds/nucleus

Nucleus is a software platform used to create workflows for the Planetary Data (PDS).

data ingestion pds planetary workflow

Last synced: 06 Feb 2026

https://github.com/audeering/datasets

Data cards for public audb datasets

audb audio data management

Last synced: 29 Jan 2026

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/aimin-nur/data-analyst

Sebuah project Data Analyst (Mechine Learning) untuk melakukan analisa harga mobil bekas Ford berdasarkan dataset yang sudah ada, serta mengetahui apa saja feature atau kolom yang mempengaruhi harga mobil bekas Ford.

analytics data mechine-learing visualization

Last synced: 29 Jan 2026

https://github.com/aimin-nur/data-analyst-model-predictive

Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.

analyst data mechine-learning visualization

Last synced: 29 Jan 2026

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/tpltnt/wir_vs_virus_hackathon_projects

A list of all projects / challenges for the WirVsVirus hackathon as CSV

coronavirus csv data hackathon raw-data

Last synced: 29 Jan 2026

https://github.com/apigear-io/template-qtcpp

QtC++ technology template

data plugin qml qt qt5

Last synced: 25 Feb 2026

https://github.com/thedevreda/jadaerospace

A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/dfsp-spirit/neuroimaging_testdata

Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.

data unittesting

Last synced: 25 Feb 2026

https://github.com/chenxingqiang/modeling_tabular_data

# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.

data modeling tabular

Last synced: 30 Jan 2026

https://github.com/anct-cartographie-nationale/mednum-cli

✨ Interface en ligne de commande pour la transformation des données de lieux de médiation numériques collectées dans un format non standard vers le schéma de la mednum et leur publication sur data.gouv

anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation

Last synced: 02 Aug 2025

https://github.com/plurid/defocus

Apophatic User Content Resolution [Desearch Concept]

data

Last synced: 08 Nov 2025

https://github.com/bearaujus/bdatamatrix

Structured Tabular Data Management in Go

data go golang matrix

Last synced: 30 Jan 2026

https://github.com/restricted/redis-data-cache

TypeScript implementation of data cache management by class name

cache data object redis state typesript

Last synced: 30 Jan 2026

https://github.com/plurid/delog

Cloud Service for Centralized Logging

cloud data logging

Last synced: 08 Nov 2025

https://github.com/bhojpur/dlm

The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.

data lifecycle-management

Last synced: 19 Feb 2026

https://github.com/chompfoods/stub-scala-akka-http-server

Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server

Last synced: 15 Apr 2026

https://github.com/plurid/datasign

Single Source of Truth Data Contract Specifier

data file-format

Last synced: 08 Nov 2025

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/nushratjabenaurnima/cse_477_data_mining

A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.

data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3

Last synced: 09 Apr 2026

https://github.com/bubblymaps/bubblymaps

The open source bubbler map. Mapping the world's water fountains. Open Code, Open Data.

bubbler bubbly-maps data fountain map open-source water

Last synced: 31 Jan 2026

https://github.com/opdev1004/totjs

Not totally new but a file format for managing human readable data in a file. JS version.

data data-storage data-store database database-management hacktoberfest hactoberfest-accepted nodejs

Last synced: 31 Jan 2026

https://github.com/opendatach/alds

a colaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office

awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata

Last synced: 31 Jan 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025

https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice

A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.

data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks

Last synced: 02 Aug 2025

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/pythoncoderunicorn/jamesbeardaward

a repo for James Beard Award data

data dataset jamesbeard

Last synced: 07 Feb 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/ms140569/loki-example-store

Testdata for loki password manager

data

Last synced: 26 Feb 2026

https://github.com/canadaluke888/speedtable

Ultra-fast terminal table renderer written in C

c data datasets fast python python-wrapper python3 tables

Last synced: 01 Mar 2026

https://github.com/drostlab/biodbretrievr

Retrieve and efficiently index entire biological sequence databases

biological-data biological-sequences data databasestoring retrieval

Last synced: 26 Feb 2026

https://github.com/mehmetkahya0/earthquake-tracker

Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.

ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web

Last synced: 15 Apr 2026

https://github.com/aaronspindler/selfdrivingcar

Learning deep learning and making a self driving car in the process

car data deep deep-learning driving keras learning machine machine-learning python self self-driving-car

Last synced: 09 Apr 2026

https://github.com/mahtabranjbar/onlineshopping_analysis_dashboard

This project analyzes online shopper behavior using various machine learning models and EDA techniques.

dashboard data dataanalysis eda machine-learning streamlit

Last synced: 08 Feb 2026

https://github.com/gman-au/white-knight

Experimental .NET data abstraction using specification pattern

abstractions data datastore dotnet repository-pattern specification-pattern

Last synced: 17 Mar 2026

https://github.com/abhishekn1947/samgov-scraper

Automated Python scraper for sam.gov contracts

analytics automation aws data pandas postgresql rds selenium webscraper

Last synced: 09 Apr 2026

https://github.com/danicaalana/wine-dataset-decision-tree

This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.

data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset

Last synced: 18 Apr 2026

https://github.com/kayahr/datastream

Data stream classes for writing and reading all kinds of data types, even single bits

data datastream input output stream typescript

Last synced: 01 Aug 2025

https://github.com/creativecuriositystudio/cruddle

(DEPRECATED) Simplifying CRUDL screen development using ModelSafe

angular2 crud data html model typescript ui web

Last synced: 09 Apr 2026

https://github.com/bishtrishu/netflix_movies_dashboard

This project is a comprehensive dashboard for analyzing Netflix movies and shows. Using a combination of Power BI, Python, and Excel, this dashboard provides insights into various aspects of Netflix's content library.

ai artifical-intelligense dashboard data dataanalysis dataanalyst dataanalytics datacleaning datahandling datascience datavisualization excel machine-learning msexcel powerbi report

Last synced: 09 Feb 2026

https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore

A dataware house is generated for streaming data of a superstore using extended mesh join by Syed Husnain Haider Bukhari

data data-science data-warehousing meshjoin

Last synced: 23 May 2026

https://github.com/bastianolea/servel_elecciones_core

Resultados electorales desde Servel (2024)

chile comunas data elecciones genero

Last synced: 01 Aug 2025

https://github.com/myles-parfeniuk/esp32_sdlogger

C++ esp-idf driver component for SD cards interfaced via SPI. WIP

card data esp-idf esp32 logger sd sdcard sdmmc sdspi spi

Last synced: 09 Feb 2026

https://github.com/cunfuu/network-bubbles

For Easier to manage organizations and keeping notes about them to organize events and easy access their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/ludreinsalvador/global-covid-19-data-analysis

Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.

analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization

Last synced: 26 Feb 2026

https://github.com/dysnomia-studio/achieve-games-dump

Dump parts of achieve.games database to public including Steam Games List

data dump games steam steam-api steam-game steam-games

Last synced: 27 Feb 2026

https://github.com/enescidem/twitter-topic-modeling

Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.

data data-science machine-learning nlp topic-modeling twitter x

Last synced: 10 Feb 2026

https://github.com/softloud/spunk

Nutritional interventions for male infertility: a systematic review and meta-analysis

cochrane data evisynth living

Last synced: 18 Mar 2026

https://github.com/dms-codes/scrape-tokoalvabet-com

Toko Alvabet Data Scraping and Price Comparator This Python script is designed to scrape data from Toko Alvabet's website and perform price comparison for the obtained products. It includes features for viewing and analyzing product data, as well as comparing prices with other sellers.

data price python scraping

Last synced: 29 Jul 2025

https://github.com/paladini/aa-daily-reflections-database

Alcoholics Anonymous (AA) Daily Reflections in English, Spanish, French and Brazilian Portuguese

aa alcoholics-anonymous daily-reflections data database reflections

Last synced: 16 Apr 2026

https://github.com/abhinavrobinson/mc-community-world

Minecraft community world data.

data minecraft server world

Last synced: 27 Feb 2026

https://github.com/vatshayan/songs-datasets

Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view

1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs

Last synced: 18 Mar 2026

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/beastbytes/n6l-phone-number-data-php

NationalPhoneNumerInterface implementation using PHP for storage

data itu-t0202 phone-number php yii3

Last synced: 08 Feb 2026

https://github.com/gaemapiracicaba/norma_dec_8468-76

Padrões de qualidade e lançamento de efluentes de águas interiores

data python

Last synced: 19 Apr 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Sistema de Indicadores de Calidad de Vida Rural (Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/sweta-kaundilya/power-bi-learning-projects

This repository contains completed exercises while learning Power BI

data datavisualization dax powerbi powerquery

Last synced: 27 Feb 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/solrikk/bluemoon

This project is a Go language tool designed to automatically download, process, and save product data from a remote server into a CSV file.

analyze converter data go golang xml-parser

Last synced: 31 Jul 2025

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/pbinkley/tweets-national-emergency-library

A twarc harvest of tweets related to Internet Archive's National Emergency Library (2020-03-23 to 2021-02-13)

data social

Last synced: 11 Feb 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics

Découvrir les indicateurs de performance liés aux avis d'un sérum très réputé de la marque de beauté luxe japonaise Shiseido. Cette comparaison concerne les sites web USA et FR 💯

analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins

Last synced: 11 Apr 2026

https://github.com/moeabbas6/bq_data_loader

A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.

bigquery data python

Last synced: 24 Mar 2025

https://github.com/mikeqfu/network-rail-track-fixity-layer

This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.

data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity

Last synced: 02 Sep 2025

https://github.com/murshidazher/client-side-data-storage

🚌 A workspace containing client-side data storage implementations

cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql

Last synced: 02 Sep 2025

https://github.com/sasanthns/sql_data_warehouse_project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver

Last synced: 24 Mar 2025

https://github.com/heyimsteve/solnftdatadash

This a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.

dashboard data hellomoon nft react solana solana-nft

Last synced: 30 Jan 2026

https://github.com/nolanbconaway/rollercoaster-tycoon-data

Every roller coaster I have built in RCT2 for iPad

data roller-coaster-tycoon

Last synced: 24 Mar 2025

https://github.com/bdr-pro/graphyml

A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.

data database yaml yml

Last synced: 23 Jul 2025

https://github.com/rubyonworld/ldpath

This is a ruby implementation of LDPath, a language for selecting values linked data resources.

data ldpath resource ruby

Last synced: 12 Nov 2025

https://github.com/plateformeio/docs

The official documentation of the Plateforme framework

api app asgi async data db docs fastapi plateforme pydantic python restx services sqlalchemy

Last synced: 11 Apr 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/jameshenderson12/chatbot-utils

Generic data and elements that can be reused or repurposed for chatbot development.

boilerplate chatbot data development elements intents template utterances

Last synced: 04 Mar 2026