An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/opendatach/alds

a colaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office

awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata

Last synced: 31 Jan 2026

https://github.com/nikolatechie/spotify-playlist

Data pipeline that fetches recently played songs in the past 24 hours using Spotify API and saves the data in the SQLite database. Scheduled to run daily using Apache Airflow.

apache-airflow api data data-engineering python spotify sql sqlite

Last synced: 30 Apr 2026

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm

Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.

algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model

Last synced: 01 Sep 2025

https://github.com/axetroy/stone

build data stuck like a stone, Sturdy!

axetroy data stone stuck

Last synced: 04 Jul 2025

https://github.com/juanpablodiaz/beertv

A Next.js Full Stack app to displays funny Beer TV Ads

api-routes data next tailwindcss

Last synced: 07 May 2026

https://github.com/miozilla/pandas

pandas :panda_face::panda_face: : Python Library # Data Analysis # Dataframe

analysis data dataframe pandas python sqlite3

Last synced: 07 May 2026

https://github.com/nrrso/ex_quickfs

A wrapper / elixir client / SDK to access the quickfs.net API.

data elixir financial financial-data

Last synced: 04 Sep 2025

https://github.com/shantanujpk/bigdatacloud

Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.

big-data data jupyter-notebook pipeline pyspark python spark sparksql

Last synced: 07 May 2026

https://github.com/frer0t/userverse

creating api for data analysis

data data-analytics spring-boot users

Last synced: 12 Apr 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/pythoncoderunicorn/jamesbeardaward

a repo for James Beard Award data

data dataset jamesbeard

Last synced: 07 Feb 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/rissh/titanicsurvivalpredictionusingml

Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢

data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic

Last synced: 01 Feb 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/yash-chauhan-dev/sf_analytics

Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.

aiml analytics data dbt langchain llm python snowflake streamlit

Last synced: 07 May 2026

https://github.com/ms140569/loki-example-store

Testdata for loki password manager

data

Last synced: 26 Feb 2026

https://github.com/drostlab/biodbretrievr

Retrieve and efficiently index entire biological sequence databases

biological-data biological-sequences data databasestoring retrieval

Last synced: 26 Feb 2026

https://github.com/mehmetkahya0/earthquake-tracker

Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.

ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web

Last synced: 15 Apr 2026

https://github.com/ymorsi7/quranicvisualization

A visual exploration tool for the Holy Quran using D3.js treemaps.

css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization

Last synced: 15 Apr 2026

https://github.com/gman-au/white-knight

Experimental .NET data abstraction using specification pattern

abstractions data datastore dotnet repository-pattern specification-pattern

Last synced: 17 Mar 2026

https://github.com/berviantoleo/bervdata

Temporary data definition as db

data

Last synced: 01 Apr 2025

https://github.com/itrauco/data-dirtying-tool

a simple command line tool to generate dirty data and do common data things in google cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/michaelfromyeg/lyrics

Lyric-store and API hosted on Git.

data lyrics

Last synced: 08 Feb 2026

https://github.com/danicaalana/wine-dataset-decision-tree

This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.

data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset

Last synced: 18 Apr 2026

https://github.com/cassandrajm/reddit-dashboard

INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance

capstone data data-analysis data-science politics python reddit

Last synced: 09 Apr 2025

https://github.com/matt-dray/draytasets

:1234::disguised_face: Miscellaneous datasets I've collected or prepared

card-games data phd pokemon

Last synced: 09 Feb 2026

https://github.com/myles-parfeniuk/esp32_sdlogger

C++ esp-idf driver component for SD cards interfaced via SPI. WIP

card data esp-idf esp32 logger sd sdcard sdmmc sdspi spi

Last synced: 09 Feb 2026

https://github.com/denisecase/620-mod6-web-scraping

Notes on how to get started scraping content from the web

beautifulsoup4 data mining python

Last synced: 11 Apr 2025

https://github.com/metapsy-project/data-panic-psyctr

Database of psychotherapy for panic disorder compared to control conditions

data

Last synced: 18 Mar 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/saikatharryc/motionchart-d3js

A dynamic Motion chart Built with D3 js.

chart d3js data data-science

Last synced: 23 Dec 2025

https://github.com/neurazum-ai-department/tumor-stages-dataset---v1

Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.

brain data dataset datasets image mri neuroscience tumor tumor-segmentation

Last synced: 18 Mar 2026

https://github.com/hudson-newey/data-miner

A simple data miner that collects information from an API and stores it in a file

api api-client big-data bigdata data logger logging

Last synced: 10 Jun 2026

https://github.com/dysnomia-studio/achieve-games-dump

Dump parts of achieve.games database to public including Steam Games List

data dump games steam steam-api steam-game steam-games

Last synced: 27 Feb 2026

https://github.com/enescidem/twitter-topic-modeling

Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.

data data-science machine-learning nlp topic-modeling twitter x

Last synced: 10 Feb 2026

https://github.com/denisecase/buzzline-05-case

Kafka pipelines with data storage

consumer data kafka producer python

Last synced: 11 Apr 2025

https://github.com/chardos/get-git-data

Access git repository data in node.

data git javascript node

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/softloud/spunk

Nutritional interventions for male infertility: a systematic review and meta-analysis

cochrane data evisynth living

Last synced: 18 Mar 2026

https://github.com/paladini/aa-daily-reflections-database

Alcoholics Anonymous (AA) Daily Reflections in English, Spanish, French and Brazilian Portuguese

aa alcoholics-anonymous daily-reflections data database reflections

Last synced: 16 Apr 2026

https://github.com/acovaci/orbit

ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT

cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs

Last synced: 16 Mar 2025

https://github.com/abhinavrobinson/mc-community-world

Minecraft community world data.

data minecraft server world

Last synced: 27 Feb 2026

https://github.com/jneidel/nationalities

Dataset of 100 common nationalities

data dataset json nationalities nationality opendata

Last synced: 25 Mar 2025

https://github.com/keminghe/osu

Unofficial and publicly-available NPM data-package about The Ohio State University.

college data majors ohio-state organizations public students university unofficial

Last synced: 06 Jan 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Sistema de Indicadores de Calidad de Vida Rural (Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/sweta-kaundilya/power-bi-learning-projects

This repository contains completed exercises while learning Power BI

data datavisualization dax powerbi powerquery

Last synced: 27 Feb 2026

https://github.com/zoetrope69/website

:tada: my website

data javascript personal

Last synced: 12 Jun 2025

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/dineshram0212/youtube-analysis

This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.

data data-visualization pandas python webscraping youtube-api-v3

Last synced: 19 Jun 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/bkataru/spotigo

AI-powered local music intelligence platform with a task runner server core to retrieve and backup spotify account data to storage(s) at set periodic intervals

ai backup cron data go intelligence local-llm music ollama rag runner spotify task-runner tool-calling

Last synced: 16 Jan 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/filipnet/infoscreen

Arduino subscribes values by MQTT and view info on an OLED I2C display

arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation

Last synced: 12 Apr 2026

https://github.com/aidan-zamfir/the-iliad

Data analysis & relationship network for the characters of Homers Iliad

data data-analysis dataframes networks networkx python selenium spacy webscraping

Last synced: 08 May 2026

https://github.com/apostolissiampanis/weather-app-api

WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.

api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux

Last synced: 05 May 2026

https://github.com/project-renard/test-data

Files for testing

data

Last synced: 27 Feb 2026

https://github.com/diegoperea20/datos-secuenciales-con-ia

Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes

ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao

Last synced: 06 Feb 2026

https://github.com/afeiship/next-object-operator

Object set/get/sets/gets and other operator.

data get gets next operator set sets store

Last synced: 27 Feb 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/kirillsemyonkin/lsd

LSD (Less Syntax Data) configuration/data transfer format.

configuration data java parsing rust

Last synced: 27 Feb 2026

https://github.com/zsvoboda/olympics

Self service analytics of 120 years of Olympics data

analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports

Last synced: 08 May 2026

https://github.com/nadahamdy217/movies-data-etl-using-python-gcp

Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.

analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn

Last synced: 06 Jan 2026

https://github.com/powersyang/visualization

data visualization templates 数据可视化模板

data templates visualization

Last synced: 24 Mar 2025

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/lorenzobloise/client_satisfaction_classification

Jupyter notebook in which satisfaction from clients reviewing European hotels is analyzed using Python libraries such as pandas, numpy and scikit-learn. Various classification models are trained and tested to predict client satisfaction.

classification data data-mining jupyter jupyter-notebook machine-learning pandas python

Last synced: 21 Feb 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/ournet/videos-data

Ournet videos data module

data ournet video videos

Last synced: 04 Apr 2025

https://github.com/randomfractals/unfolded-map-snippets

Html, CSS, JavaScript, and Python 🐍 vscode snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/lckylke/vizweb

Web application for data visualization:)

data expressjs nextjs web

Last synced: 08 May 2026

https://github.com/etmendz/mendz.data

Provides tools and guidance for creating data access contexts and repositories.

context data datasettings entity-framework mendz paginginfo repository resultinfo

Last synced: 11 Jun 2025

https://github.com/bastianolea/mineduc_personal_academico

Datos de Personal Académico, entre los años 2008 y 2024, del sistema de Educación Superior.

chile data educacion meses tiempo

Last synced: 19 Jun 2026

https://github.com/mnazlukhanyan/da-projects

Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт

data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization

Last synced: 11 Apr 2026

https://github.com/krishkumar/scrobbles

all the music 🎸

data music scrobble

Last synced: 13 Feb 2026

https://github.com/purarue/scramble-history

parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs

cstimer cubing data rubiks-cube speedsolving

Last synced: 11 Jun 2025

https://github.com/matthewgferrari/covid-contextualizer

A Coronavirus Contextualizer for the USA

data react visualization

Last synced: 26 Jun 2026