An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/gappeah/british-airways-analysis

This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.

data data-analysis data-visualization tableau

Last synced: 11 Jun 2025

https://github.com/mascanho/ruddit

CLI to interact with Reddit's API to programatically retrieve data

cli data marketing rust rust-lang rustlang sales

Last synced: 19 Aug 2025

https://github.com/gappeah/cookie-company-visual-dashboard

This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.

dashboard data data-visualization excel microsoft-excel

Last synced: 25 Feb 2025

https://github.com/nia-cloud-official/influx

Influx is a powerful search engine application designed to provide access to personal information of individuals from anywhere in the world. With Influx, users can search for and retrieve personal details of people, enabling them to find and connect with individuals across the globe.

data find people-search search-engine

Last synced: 27 Jun 2025

https://github.com/labwhatever/leetcode

Collection of LeetCode questions to ace the coding interview!

data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning

Last synced: 22 Aug 2025

https://github.com/jerryfzhang/rockets

A Node + React App that displays space launch missions around the world.

bootstrap data expressjs less momentjs nodejs react reactjs reactstrap

Last synced: 10 Apr 2026

https://github.com/snitkin-lab-umich/prewas_manuscript_analysis

Manuscript in support of prewas software

data data-visualisation manuscript r

Last synced: 08 Jul 2025

https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent

Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.

agentic-ai analysis data deepseek langchain nba python streamlit visualization

Last synced: 08 May 2026

https://github.com/grkndev/twitcher

A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.

api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user

Last synced: 09 Mar 2026

https://github.com/miniql/miniql-express-mongodb-example

A MiniQL example for querying a MongoDB database through an Express REST API.

data database mongodb query query-language

Last synced: 19 Apr 2026

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/xrahul/android-logs

Get logs of various sensors and events in android 6.0+

android data events logs

Last synced: 20 May 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/xdrokra/road-accident-analytics

A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023

analytics data design italy javascript

Last synced: 30 Aug 2025

https://github.com/tatey/list_of_baby_names

A list of baby names given to tiny humans in Ruby

data names ruby

Last synced: 11 Nov 2025

https://github.com/n4ze3m/timezone-json

JSON file with more than 1642 cities timezone in UTC format.

data json timeszone

Last synced: 19 Jul 2025

https://github.com/marcelo-earth/h5n8-data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 19 Jan 2026

https://github.com/jackokring/www

Generic www flask server with phinka module

compression data flask phinka python

Last synced: 16 Jan 2026

https://github.com/horisystems/uk_ev_data_analysis

Analysis of Electric Vehicle charging infrastructure in the United Kingdom.

data data-science electric-vehicles ev python uk united-kingdom

Last synced: 12 Jan 2026

https://github.com/snimmagadda1/stack-exchange-dump-to-mysql

Batch pipeline to import Stack Exchange XML data dumps to relational DB

batch data mysql spring-batch stackoverflow

Last synced: 30 Mar 2025

https://github.com/cliffano/volothamp

Random D&D stuffs my son and I dabble with

data dungeons-and-dragons info little-godzilla

Last synced: 06 Apr 2025

https://github.com/ngambip/priscilla

About my work and Experience

accounting analytics data finance-management

Last synced: 03 Feb 2026

https://github.com/frequentlymisseddeadlines/chessfessor

Command line tool to extract game data from Lichess.org and Chess.com

chess data extract lichess pgn

Last synced: 19 May 2026

https://github.com/aditya172926/blockchain_indexers

Indexers to fetch data from blockchain events and transactions data with their parameters

blockchain data indexers rust

Last synced: 02 Aug 2025

https://github.com/francescodisalesgithub/data-for-developers

simple SQL database with problems and solution found on stackoverflow, documentation or chatgpt

chatgpt data database developer hacker hacking knowledge solutions sql targets

Last synced: 22 Mar 2025

https://github.com/devsujay19/knowledgebase

My knowledge base built with NextJS 14, Tailwind CSS 3 and Aceternity UI.

data knowledge-base nextjs nextjs-typescript nextjs14 react server-side-rendering tailwindcss vercel

Last synced: 10 Apr 2026

https://github.com/marxmit7/kaggle

Kaggle competitions

data kaggle kaggle-competition

Last synced: 19 May 2026

https://github.com/stdlib-js/array-base-reject

Return a shallow copy of an array containing only those elements which fail a test implemented by a predicate function.

array copy data filter generic javascript node node-js nodejs predicate reject stdlib structure test types

Last synced: 26 Dec 2025

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/stdlib-js/array-one-to

Generate a linearly spaced numeric array whose elements increment by 1 starting from one.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 26 Feb 2026

https://github.com/mattqdev/koalaz

Why don't use koalas as data mock? With this npm package you can!

data koala lorem-ipsum meme mock placeholder

Last synced: 13 Jan 2026

https://github.com/codenoid/webtoons.com-database

a Webtoons.com Database, collected by Hofesh Bot (Scrapper)

data database

Last synced: 28 Mar 2025

https://github.com/lmuffato/project-mysql-one-for-all-trybe

Projeto mysql one for all - Projeto avaliativo da Trybe do Bloco 21: Normalização e Modelagem de Banco de Dados

back-end data database database-modeling mysql mysqlworkbench query sql trybe-projects

Last synced: 08 May 2026

https://github.com/castdrian/kdapi

A TypeScript library that scrapes K-pop idol and group information from online sources to create comprehensive JSON datasets.

api data kpop scraper typescript

Last synced: 15 May 2025

https://github.com/stdlib-js/datasets-herndon-venus-semidiameters

Fifteen observations of the vertical semidiameter of Venus, made by Lieutenant Herndon, with the meridian circle at Washington, in the year 1846.

astronomy data dataset datasets grubbs herndon javascript node node-js nodejs outlier outliers sample statistics stats stdlib venus

Last synced: 09 Oct 2025

https://github.com/iguptashubham/walmart-eda

Imagine diving into the fascinating world of Walmart with just a few lines of code! This project lets you do that using MySQL, a powerful tool for data analysts. You can clean up messy data like a detective, uncovering hidden patterns and trends. Data scientists can take it further,.

analysis data dataset eda mysql portfolio-project python sql

Last synced: 10 Apr 2026

https://github.com/ilejuxepwaduzd/structured-data-extractor

🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.

cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data

Last synced: 10 Apr 2026

https://github.com/aranfononi/h4x0r-news-section-17-project

A SwiftUI-powered app that displays top stories from Hacker News. Users can open articles directly within the app, utilizing SwiftUI’s NavigationLink and custom WebView integration.

app-development data data-binding data-binding-library ios swift swiftui xcode

Last synced: 18 May 2026

https://github.com/prdktntwcklr/weatherman

A simple web app displaying environmental data from an SQLite database.

dashboard data flask sensor sqlite

Last synced: 19 May 2026

https://github.com/makepath/medaprep

medaprep is a data preparation and feature engineering toolkit for geospatial applications.

data data-science datacleaning eda exploratory-data-analysis xarray

Last synced: 29 Jun 2025

https://github.com/jrdnbradford/google-sheet-color-sort

Google Sheet-bound script that assists with sorting Google Sheet rows by background fill color

data excel google-apps google-apps-script google-sheet google-sheets javascript microsoft-excel sort-rows

Last synced: 14 Apr 2025

https://github.com/fairspec/fairspec-extension

Fairspec Extension is a Git repository template for rapid Fairspec extension development

ckan csv data dataset excel fair json ods polars python quality schema sqlite tabl typescript validation zenodo

Last synced: 20 Jan 2026

https://github.com/rremple/intervalidus

For all your interval-based data needs.

data intervals

Last synced: 21 Feb 2026

https://github.com/penspanic/datra

Datra is a comprehensive data management system for game development.

data game game-development gamedata unity unity-package unity3d-plugin

Last synced: 19 May 2026

https://github.com/thelich2112/bluesky-weather-poster

a Wordpress plugin that takes info from a clientraw.txt file and posts to Bluesky with variable options for posting.

data posting station weather wordpress

Last synced: 17 May 2026

https://github.com/stdlib-js/array-base-count-same-value

Count the number of elements that are equal to a given value in an array.

array count countif data javascript node node-js nodejs same stdlib structure sum summation total types

Last synced: 21 Apr 2026

https://github.com/themost-framework/memory

MOST Web Framework in-memory data adapter for testing environments

adapter data orm

Last synced: 01 Jul 2026

https://github.com/uvaio/datasets

Notebooks for data processing, scraping, machine learning

data dataset jupyter jupyter-notebook learning machine ml model ontology

Last synced: 21 Mar 2025

https://github.com/habedi/adbis-2023-paper

This repository hosts the code and data used for the experiments reported in the paper titled "Diversification of Top-k Geosocial Queries", published in ADBIS 2023

artifacts conference-paper data experiments graphs java research-paper

Last synced: 19 May 2026

https://github.com/coqui123/tradegpt

TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FASTAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹

analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript

Last synced: 11 Apr 2026

https://github.com/labgua/ilmeteo

Acquisizione dati dal sensore SHT71 e trasmissione in rete in Real-Time

acquisition data humidity humidity-sensor iot raspberry-pi real-time realtime rpi sht71 temperatura temperature temperature-sensor umidita web

Last synced: 24 Apr 2026

https://github.com/gcoronelc/ucv_gdi-1_202302-b2

Taller de Gestión de Datos e Información I con Gustavo Coronel.

data data-science data-structures database databases online oracle query relational-databases security sql sql-server

Last synced: 19 May 2026

https://github.com/mitevpi/vue-d3-bar-chart

Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.

d3 data data-visualization frontend interactive svg vue web

Last synced: 18 May 2026

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 29 Jun 2026

https://github.com/astrid-project/cb-manager

APIs to interact with the Context Broker's database. Through a REST Interface, it exposes data and events stored in the internal storage system in a structured way. It provides uniform access to the capabilities of monitoring agents.

agent beats control data ebpf elasticsearch log logstash management programmability security

Last synced: 30 Jun 2025

https://github.com/stdlib-js/ndarray-base-reverse-dimension

Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 07 Mar 2026

https://github.com/diddypod/crop-data-comparer

A Python script to compare crop data over years

comparison crop data openpyxl python

Last synced: 28 Jun 2026

https://github.com/wahyuwsslah/salary_prediction-aiml

Salary Prediction using Machine Learning with 3 Models. Linear Regression, Decision Tree, Random Forest

ai analytics data data-science datascience machine-learning python python3

Last synced: 19 May 2026

https://github.com/oefenweb/python-untraceables

Randomizes IDs for a given set of tables making them untraceable across environments

anonymize data database mysql privacy python python2 python3 randomization

Last synced: 03 Feb 2026

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/glassflow/pipelines-push-action

This Github Action lets you automate GlassFlow pipelines deployments as code

data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing

Last synced: 19 May 2026

https://github.com/seguradevinn/data-project

A healthcare data audit demo using CMS SynPUF and DuckDB, showing how raw claims are cleaned, validated, and transformed into a 2009 cohort with descriptives and a RADV-style chase list.

auditing cms data duckdb sql

Last synced: 02 Sep 2025

https://github.com/bastianolea/censo_viviendas

Censo de Viviendas procesado con R para disponibilizarlo con códigos/nombres de comunas, regiones, y etiquetas de sus variables. En formato original (6,5 millones de filas) y en conteo por comunas.

chile comunas data poblacion rural

Last synced: 30 Oct 2025

https://github.com/nesterenko-kv/object-id

ObjectIDs are a special type of identifier mainly used in MongoDB to uniquely identify documents within a collection. They consist of a 12-byte binary value that includes a timestamp, a machine identifier, a process identifier, and a counter.

c-sharp data id net object-id unique-identifier

Last synced: 16 May 2025

https://github.com/emnetdegafe/allesoverfilm-backend

AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.

backend data data-model express postgresql sequelize-cli

Last synced: 11 Apr 2026

https://github.com/spine-tools/metreload

Python application for downloading meteorological reanalysis data

data python reanalysis

Last synced: 01 Jul 2025

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/sksubhadeep/nashville-housing-data-cleaning-project-using-sql

SQL Data Cleaning Project on Nashville Housing Dataset

data datacleaning sql

Last synced: 19 Mar 2026

https://github.com/idea2app/public-meta-data

HTTP API for Public Meta Data, written in TypeScript & designed for CDN.

api cdn data http meta public typescript

Last synced: 15 Mar 2025

https://github.com/ttitcombe/timekeep

Defensive timeseries analysis in python

data data-science sklearn time-series time-series-analysis timeseries

Last synced: 05 Jan 2026

https://github.com/sambacha/yearn-finance-data

data repo for proposed YIP-DATA

cryptocurrency data erc20 ethereum exchange yearn yip yyip

Last synced: 18 May 2026

https://github.com/iosdec/adstorage

Automatic Data Storage - iOS

data ios objective-c public storage xcode

Last synced: 21 Mar 2025

https://github.com/pythongiant/data-analytics-wolfram-alpha

A data analysis porgram using wolfram alpha

analytics api data wolfram-alpha

Last synced: 04 Apr 2025

https://github.com/sermetpekin/perse

Perse is an experimental Python package that combines some of the most widely-used functionalities from the powerhouse libraries Pandas, Polars, and DuckDB into a single, unified DataFrame object. The goal of Perse is to provide a streamlined and efficient interface, leveraging the strengths of these libraries to create a versatile data handling.

data data-science data-structures duckdb pandas polars

Last synced: 09 May 2026

https://github.com/amyflo/cs448b

Exploring r/LoveLetters

d3-visualization d3js data react reactjs visualization

Last synced: 18 May 2026

https://github.com/chompfoods/stub-go-server

Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes

Last synced: 17 Apr 2026

https://github.com/openfoodfacts/openfoodfacts-corrector

Ruby script to correct and enhance data on OpenFoodFacts

correction data food ruby

Last synced: 24 Apr 2026

https://github.com/definetlynotai/test_generator

A tool to create datasets based on configurations from a csv file, This tool can be used as a skeleton for other software.

algorithim csv data development dynamic exam generator huge nirt powerful python skeleton test tools

Last synced: 21 Jul 2025

https://github.com/lamden/merk

A concise implementation of a merkle tree in Python.

crypto data hash merkle structure tree

Last synced: 27 May 2026

https://github.com/abdul-rafay19/youngdevinterns_machine-learning_tasks

This internship offers hands-on exposure to real-world Machine Learning applications — from data visualization and preprocessing to model development, evaluation, and deployment. It focuses on real ML workflows, problem-solving, neural networks, and hyperparameter tuning — all within a collaborative, remote, and growth-oriented environment.

ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data data-visualization internship machine-learning machine-learning-algorithms machinelearning ml model model-development neural-network preprocessing programming-language python task tasks youngdevintern

Last synced: 29 Apr 2026

https://github.com/benmaier/boarding_school_sir

Fit SIR dynamics to the prevalence curve of an H1N1 outbreak of a British boarding school in 1978.

boarding data disease epidemiology modeling school spreading

Last synced: 31 Mar 2025

https://github.com/richardschoen/sshnetibmi

This .Net/.Net Core class library is used to interface with existing IBM i database, program calls, CL commands, service programs and data queues via the PASE based xmlservice-cli PASE command program or regular qsh/bash commands. qsh/bash commands can be used to interface with any qsh/pase based utilities such as the IBM i db2util utility

as400 cl command csharp data db2 ddm dotnet drda ibm ibmi os400 pase program qcmdexc qcmdexec queue rpg xmlservice xmlservice-cli

Last synced: 04 Feb 2026

https://github.com/h2lsoft/validator

A library of validators values in multilanguage with CSRF protection

csrf csrf-protection data form php validator

Last synced: 04 Feb 2026