An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kedarvj/mysql-random-data-generator

This is the easiest MySQL random test data generator tool. Load the procedure and execute to auto detect column types and load data.

data dataset dummy-data dummy-data-generator mysql procedure random-generation testdata testdatabuilder

Last synced: 16 Mar 2025

https://github.com/pydap/pydap

A Python library implementing the Data Access Protocol (DAP, aka OPeNDAP).

dap data dods opendap science

Last synced: 21 Oct 2025

https://github.com/asad70/wallstreetbets-sentiment-analysis

This program finds the most mentioned ticker on r/wallstreetbets and uses Vader SentimentIntensityAnalyzer to calculate the sentiment analysis.

algotrading analysis data docker-container reddit sentiment-analysis trading vader-sentimentintensityanalyzer wallstreetbets wallstreetbets-sentiment-analysis

Last synced: 23 Oct 2025

https://github.com/Appsilon/data.validator

validate your data and create nice reports straight from R

data r reporting rhinoverse rstudio validation

Last synced: 06 May 2025

https://github.com/openstates/people

Curated information on all state legislators & governors.

data hacktoberfest legislators united-states-data

Last synced: 15 Apr 2026

https://github.com/Tauffer-Consulting/domino

User friendly and open source platform for workflow creation and monitoring

ai airflow containers data gui kubernetes open-source python workflows

Last synced: 22 Apr 2025

https://github.com/ingoscholtes/pathpy

pathpy is an OpenSource python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models

analysis data data-mining graph graphical-models machine-learning model-selection multi-order network-analysis networks pathways python sequential-data temporal-correlations temporal-networks

Last synced: 25 Sep 2025

https://github.com/leinelissen/aeon

๐Ÿ“ก Scan the internet for your personal information and modify or remove it

data electron gdpr git

Last synced: 05 Apr 2025

https://github.com/robjhyndman/fpp3

All data sets required for the examples and exercises in the book "Forecasting: principles and practice" (3rd ed, 2020) by Rob J Hyndman and George Athanasopoulos <http://OTexts.org/fpp3/>. All packages required to run the examples are also loaded.

cran data forecasting r

Last synced: 13 Feb 2026

https://github.com/petera2c/simple-table

Lightweight React data grid/table for fast, modern web apps

component data grid javascript library react simple typescript

Last synced: 02 Feb 2026

https://github.com/bps-statistics/stadata

STADATA is a Python package that simplifies access to statistical data provided by BPS - Statistics Indonesia

data data-analytics data-science national-statistics nso official-statistics python python-package statistics

Last synced: 14 Oct 2025

https://github.com/gbif/pygbif

GBIF Python client

api biodiversity data ecology gbif

Last synced: 17 Mar 2026

https://github.com/expo/entity

Entity is a privacy-aware data layer for defining, caching, and authorizing access to application data models.

authorization data privacy

Last synced: 02 Mar 2026

https://github.com/infinum/datx

DatX is an opinionated JS/TS data store. It features support for simple property definition, references to other models and first-class TypeScript support.

collection data data-store hacktoberfest javascript mobx model open-source prop state-management

Last synced: 23 Mar 2025

https://github.com/join-monster/join-monster-graphql-tools-adapter

Use Join Monster to fetch your data with Apollo Server.

apollo batch data graphql join schema sql

Last synced: 01 Apr 2026

https://github.com/dbt-labs/jaffle_shop_duckdb

Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free!

data dbt duckdb sql

Last synced: 07 Apr 2025

https://github.com/rello/analytics

Analytics - Open source data warehouse and reporting for Nextcloud

analytics data data-warehouse datasources nextcloud visualization

Last synced: 18 Feb 2026

https://github.com/cipherstash/stack

Data level access controls via field level searchable encryption SDK for JavaScript/TypeScript

data data-security encryption encryption-sdk javascript postgres postgresql searchable-encryption security typescript

Last synced: 18 May 2026

https://github.com/odota/mobile

React Native apps for viewing Dota 2 data on Android/iOS

android data dota ios mobile react-native

Last synced: 21 Apr 2025

https://github.com/royal-statistical-society/datavisguide

Introductory guide to the art and science of data visualisation. Insights, advice, and examples (with code) to make data outputs more readable, accessible, and impactful.

charts data graphics tables visualization

Last synced: 27 Mar 2025

https://github.com/sheetjs/js-adler32

:ballot_box_with_check: ADLER-32 checksum

adler-32 bytes checksum data

Last synced: 12 Apr 2025

https://github.com/noahgift/cloud-data-analysis-at-scale

[Course-2020-2023] taught at Duke MIDS. This is also a Coursera Course that covers MLOps, ML Engineering and the foundations of Cloud Computing for Data Science.

analytics cloud data duke github hugging huggingface machine-learning mids syllabus

Last synced: 18 Feb 2026

https://github.com/ls9512/ubind

UBind is a value binding component for Unity, which is used to quickly realize the association binding between UI and logical data.

data databind databinding databinding-library datadriven framework u3d ugui uguicomponent ui unity unity3d unity3d-plugin unityplugin

Last synced: 09 Oct 2025

https://github.com/raystack/guardian

Guardian is universal data access management tool with automated access workflows and security controls across data stores, analytical systems, and cloud products.

access compliance control data dataops

Last synced: 24 Jul 2025

https://github.com/hadley/babynames

An R package containing US baby names from the SSA

data r

Last synced: 05 Apr 2025

https://github.com/jennybc/repurrrsive

Recursive lists to use in teaching and examples, because there is no mtcars for lists.

data json package r xml

Last synced: 06 Apr 2025

https://github.com/coderdarren/unitycore

A collection of essential game systems for Unity 3D. These generic systems can be applied to any Unity project.

audio data game game-dev game-development gamedev menu scene session tutorial tween unity unity-3d unity3d

Last synced: 12 Apr 2025

https://github.com/Meeshkan/micro-jaymock

Tiny API mocking microservice for generating fake JSON data.

data fake fake-data micro microservice test-data testing

Last synced: 07 May 2025

https://github.com/meeshkan/micro-jaymock

Tiny API mocking microservice for generating fake JSON data.

data fake fake-data micro microservice test-data testing

Last synced: 13 Mar 2025

https://github.com/coderDarren/UnityCore

A collection of essential game systems for Unity 3D. These generic systems can be applied to any Unity project.

audio data game game-dev game-development gamedev menu scene session tutorial tween unity unity-3d unity3d

Last synced: 25 Apr 2025

https://github.com/wfcd/warframe-drop-data

:moneybag: Warframe Drop Data in an easier to parse format.

data drop-data drop-sorting enemies game mod play-game relic reward warframe

Last synced: 04 Apr 2025

https://github.com/crunchydata/pgcompare

pgCompare โ€“ a straightforward utility crafted to simplify the data comparison process, providing a robust solution for comparing data across various database platforms.

compare data migration oracle postgres

Last synced: 05 Apr 2025

https://github.com/CrunchyData/pgCompare

pgCompare โ€“ a straightforward utility crafted to simplify the data comparison process, providing a robust solution for comparing data across various database platforms.

compare data migration oracle postgres

Last synced: 31 Mar 2025

https://github.com/glotzerlab/signac

Manage large and heterogeneous data spaces on the file system.

data data-management database reproducibility

Last synced: 08 Apr 2025

https://github.com/siriusastrebe/jsynchronous

Jsynchronous.js - Data synchronization for games and real-time web apps.

data games javascript nodejs realtime synchronization websockets

Last synced: 10 Apr 2025

https://github.com/c-3lab/dim

๐Ÿ“ฆ dim: Manage the open data in your project like a package manager.

cli commads command-line-tool data dataops dim gpt gpt-3 llm opendata package-manager public-data public-dataset

Last synced: 17 Jan 2026

https://github.com/stethoscope-js/stethoscope

๐Ÿฉบ Track, visualize, and embed your health and life data โ€” location, health, work, play, and more

automation clockify data github-actions goodreads google-fit last-fm life oura oura-ring pocket-casts rescue-time spotify time-tracking wakatime

Last synced: 22 Sep 2025

https://github.com/decileapp/decile

Simple, open-source analytics tool for any Postgres database.

analytics automation business dashboard data intelligence postgres reporting sql sql-editor

Last synced: 02 Aug 2025

https://github.com/bitquery/widgets

Widgets for blockchain data visualizations

analytics blockchain data graphql statistics widgets

Last synced: 23 Jun 2025

https://github.com/kwstat/agridat

Agricultural datasets

data rstats

Last synced: 16 May 2025

https://github.com/ghiscoding/aurelia-slickgrid

Aurelia-Slickgrid is a wrapper of the lightning fast & customizable SlickGrid datagrid, it also includes multiple Styling Themes

aurelia bootstrap bootstrap4 bootstrap5 data datagrid datatable graphql grid odata slickgrid treeview typescript

Last synced: 06 Apr 2025

https://github.com/airbytehq/airbyte-agent-sdk

๐Ÿ™ Drop-in tools that give AI agents reliable, permission-aware access to external systems.

ai ai-agents airbyte anthropic connectors data enterprise gemini integrations langchain llm mcp open-source openai pydantic-ai rag

Last synced: 19 Jun 2026

https://github.com/derekwisong/datui

Data Exploration in the Terminal

analysis cli data eda research tui

Last synced: 02 Jun 2026

https://github.com/miniql/miniql

A tiny JSON-based query language inspired by GraphQL

data data-analysis data-science graphql javascript json queries query query-language typescript

Last synced: 15 Apr 2025

https://github.com/ls9512/UBind

UBind is a value binding component for Unity, which is used to quickly realize the association binding between UI and logical data.

data databind databinding databinding-library datadriven framework u3d ugui uguicomponent ui unity unity3d unity3d-plugin unityplugin

Last synced: 25 Apr 2025

https://github.com/mdzio/ccu-historian

Der CCU-Historian erfasst die Betriebsdaten des Hausautomations-Systems HomeMatic der Firma eQ-3.

aquisition ccu ccu-historian data homematic pims smarthome

Last synced: 19 Jul 2025

https://github.com/propeldata/ui-kit

React components for data visualization and quickly building data analytics dashboards

analytics dashboards data datavisualization react reactjs

Last synced: 07 Apr 2026

https://github.com/Indie-Platforms/scrapecomfort

Desktop AI Data Scraper

ai data webscraping

Last synced: 14 Apr 2025

https://github.com/Canner/wren-engine

๐Ÿค– The semantic engine for LLMs, bringing semantic context to AI agents. ๐Ÿ”ฅ

business-intelligence data data-analysis data-analytics data-lake data-warehouse hacktoberfest llm semantic semantic-layer sql

Last synced: 01 Apr 2025

https://github.com/cially/cially

๐ŸชผCially is a powerful, open-source dashboard designed to provide in-depth insights, real-time analytics, and detailed statistics for your Discord server. Monitor member activity, track engagement trends, and make data-driven decisions with ease.

analytics api dashboard data discord discord-bot discord-js express expressjs full-stack nextjs pocketbase website

Last synced: 20 Jan 2026

https://github.com/pierridotite/stonks-dashboard

A cyberpunk-style, real-time financial monitor that runs directly in the terminal. It visualizes cryptocurrency and stock market data using ASCII charts and dynamic coloring.

cli dashboard data finance market market-data real-time stocks terminal trading tui

Last synced: 29 Dec 2025

https://github.com/the-osint-toolbox/people-search-osint

Search tools to help you find people, focused towards UK resources.

data data-aggregation osint people privacy search searching

Last synced: 15 Feb 2026

https://github.com/kayak/fireant

Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.

aggregated analysis analytics bi builder business data database fireant juypter mysql oracle pandas postgres python query rdbms science sql vertica

Last synced: 19 Jul 2025

https://github.com/jasonkuhrt/alge

Type safe library for creating Algebraic Data Types (ADTs) in TypeScript. ๐ŸŒฑ

adt algebraic-data-types data typescript

Last synced: 16 May 2025

https://github.com/piersyork/owidR

An R Package for Importing Data from Our World in Data

data data-visualisation economics r r-package

Last synced: 13 Jul 2025

https://github.com/data-engineering-community/data-engineering-project-template

This is a template you can use for your next data engineering portfolio project.

data data-engineering data-warehouse etl etl-pipeline python sql

Last synced: 06 Apr 2025

https://github.com/codeclimate/platform

Code Climate Engineering Data Platform

analytics code-quality codeclimate data specification

Last synced: 05 Mar 2026

https://github.com/chesslablab/php-chess

A chess library offering move validation, common formats, multiple variants, UCI engine support, explanation of chess positions, image recognition, PGN parsing and knowledge extraction from games.

board chess data database db fen game machinelearning ml pgn php

Last synced: 06 Mar 2025

https://github.com/mapbox/svg-to-geojson

Upload SVG, return GeoJSON.

data geojson svg

Last synced: 03 Apr 2025

https://github.com/autoviml/pandas_dq

Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.

data data-science dataquality dataqualitycheck machine-learning pandas python scikit-learn

Last synced: 07 Apr 2025

https://github.com/piersyork/owidr

An R Package for Importing Data from Our World in Data

data data-visualisation economics r r-package

Last synced: 10 Apr 2025

https://github.com/kaarthik108/snowbrain

snowBrain - AI-Driven Insights with Snowflake (New version- https://github.com/kaarthik108/snowbrain-AGUI)

chatgpt data fastapi langchain nextjs pinecone snowflake tailwindcss

Last synced: 17 Mar 2025

https://github.com/airbytehq/airbyte-agent-connectors

๐Ÿ™ Drop-in tools that give AI agents reliable, permission-aware access to external systems.

ai ai-agents airbyte anthropic connectors data enterprise gemini integrations langchain llm mcp open-source openai pydantic-ai rag

Last synced: 09 Apr 2026

https://github.com/DataKitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake

Last synced: 05 May 2025

https://github.com/MaJerle/lwpkt

Lightweight packet protocol structure for multi-device communication focused on RS-485

command data devices length master microcontroller multi-slaves optimization packet packet-structure protocol rs-485 simple stm32

Last synced: 14 May 2025

https://github.com/majerle/lwpkt

Lightweight packet protocol structure for multi-device communication focused on RS-485

command data devices length master microcontroller multi-slaves optimization packet packet-structure protocol rs-485 simple stm32

Last synced: 04 Mar 2025

https://github.com/postlight/account

๐Ÿ“š๏ธ โž• ๐Ÿ”ข Tell little stories with numbers

business-models data data-journalism diagonal-hamburger expr-eval journalism labs parsimmon react sliders spreadsheet storytelling

Last synced: 06 Aug 2025

https://github.com/Rello/analytics

Analytics - Open source data warehouse and reporting for Nextcloud

analytics data data-warehouse datasources nextcloud visualization

Last synced: 03 Apr 2025

https://github.com/bps-statistics/form-gear

FormGear is a framework engine for dynamic form creation and complex form processing and validation for data collection.

census data data-collection form-builder form-engine form-generator national-statistics official-statistics survey survey-builder survey-form

Last synced: 13 Oct 2025

https://github.com/cosmo/binarykit

๐Ÿ’พ๐Ÿ”๐Ÿงฎ BinaryKit helps you to break down binary data into bits and bytes, easily access specific parts and write data to binary.

binary binary-data bit bits byte bytes data hacktoberfest nsdata swift

Last synced: 27 Feb 2026

https://github.com/Cosmo/BinaryKit

๐Ÿ’พ๐Ÿ”๐Ÿงฎ BinaryKit helps you to break down binary data into bits and bytes, easily access specific parts and write data to binary.

binary binary-data bit bits byte bytes data hacktoberfest nsdata swift

Last synced: 16 Mar 2025

https://github.com/bartobri/data-structures-c

A collection of algorithms for data structure manipulation in C

algorithms c data structures

Last synced: 26 Feb 2026

https://github.com/datakitchen/data-observability-installer

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

data data-engineering data-observability data-profiling data-quality data-reliability data-science datachecker datacleaner datacleaning dataops dataquality datatesting datavalidation mssql pipeline-tests postgresql redshift self-hosted snowflake

Last synced: 06 Apr 2026

https://github.com/joelgmsec/invoke-dnsteal

Simple & Customizable DNS Data Exfiltrator

data delay dns domain exfiltrator fake random tcp udp

Last synced: 10 Mar 2026

https://github.com/dbt-labs/dbt-meshify

A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.

data dbt dbt-cloud dbt-core

Last synced: 09 Apr 2025

https://github.com/nberlette/canbus

Controller Area Network (CAN) reference, wiki, and DBC files.

automotive bmw can can-bus canbus data database dbc networking vector wiki

Last synced: 21 Mar 2025

https://github.com/DataWithBaraa/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 14 Oct 2025

https://github.com/anchore/vunnel

Tool for collecting vulnerability data from various sources (used to build the grype database)

data grype hacktoberfest vulnerability

Last synced: 14 Feb 2026

https://github.com/JoelGMSec/Invoke-DNSteal

Simple & Customizable DNS Data Exfiltrator

data delay dns domain exfiltrator fake random tcp udp

Last synced: 12 Jul 2025