An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/daveoncode/pyvaru

Rule based data validation library for python 3.

data data-validation form-validation model validation validator

Last synced: 06 May 2025

https://github.com/marcskovmadsen/holoviz-mcp

✨A MCP server that provides intelligent access to the HoloViz ecosystem for humans and AIs.

ai analytics data datascience dataviz holoviews holoviz hvplot mcp model-context-protocol panel python

Last synced: 25 Jan 2026

https://github.com/streamr-dev/chat

A proof-of-concept chat application built on the Streamr network.

chat dapp data messaging real-time streamr web3

Last synced: 13 Jul 2025

https://github.com/rmax/databrewer-recipes

DataBrewer Recipes Repository.

data datasets

Last synced: 20 Mar 2025

https://github.com/izelnakri/memoria

Single JS/TS ORM for frontend, backend & in-memory in-browser testing. Based on typeorm API, allows you to change adapters: MemoryAdapter, RESTAdapter, SQLAdapter etc.

browser data database decorators frontend graphql in-memory-database javascript json-api mock-server orm rest rest-client server sql state state-management testing-tools typeorm typescript

Last synced: 29 Jun 2025

https://github.com/nasa/ziggy

Ziggy, a portable, scalable infrastructure for science data processing pipelines, is the child of the Transiting Exoplanet Survey Satellite (TESS) pipeline and the grandchild of the Kepler Pipeline.

algorithm analysis arc data data-analysis data-reduction java k2 kepler linux macos nasa open-source pipeline science tess ziggy

Last synced: 26 Jan 2026

https://github.com/stefen-taime/car-price-predictor

Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration

data data-science dataanalysis-projects engineering machine-learning mlops predictive-modeling

Last synced: 04 Aug 2025

https://github.com/hitsz-ids/argus

Argus is a result review engine that prevents data leakage and ensures out-of-domain result traceability.

data data-protection data-security privacy secruity

Last synced: 01 Jul 2025

https://github.com/jgmdev/lessram

Pure PHP implementation of array data structures that use less memory.

data extension less memory php ram structures

Last synced: 15 May 2025

https://github.com/siddharthpatelde/distance-to-next-edge

This project focuses on building a logic to calculate the distance to the next edge when a robot equipped with a 2D LIDAR sensor is placed on a table. The project leverages the RPlidar.h library and a Raspberry Pi Pico to work with the LIDAR sensor.

2dlidar arduino cpp data data-visualization filtering-data functions jason lidar linux lowpass-filter mathematics physics raspberry-pi-pico ros serial-communication trignometry uart

Last synced: 12 Aug 2025

https://github.com/didinj/spring-boot-security-rest

Securing RESTful API with Spring Boot, Security, and Data MongoDB

api data example mongodb rest restful security spring springboot tutorial

Last synced: 05 Aug 2025

https://github.com/DesignandHuman/qui-possede-les-medias

Qui possède les grands médias que nous lisons ?

data extension holders media web-extension

Last synced: 02 Aug 2025

https://github.com/xiaodaigh/fstfileformat.jl

Julia bindings for the fst format

data fst julia julia-language julialang

Last synced: 07 May 2025

https://github.com/ibhavikmakwana/sample_data

generate random sample data to test your application.

dart data flutter sample sample-data

Last synced: 26 Mar 2025

https://github.com/ff137/bitstamp-btcusd-minute-data

Daily updates of Bitstamp BTC/USD 1-minute OHLC data, with historical data since 2012

bitcoin bitstamp candle data ohlcv-data price-data

Last synced: 09 Aug 2025

https://github.com/grimen/python-attributedict

A dictionary object with attributes support - for Python.

attribute attributes custom data dict dictionary object properties property python struct

Last synced: 03 May 2025

https://github.com/snowplow/quickstart-examples

Examples of how to automate creating a Snowplow Community Edition pipeline

analysis aws azure data gcp snowplow snowplow-analytics snowplow-pipeline terraform

Last synced: 21 Apr 2025

https://github.com/knime/knime-r

KNIME Interactive R Statistics Integration

analysis data knime learning machine mining r statistics

Last synced: 21 Jan 2026

https://github.com/hasnep/dataskimmer.jl

📊 A Julia package that summarises tabular data in the REPL

data skimr summary

Last synced: 28 Dec 2025

https://github.com/malloydata/malloy-samples

Malloy model examples and associated datasets

data data-modeling malloy semantic-modeling sql

Last synced: 16 Oct 2025

https://github.com/atolcd/hop-gis-plugins

🗺 GIS plugins for Apache Hop Orchestration Platform

data dxf etl geojson geopackage gis gpx hop java mif-mid shp spatialite

Last synced: 23 Jan 2026

https://github.com/malloydata/malloy-vscode-extension

The Malloy Visual Studio Code extension facilitates building Malloy data models, querying and transforming data, and creating simple visualizations and dashboards

data data-modeling malloy semantic-modeling sql

Last synced: 26 Apr 2025

https://github.com/osl-pocs/skdata

Python tools for data analysis

data data-analysis data-science open-data python

Last synced: 12 Dec 2025

https://github.com/castelao/gsw-rs

Unofficial Gibbs Sea Water Oceanographic Toolbox of TEOS-10 in Rust

data ocean ocean-sciences oceanography rust

Last synced: 03 Apr 2025

https://github.com/vzhufk/z1p

Zip Codes Validation and Parse.

data geo geocode geolocation latitude longitude zip zipcode

Last synced: 12 Jan 2026

https://github.com/stephanakkerman/crypto-ohlcv

Gets historical OHLCV data from supported exchanges and converts it into dataframe readable by TensorTrade.

binance bitcoin crypto cryptocurrency data ethereum ftx ohlcv ohlcv-data

Last synced: 10 Apr 2025

https://github.com/bradleyboehmke/completejourney

An R data 📦 of retail shopping transactions for 2469 households over one year

data dataset r r-package

Last synced: 13 Apr 2025

https://github.com/siongui/data

Data files for Pāḷi Tipiṭaka, Pāḷi Dictionaries, and external libraries

data pali translation

Last synced: 08 May 2025

https://github.com/zeutschler/cubedpandas

Filter faster, analyze smarter – because your DataFrames deserve it!

analysis cube data olap pandas

Last synced: 12 Apr 2025

https://github.com/axetroy/struct

A Modern, Scalable , Graceful, Easy Use data structure validator

data modern struct validator

Last synced: 13 Sep 2025

https://github.com/base-repos/base-data

adds a `data` method to base-methods. 100% unit test coverage and browserify-friendly lazy-caching.

base context data extend files glob merge object objects plugin read

Last synced: 11 Jan 2026

https://github.com/dkoguciuk/mesh2pointcloud

A mini scripts to sample ModelNet40 or ShapeNetCore55v2 meshes into 3D point clouds

3d data deep-learning machine-learning

Last synced: 20 Mar 2025

https://github.com/gsurma/twitter_data_parser

Python scripts that download metadata and tweets for given users.

data machine-learning parser python python2 twitter twitter-api

Last synced: 20 Jul 2025

https://github.com/skysingh04/fractal

A flexible, configurable data processing tool

cli compiler data firebase go golang lexer parser

Last synced: 07 May 2025

https://github.com/peterdavehello/docker-azcopy

🐳 Tiny Dockerized AzCopy (Azure Storage data transfer utility) inside Alpine Linux 🐧 (~10MB)

azcopy azure cli container copy data docker docker-image hacktoberfest storage sync

Last synced: 18 Mar 2025

https://github.com/robjhyndman/tscompdata

Time series competition data

data r time-series

Last synced: 14 Jul 2025

https://github.com/Aaronong/SQLitmus

A simple and practical tool for SQL data generation and performance testing

data database electron mock-data react redux sql

Last synced: 09 Jul 2025

https://github.com/hodur-org/hodur-lacinia-schema

Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.

clojure data graphql lacinia modeling schema

Last synced: 12 Dec 2025

https://github.com/polakowo/gtd-analysis

Visualizing the Global Terrorism Database (GTD) with D3.js

analysis d3 d3js data terrorism

Last synced: 07 May 2025

https://github.com/hannesdelbeke/blender-transfer-vertex-order-addon

transfer vertex order from one mesh to another mesh in Blender

bake blender data mesh order transfer vertex

Last synced: 29 Oct 2025

https://github.com/rezapace/komputasi-big-data

This repository contains materials and practical exercises for learning Python in the context of Big Data Computation. The focus is on analyzing and processing large datasets using various tools and techniques.

ai big data data-science git-reza gunadarma gundar komputasi-big-data

Last synced: 28 Sep 2025

https://github.com/benoitvx/data-gouv-skill

🇫🇷 Skill professionnel pour Claude Code - Accès au catalogue de données ouvertes data.gouv.fr

claude-code data datagouv france opendata python

Last synced: 13 Jan 2026

https://github.com/senrok/yadal

Yet Another Data Access Layer: Accessing S3, POSIX in the same way. Deeply inspired by Databend's OpenDAL

cloud-native data go minio s3 storage

Last synced: 12 Jan 2026

https://github.com/doemser/ai-data-generata

OpenAI-powered JSON data generator.

ai chatbot data generator json-api mockdata nextjs openai react reactjs

Last synced: 11 Jun 2025

https://github.com/espoirx/elegantdata

像操作Room一样操作 SharedPreferences 和 File 文件.

data database db elegantdata file room sharedpreferences

Last synced: 01 Sep 2025

https://github.com/ahmetfurkandemir/amazon-dynamodb-building-nosql-database-driven-applications

Amazon DynamoDB: Building NoSQL Database-Driven Applications (Coursera)

amazon aws cloud data database dynamodb nosql sql veri veritabani

Last synced: 15 Apr 2025

https://github.com/gragland/react-component-data

🍯 Data fetching for server-rendered React applications.

data props react resolving server-rendered universal-app

Last synced: 07 May 2025

https://github.com/kiwicom/contessa

Easy way to define, execute and store quality rules for your data.

data data-engineering data-quality framework mysql postgres python quality-assurance sqlite3

Last synced: 29 Jul 2025

https://github.com/kornicameister/asyncpg-migrate

Database migration tool that does what alembic does... asynchronously

asyncio asyncpg cli click data database library migration postgresql python python3 python36 python37

Last synced: 20 Aug 2025

https://github.com/dimitryzub/hotels-scraper-js

Scrape Airbnb, Booking, Hotels.com from a single JavaScript module. ❗No longer maintained.

airbnb booking data datascraping hotels hotels-api playwright puppeteer puppeteer-extra webscraping

Last synced: 07 Sep 2025

https://github.com/hodur-org/hodur-spec-schema

Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.

clojure data modeling schema spec types validation

Last synced: 12 Dec 2025

https://github.com/ajayarunachalam/gui-pandas-ai

GUIPandasAI - Integrating Generative AI capabilities into Pandas as Web Interface along with key-words based data analysis services

ai chatgpt data data-analysis data-analytics data-science generative-ai gpt-3 gpt-4 llm pandas python streamlit web-app

Last synced: 06 Jul 2025

https://github.com/uladz-zubrycki/Reseed

Initialize and clean integration tests database in a convenient, reliable and fast way.

csharp data data-seeding database dotnet integration-testing mssql mssql-database ndbunit2 netcore seed tests

Last synced: 06 Aug 2025

https://github.com/juliahealth/icd_gems.jl

ICD_GEMs.jl is a Julia package that allows to translate ICD-9 codes in ICD-10 and viceversa via the General Equivalence Mappings (GEMs) of the International Classification of Diseases (ICD).

cdc clinical-data clinical-research data death-certificates epidemiology health-data icd-10 icd-10-cm icd-9 icd-codes julia julia-language julia-package mortality-data public-health who

Last synced: 22 Apr 2025

https://github.com/marykdb/maryk

Maryk is a Kotlin Multiplatform library which helps you to store, query and send data in a structured way over multiple platforms. The data store stores any value with a version, so it is possible to request only the changed data or live listen for updates.

data database graph json kotlin kotlin-multiplatform rocksdb serialization versioned yaml

Last synced: 14 Jan 2026

https://github.com/danilofreire/prisonbrief

An R package that returns tidy data from the World Prison Brief website.

data prison rstats world-prison-brief

Last synced: 09 Apr 2025

https://github.com/mark-hoffmann/fastteradata

Tools for faster and optimized interaction with Teradata and large datasets.

data fastexport fastteradata python teradata

Last synced: 26 Jan 2026

https://github.com/cattte/reddit-migrate

A command-line tool to migrate, export, import, and purge reddit account data!

data export import migrate reddit

Last synced: 09 Oct 2025

https://github.com/brightway-lca/brightway2-data

Tools for the management of inventory databases and impact assessment methods. Part of the Brightway LCA framework.

brightway data life-cycle-assessment python

Last synced: 19 Oct 2025

https://github.com/glamboyosa/mey

A react package that exports hooks for handling the request lifecycle.

data data-fetching fetch hooks react react-native

Last synced: 12 May 2025

https://github.com/kettanaito/react-data-preview

Fancy interactive preview of your JavaScript data.

data data-preview javascript preview react react-data-preview

Last synced: 06 May 2025

https://github.com/zq99/pgn2data

A library that converts a chess pgn file into a tabulated CSV data set.

chess chess-analysis csv data dataset fen library pgn

Last synced: 17 Jan 2026

https://github.com/ECCC-MSC/msc-animet

MSC AniMet is a simple tool enabling users to interact with MSC Open Data weather data and create custom weather animations for any area in the world. The resulting animations can be downloaded and shared with a permalink.

animation canada data visualization weather wms

Last synced: 20 Jul 2025

https://github.com/hoangsonww/latticedb-nextgen-dbms

🗂️ A next-gen relational database with mergeable CRDT tables, time-travel queries, vector search, and differential privacy built-in. Written in C++17 with a SQL engine, WAL storage, and a modern web Studio.

cmake cplusplus data database databases db dbms dbms-project docker file relational-database relational-databases sql sql-parser

Last synced: 13 Sep 2025

https://github.com/chainguard-dev/image-comparison

Comparison of Chainguard Images to others

containers data no-ghaudit-branch-protections security

Last synced: 29 Oct 2025

https://github.com/d-wasserman/shared-row

This is an open data specification for describing the right-of-way (ROW) for street centerline networks. It is intended to establish a common set of attributes (schema) to describe how space is allocated along a streets right of way from sidewalk edge to sidewalk edge.

data right-of-way row schema sharedstreets specification standard streets

Last synced: 05 May 2025

https://github.com/marioruiz/string_pattern

Generate strings supplying a simple pattern. Perfect to be used in test data factories. Validate if a text fulfills a specific pattern. Also you can use regular expressions (Regexp) to generate strings: `/[a-z0-9]{2,5}\w+/.gen`. Generate words in English or Spanish.

data error-detection factories generation pattern random regex-pattern regexp regular-expressions ruby ruby-gem string test

Last synced: 05 May 2025

https://github.com/darothen/xbpch

xarray interface for bpch files

analysis binary climate data geos-chem python xarray

Last synced: 21 Mar 2025

https://github.com/retailmenotsandbox/dart

Self-service data workflow management

data datastore workflow

Last synced: 10 Apr 2025

https://github.com/lcsb-biocore/distributeddata.jl

Simple distributed data manipulation and processing routines in Julia

data distributed julia

Last synced: 22 Apr 2025

https://github.com/itsjafer/tv-show-recommendations

Machine learning pipeline trained offline that, given a TV Show, recommends 10 similar TV Shows using cosine similarities based on a variety of features

data engine learning machine python recommendation science tv-shows

Last synced: 09 Jul 2025

https://github.com/chase-manning/eth-twitter-accounts

Data dump of 15,451 Twitter accounts and their Ethereum Address

data ethereum twitter

Last synced: 12 Apr 2025

https://github.com/minhaskamal/alphabetrecognizer

Simple Optical Character Recognizer (english-ocr-image-to-text-recognition-sample-trainig-alphabet-photo-data-database-dataset)

alphabet-recognizer data database english image-processing java machine-learning ocr sample template-matching text-recognition training-data writing

Last synced: 11 Apr 2025

https://github.com/anuran-roy/serpytor

A distributed, low-code, end-to-end data collection and analysis tool for data folks. Take the pain out of data collection from your pipeline!

data dataengineering datascience distributed-computing distributed-systems low-code lowcode open-source pipeline python python3

Last synced: 16 May 2025

https://github.com/randomfractals/duckdb-sql-tools

DuckDB SQL Tools add DuckDB support to VSCode, and provide database schema and SQL query interfaces for the popular SQLTools extension, SQL query editor, language server, and data processing tools.

data data-tools duckdb sql sql-tools sqltools sqltools-driver viewer vscode

Last synced: 22 Mar 2025

https://github.com/PoliticaArgentina/geoAr

🌎 🇦🇷 spatial toolbox for R

argentina data geo gis maps r

Last synced: 29 Jul 2025

https://github.com/nxr-deen/student-records

This repository contains a C program that manages student data in a binary file, allowing for input and retrieval of records.

binaryfiles c data management records students

Last synced: 06 Aug 2025

https://github.com/nevillelyh/scio-koans

A collection of Scio exercises inspired by Ruby Koans and many others.

algebird data scala scio

Last synced: 22 Aug 2025

https://github.com/gexijin/datably

Datably.ai

ai analytics data

Last synced: 29 Jul 2025

https://github.com/w2sv/koala

A poor man's version of a pandas DataFrame for dart.

dart dartlang data dataframe datamanagement datamanipulation flutter pandas-dataframe

Last synced: 09 Sep 2025

https://github.com/martinstarman/instantapi

Instant API for development

api data micro mock random

Last synced: 14 Apr 2025

https://github.com/vkcom/vkdata-sketchplugin

Sketch plugin for using data from your account at vk.com

data sketch sketch-plugin sketchapp vk vkontakte

Last synced: 27 Sep 2025

https://github.com/marco-roy/DDO

A DBT package to perform DataOps & administrative CI/CD on your data warehouse.

data dataops datawarehouse datawarehouseautomation dbt snowflake

Last synced: 05 May 2025

https://jhildenbiddle.github.io/class-change/

A micro-library for manipulating CSS class names, triggering change events using HTML data attributes, and creating declarative class-related event listeners

attributes change class classlist css data event event-listener html listener polyfill ponyfill

Last synced: 11 May 2025

https://github.com/mikestefanello/batcher

Type-safe, automatic, asynchronous batch processing.

batch batch-processing concurrency data goroutines

Last synced: 26 Sep 2025

https://github.com/vojay-dev/sc2-data-pipeline

StarCraft 2 Data Pipeline with Airflow, DuckDB and Streamlit

airflow data data-engineering data-science duckdb starcraft2 streamlit

Last synced: 20 Sep 2025

https://github.com/tushar2704/everyday_python

Welcome to Everyday Python Sheets – your go-to resource for everyday Python cheat sheets, pro tips, interview questions, Python one-liners, and Python data structures. Whether you're a beginner looking to learn Python or an experienced developer seeking quick reference materials, this Streamlit application has got you covered.

artificial-intelligence cheatsheet data data-analysis data-science data-structures data-visualization database protips python streamlit streamlit-tushar2704 tushar2704

Last synced: 04 Nov 2025

https://github.com/jhildenbiddle/class-change

A micro-library for manipulating CSS class names, triggering change events using HTML data attributes, and creating declarative class-related event listeners

attributes change class classlist css data event event-listener html listener polyfill ponyfill

Last synced: 17 Aug 2025