An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/zhangyoujia/hd_write_verify

The LBA tools (hd_write_verify and hd_write_verify_dump) are very useful for testing storage stability and verifying data consistency; they are much better than the verification functions of FIO and vdbench. For example: physical disk: ide/sata/scsi/ssd/iscsi/fc/raid/...; virtual disk: loop/nbd/lvm/soft raid/...; VM disk: ide/sata/scsi/virtio-blk/virtio-scsi/...

consistency data filesystem migration physical-disk snapshot stability storage testing verifying virtual-disk vm-backup vm-disk

Last synced: 26 Jan 2026

https://github.com/spine-tools/spine-toolbox

Spine Toolbox is an open source Python package to manage data, scenarios and workflows for modelling and simulation. You can have your local workflow, but work as a team through version control and SQL databases.

anaconda data energy miniconda python simulation-model spine-toolbox workflow

Last synced: 04 Apr 2025

https://github.com/hi-folks/data-block

PHP Package for handling, querying, filtering, and setting nested data structures

data data-structure hacktoberfest json-data php

Last synced: 04 Jan 2026

https://github.com/virtadpt/exocortex-halo

Various and sundry additional pieces of software I've written to incorporate into my exocortex.

bots conversation data exocortex interactive parser python rest-api

Last synced: 09 Apr 2025

https://github.com/glynnbird/countriesgeojson

Countries of the world as GeoJSON

data geography geojson geospatial json

Last synced: 21 Mar 2025

https://github.com/dkandalov/activity-tracker

Plugin for IntelliJ IDEs to track and record user activity

data intellij plugin

Last synced: 11 Sep 2025

https://github.com/anish-agnihotri/blog-effective-nft-launches-data

Data+code for NFT launch guide blogpost.

data exploiting fairness launches nft

Last synced: 14 Apr 2025

https://github.com/9b/netinfo

Simple IP enrichment service and API wrapping PyASN and MaxMind GeoIP.

cybersecurity data devops enrichment ip-address-lookup network osint python3 webservice

Last synced: 25 Jan 2026

https://github.com/stephanakkerman/tensortrade-extras

Discover a curated list of projects complementing TensorTrade, distinct from those mentioned in its official documentation. Contributions are welcome; if you spot a missing project, please submit a pull request!

crypto cryptocurrency cryptocurrency-trading-bot data finance live-trading openai-gym reinforcement-learning reinforcement-learning-agent stock-trading stocks technical-analysis tensortrade

Last synced: 18 Mar 2025

https://github.com/stonecypher/flocks.js

A radically simpler alternative to Flux - opinionated React state and rendering management

data flux javascript js layer react react-js reactjs redux screw-flux simple simplicity

Last synced: 02 May 2025

https://github.com/anandchowdhary/bookshelf-action

📚 Track your reading using GitHub Actions

api books data generator github-actions reading tracker

Last synced: 05 Apr 2025

https://github.com/alibaba/dimbin

High-performance serialization for multi-dimension arrays (a high-performance serialization scheme for massive data)

binary csharp data serialization typescript

Last synced: 14 Oct 2025

https://github.com/rishit-dagli/cppe-dataset

Code for our paper CPPE-5 (Medical Personal Protective Equipment), a new challenging object detection dataset

artificial-intelligence computer-vision cppe5 data dataset deep-learning machine-learning models object-detection pretrained-models pytorch tensorflow vision

Last synced: 15 Jun 2025

https://github.com/publici/us-polling-places

Standardized data on historical general election polling places in the United States.

data elections open-data open-elections polling-locations polling-places

Last synced: 03 Jul 2025

https://github.com/purarue/HPI

Human Programming Interface - a way to unify, access and interact with all of my personal data [my modules]

data gdpr history lifelogging personal-api quantified-self

Last synced: 23 Oct 2025

https://github.com/kirankunigiri/Apple-Family

A simple framework that brings Apple devices together - like a family

bluetooth connectivitiy data ios macos usb wifi

Last synced: 30 Jul 2025

https://github.com/AnandChowdhary/bookshelf-action

📚 Track your reading using GitHub Actions

api books data generator github-actions reading tracker

Last synced: 18 Apr 2025

https://github.com/enviodev/hyperindex

📖 Blazing-fast multi-chain indexer

blockchain dapp data envio evm framework fuel indexer rescript

Last synced: 04 Apr 2025

https://github.com/visual-layer/visuallayer

Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.

cleaning computer computer-vision data data-science dataset datasets-preparation generative machine-learning python vision

Last synced: 19 Apr 2025

https://github.com/kirankunigiri/apple-family

A simple framework that brings Apple devices together - like a family

bluetooth connectivitiy data ios macos usb wifi

Last synced: 15 Sep 2025

https://github.com/caduandrade/davi_flutter

A fully customized data view that builds the cells on demand. Focused on Web/Desktop applications. Bidirectional scroll bars.

data dataview flutter grid layout table widget

Last synced: 06 Apr 2025

https://github.com/tensorflow/tfjs-data

Simple APIs to load and prepare data for use in machine learning models

data deep-learning javascript machine-learning neural-network tensorflow tfjs

Last synced: 30 Sep 2025

https://github.com/doodlewind/bumpover

🚧 Async data transforming with simple rules.

data json markup migration tree validation xml

Last synced: 11 Sep 2025

https://github.com/jneidel/job-titles

Normalized dataset of 70k job titles

data dataset jobs json opendata

Last synced: 27 Jul 2025

https://github.com/0xplaygrounds/subgrounds

An intuitive Python library for interfacing with subgraphs and GraphQL

analytics blockchain data decentralized graph graphql pandas python subgraphs substreams

Last synced: 12 Sep 2025

https://github.com/malloydata/malloy-composer

Malloy Composer is a simple application to build dashboards or run ad-hoc queries using an existing Malloy model

business-analytics business-intelligence data data-modeling data-visualization malloy semantic-modeling

Last synced: 08 May 2025

https://github.com/ArtesiaWater/hydropandas

Module for loading observation data into custom DataFrames

data groundwater hydrology observations pandas timeseries

Last synced: 20 Jul 2025

https://github.com/matheusfelipeog/worldometer

Get live, population, geography, projected, and historical data from around the world 🌍

api data historical historical-data live livedata metrics mit-license projected pypi python scraping world worldometer worldometer-api worldometer-scraping worldometers

Last synced: 12 Apr 2025

https://github.com/binste/dbt-ibis

Write your dbt models using Ibis

data dbt ibis

Last synced: 06 Apr 2025

https://github.com/patch/i18n-testing

International data for testing and QA

data i18n qa testing unicode

Last synced: 06 Jan 2026

https://github.com/0xPlaygrounds/subgrounds

An intuitive Python library for interfacing with subgraphs and GraphQL

analytics blockchain data decentralized graph graphql pandas python subgraphs substreams

Last synced: 09 May 2025

https://github.com/raystack/compass

Compass is an enterprise data catalog that makes it easy to find, understand, and govern data.

data dataops discovery lineage metadata

Last synced: 10 Jul 2025

https://github.com/artesiawater/hydropandas

Module for loading observation data into custom DataFrames

data groundwater hydrology observations pandas timeseries

Last synced: 26 Jun 2025

https://github.com/ngnjs/ngn

A systems development platform. A revamped portal is coming soon at:

browser data data-model deno event-driven eventemitter javascript middleware ngn nodejs queue systems web

Last synced: 16 May 2025

https://github.com/datasets/football-datasets

Major Europe leagues data (England, Spain, Italy, Germany and France)

csv data datasets football open soccer

Last synced: 31 Mar 2025

https://github.com/paintedbicycle/sketch-data-faker

A Sketch plugin providing 130+ types of smart placeholder content for your mockups from Faker.js and other sources.

api content data fakerjs javascript json mock-data plugin sketch sketch-plugin sketchapp

Last synced: 26 Jun 2025

https://github.com/pedrokehl/caminho

Tool for creating efficient data pipelines in a JavaScript environment

backpressure concurency data dataprocessing functional javascript parallel pipeline reactive typescript

Last synced: 06 Apr 2025

https://github.com/whitfin/jen

A fast utility to generate fake/test documents based on a template

data dataset generator json template templating

Last synced: 22 Aug 2025

https://github.com/blazejkustra/dynamode

Dynamode is a modeling tool for Amazon's DynamoDB

amazon aws data database datastore db document dynamo dynamodb model nosql schema

Last synced: 14 Sep 2025

https://github.com/octoenergy/timeserio

Better `keras` models for time series and beyond

data data-science

Last synced: 24 Jun 2025

https://github.com/nom-tam-fits/nom-tam-fits

A full featured 100% Java library for reading and writing FITS files

astronomy data fits fits-files fits-image fitsio java open-source scientific

Last synced: 11 Jan 2026

https://github.com/greenelab/adage

Data and code related to the paper "ADAGE-Based Integration of Publicly Available Pseudomonas aeruginosa..." Jie Tan, et al · mSystems · 2016

autoencoders data dataset denoising-autoencoders gene-expression machine-learning manuscript methodology neural-networks paper pseudomonas-aeruginosa research supplement

Last synced: 09 Oct 2025

https://github.com/alannikos/edg4llm

A unified tool to generate fine-tuning datasets for LLMs, including questions, answers, and dialogues. ✨🤖📚💬

chagpt chatglm data deepseek fine-tuning generation internlm llm

Last synced: 14 Jan 2026

https://github.com/altaurog/pdfforms

Populate fillable PDF forms from a CSV data file

data forms pdf

Last synced: 12 May 2025

https://github.com/disease-sh/node-api

A JavaScript API Wrapper for NovelCOVID/API

coronavirus data javascript library open-source wrapper

Last synced: 21 Apr 2025

https://github.com/514-labs/moose

The developer framework for your data & analytics stack

analytics data dataengineering deployment framework insights metrics python rust typescript

Last synced: 05 Apr 2025

https://github.com/rolyatmax/citibike-trips

Visualizing citibike trips with webgl

citibike data javascript regl visualization webgl

Last synced: 14 Apr 2025

https://github.com/hswick/jutsu

Graphing tool for Clojure built with the web and interactivity in mind

clojure data plotlyjs visualization

Last synced: 09 Sep 2025

https://github.com/micromata/generator-http-fake-backend

Yeoman generator for building a fake backend by providing the content of JSON files or JavaScript objects through configurable routes.

api backend data fake fake-data http http-server json mock mocking mocking-server mocks node nodejs rest rest-api restful restful-api yeoman yeoman-generator

Last synced: 14 Jan 2026

https://github.com/bluzi/name-db

🚀 A multilingual collection of names from around the world

data language names translations

Last synced: 09 Oct 2025

https://github.com/z3z1ma/alto

Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plugins, and create a data reservoir whereby you can extract once and replay to as many destinations as you want.

data data-pipeline etl meltano singer

Last synced: 17 Mar 2025

https://github.com/fillmula/jsonclasses

🌎 The Modern Declarative Data Flow Framework for the AI Empowered Generation.

data json python validation

Last synced: 07 Sep 2025

https://github.com/noopeeks/datanvim

A fully-featured batteries-included Neovim distribution for the world of Data Science. Prepared to run code and interact with Jupyter Notebooks without ever leaving your terminal.

data data-science distribution jupyter-notebook machine-learning neovim nvim nvim-config text-editor vim

Last synced: 06 Oct 2025

https://github.com/unsw-ceem/nemosis

NEMOSIS: NEM Open-source information service. A Python package for downloading historical data published by the Australian Energy Market Operator (AEMO)

aemo australia data energy national-electricity-market nem nemweb python

Last synced: 04 Apr 2025

https://github.com/nikitastupin/orgs-data

Mapping from bug bounty and vulnerability disclosure programs to respective GitHub organizations

bug-bounty data github reconnaissance vulnerability-disclosure

Last synced: 09 Apr 2025

https://github.com/queryverse/excelreaders.jl

ExcelReaders is a package that provides functionality to read Excel files.

data excel julia queryverse

Last synced: 12 Apr 2025

https://github.com/geirolz/fly4s

A lightweight, simple and functional wrapper of Flyway using cats effect.

cats cats-effect data database database-migrations db flyway flyway-migrations flywaydb functional-programming persistence scala

Last synced: 11 Apr 2025

https://github.com/leeper/csvy

Import and Export CSV Data With a YAML Metadata Header

cran csv csvy data r yaml

Last synced: 17 Mar 2025

https://github.com/fityannugroho/idn-area-data

Provides the administrative areas data of Indonesia based on the latest official sources 🇮🇩

data data-sources hacktoberfest idn-area indonesia island javascript open-data pulau wilayah wilayah-indonesia

Last synced: 06 Apr 2025

https://github.com/kristoferjoseph/redeux

Minimal unidirectional data flow utility library

data flux reducer store unidirectional utility

Last synced: 06 May 2025

https://github.com/leonawicz/rtrek

R package for Star Trek datasets and related R functions.

data r-package stapi star-trek

Last synced: 06 Sep 2025

https://github.com/worldbank/llm4data

LLM4Data is a Python library designed to facilitate the application of large language models (LLMs) and artificial intelligence for development data and knowledge discovery.

ai data development-data gpt gpt-4 indicators llm llm4data metadata sql wdi world-development-indicators

Last synced: 07 Apr 2025

https://github.com/hadley/neiss

Data from National Electronic Injury Surveillance System

data r

Last synced: 19 Oct 2025

https://github.com/digao-dalpiaz/dztalkapp

Delphi non-visual component to communicate between applications

applications communication component data delphi pascal

Last synced: 12 Jul 2025

https://github.com/dxxxxy/EssentialCosmeticsUnlocker

Client-side only patch that allows you to unlock ALL cosmetics (+ emotes) in the Essential mod. Works on every version of Essential MC (1.8.9 - 1.20.6).

client compatibility config cosmetics data dump emotes essential fabric forge free inject minecraft mixin mod patch side universal unlocker

Last synced: 09 Aug 2025

https://github.com/kevinwang15/treebox

An interactive TreeMap visualization - Please star if you like this project

canvas canvas2d data javascript treemap visualization

Last synced: 13 May 2025

https://github.com/turbot/steampipe-sqlite

Steampipe SQLite is a zero-ETL engine for SQLite. Virtual tables translate queries into live API calls for cloud services and APIs. Hundreds of plugins with thousands of documented examples.

aws azure data devsecops etl gcp golang kubernetes security sql sqlite steampipe steampipe-engine zero-etl

Last synced: 28 Jul 2025

https://github.com/mysociety/yournextrepresentative

A website for crowd-sourcing structured election candidate data

civic-tech data elections politics

Last synced: 02 Aug 2025

https://github.com/vijinho/iso-country-data

ISO Country data in JSON and CSV format.

country-data currencies data iso-country json-data

Last synced: 21 Jan 2026

https://github.com/emberexperts/ember-await

Await component for Ember Applications. Resolve your data on demand, just when needed.

await data ember-addon ember-await emberjs javascript loading

Last synced: 12 Apr 2025

https://github.com/jonschlinkert/cache-base

Basic object store with methods like get/set/extend/omit

cache config data dot-notation emit extend inherit javascript node nodejs object store

Last synced: 05 Apr 2025

https://github.com/tg12/script-toolbox

This repository contains a collection of scripts and tools that I have written to solve various problems that I have come across.

data exif exif-metadata exif-reader exiftool scripts scripts-collection toolbox toolkit toolkits

Last synced: 27 Jul 2025

https://github.com/datakitchen/dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution through data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, and continuous anomaly monitoring.

data data-engineering data-observability data-quality data-science data-testing datachecker dataops dataprofiling dataquality datavalidation mssql postgresql python redshift self-hosted snowflake

Last synced: 06 Apr 2025

https://github.com/alinski29/stonks.jl

Julia library for standardizing financial data retrieval and storage from multiple APIs.

data data-mining data-science dataframe finance julia trading trading-algorithms

Last synced: 06 May 2025

https://github.com/zachowj/xfinity-data-usage

Fetch Xfinity data usage and serve it via an HTTP endpoint, publish it to MQTT, or post it to a URL.

data home-assistant mqtt usage xfinity xfinity-data

Last synced: 09 Mar 2025

https://github.com/zbrookle/sql_to_ibis

A Python package that parses SQL and converts it to Ibis expressions

data databases dataframes etl hacktoberfest ibis sql

Last synced: 14 Apr 2025

https://github.com/ioos/bio_data_guide

Standardizing Marine Biological Data Working Group - An open community to facilitate the mobilization of biological data to OBIS.

darwin-core data data-management marine-biology marine-data obis tutorials

Last synced: 30 Oct 2025

https://github.com/mcarlucci/decky-storage-cleaner

A Decky Loader plugin for tidying up your Steam Deck's storage. Quickly visualize, select and clear shader cache and compatibility data.

cache cleaner compat compatibility data decky decky-loader plugin shader steam steamdeck storage utility

Last synced: 03 Mar 2025

https://github.com/spratiher9/sparkora

Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟

apache apache-spark data data-analysis data-analysis-python data-analytics easy-to-use eda exploratory-data-analysis open-source opensource pyspark python python3 toolkit

Last synced: 10 Jul 2025

https://github.com/OElesin/querypal

Web UI for Amazon Athena

analytics aws aws-athena data data-lake sql

Last synced: 30 Jul 2025