An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/blenderskool/lexico

🔭 Powerful data searching with terse syntax

data dsl extended fuzzy hacktoberfest indexes logic operators search

Last synced: 12 Apr 2025

https://github.com/jravi2/chat-analyzer

A Python program to analyze your social chats

chat-analysis data python

Last synced: 12 Apr 2025

https://github.com/agile-lab-dev/governance-decision-record

The Governance Decision Record (GDR) is a specification model for (computational) data governance policies, inspired by the ADR (Architectural Decision Record).

architectural-decision-records data data-governance data-management data-management-platform data-mesh federated-computational-governance governance-decision-record platform policy-as-code

Last synced: 30 Jun 2025

https://github.com/benkingcode/react-data-fetching-components

♻️ Asynchronously load data for your React components with SSR

async code-splitting data loading react server-side-rendering ssr

Last synced: 14 Apr 2025

https://github.com/lungben/tableio.jl

A glue package for reading and writing tabular data. It aims to provide a uniform API for reading and writing tabular data from and to multiple sources.

arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip

Last synced: 12 Oct 2025

https://github.com/hoangsonww/standard-deviation-calculator

📊 This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.

algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations

Last synced: 22 Sep 2025

https://github.com/lazarus-org/dj-data-generator

A package for generating data

data django django-packages fake-data python

Last synced: 28 Jun 2025

https://github.com/sighrobot/jqp

A serverless proxy for filtering JSON using node-jq

data jq json nextjs node proxy public

Last synced: 13 Jan 2026

https://github.com/okthought/data-mapper

A declarative data mapper

data mapper python

Last synced: 15 May 2025

https://github.com/zoilomora/xiaomi-mi-fit-data-export

Export data from Xiaomi Mi Fit in the most automated way possible

data export mi-band scraping xiaomi

Last synced: 26 Jan 2026

https://github.com/thorium/fsharp.data.jsonprovider.serializer

Plugin to replace FSharp.Data.JsonProvider default serialization with fast System.Text.Json

data fsharp json typeprovider

Last synced: 23 Jul 2025

https://github.com/dotnet-ad/Faker.Portable

C# fake data generation for testing and prototyping purposes.

data fake model stub

Last synced: 27 Jun 2025

https://github.com/rxn4chemistry/rxn-reaction-preprocessing

Preprocessing of datasets of chemical reactions: standardization, filtering, augmentation, tokenization, etc.

chemistry data processing reactions retrosynthesis rxn

Last synced: 16 Jan 2026

https://github.com/rousan/samples-viewer-generator

🎉 A CLI utility to generate a web app of data visualization samples for presentation purposes

boilerplate chart cli data generator viz

Last synced: 13 Jul 2025

https://github.com/lmangani/videoparquet

Python library for converting Parquet files (containing array-like or tabular data) to video files and back using ffmpeg

data ffmpeg parquet videodata

Last synced: 07 Oct 2025

https://github.com/ask-sage/asksage-open-source-community

🚀 Welcome to the AskSage Open Source Repo, a community aimed at providing code 🧑‍💻, documentation 📝, and proof-of-concept projects for interacting with the AskSage API. This repository serves as an unofficial resource for paid subscribers seeking practical guidance on utilizing the API. If you have any questions, email us at asksage.community@gmail.com.

agnostic-llm api artificial-intelligence asksage chatgpt cohere data gemini generative-ai government large-language-model large-language-models llama machine-learning open-source openai prompt-engineering rag

Last synced: 06 Oct 2025

https://github.com/emptymalei/audiorepr

A Python package to represent data using musical notes.

audiolization data data-audiolization data-science

Last synced: 12 Oct 2025

https://github.com/lisad/phaser

The missing layer for complex data batch integration pipelines

data data-integration etl etl-pipeline

Last synced: 23 Apr 2025

https://github.com/anthonydb/data-wrangling-python-nicar-2017

Materials for the NICAR 2017 Data Wrangling with Python hands-on class

agate csvkit data jupyter nicar-2017 nicar17 python

Last synced: 22 Apr 2025

https://github.com/vincetran96/scrape-finance-data

My code for scraping financial data in Vietnam

data finance model python scrapy stock

Last synced: 17 Jan 2026

https://github.com/codeforcroatia/open-data

Croatia open data repository (Open Data HR)

data datasets open-data open-datasets opendata

Last synced: 27 Feb 2025

https://github.com/davdroman/swift-builders

Result builders for Swift and Foundation types

array builder data dictionary resultbuilder set string substring swift

Last synced: 09 Apr 2025

https://github.com/floriank13/skyimages

Download and manage sky images for solar power forecasting.

data dataset datasets energy forecasting machine-learning python python-package pytorch

Last synced: 13 May 2025

https://github.com/klaudiosinani/kiu

FIFO Queues for ES6

data es6 fifo queue structure typescript

Last synced: 24 Apr 2025

https://github.com/base-repos/base-pkg

Base plugin for adding a `pkg` object with get/set methods for getting data from package.json or setting data to package.json.

config data libary module npm package package-json project store

Last synced: 07 May 2025

https://github.com/electricbolt/bindkit

Two-way data binding framework for iOS. Only one API to learn.

binding data ios objective-c reactive rxswift swift uikit

Last synced: 04 May 2025

https://github.com/goodforgod/dummymaker

🧰 Generates random Java Classes/Records for you.

bean data dto dummy easy-random entities export factory fill generate java pojo random random-generation record

Last synced: 20 Mar 2025

https://github.com/dawidolko/algorithms-data-structures

Laboratory tasks from studies

data java labs structures works

Last synced: 06 Apr 2025

https://github.com/hgraph-os/hgraph-react

(Note: Use main hGraph repo) An open source visualization for patient health data, as a React component using d3.

boston care-plan data design health-data healthcare hgraph open-source patient-health-data ux visualization wellness

Last synced: 12 Jan 2026

https://github.com/zqzess/openapidata

Stores collected API data, updated automatically on a schedule by a crawler

api data

Last synced: 28 Oct 2025

https://github.com/newrelic-experimental/newrelic-logs-for-salesforce-commerce-cloud

A Docker image purpose-built to monitor Salesforce Commerce Cloud (fka Demandware) logs using New Relic Logs.

data data-acquisition new-relic-logs newrelic nrlabs nrlabs-odp observability-data salesforce-commerce-cloud

Last synced: 10 Apr 2025

https://github.com/travishorn/diabetes-food-database

Food information for people with diabetes.

data database diabetes diabetic food food-safety information medical

Last synced: 14 Apr 2025

https://github.com/ipeagit/aopdata

Download data from the Access to Opportunities Project (AOP)

data r transport transport-accessibility transport-planning

Last synced: 02 May 2025

https://github.com/aria-ml/dataeval

Python library for analyzing data quality and its impact on model performance across classification and object-detection tasks.

ai bias-detection data linter metrics out-of-distribution-detection outlier-detection sufficiency

Last synced: 16 Jan 2026

https://github.com/jonschlinkert/merge-configs

Find, load and merge JSON and YAML config settings from one or more files, in the specified order.

combine conf config configuration data eslint find jonschlinkert lookup merge namespace node nodejs object package rc runtime-config search store

Last synced: 26 Jun 2025

https://github.com/fleschutz/base256u

C++ sample implementation of base256 encoding using Unicode characters.

base256 base256u binary data encoding unicode

Last synced: 05 May 2025

https://github.com/ineelhere/clintrialx

R package to fetch and explore clinical trials data from freely available registries. Fetch data in bulk, customize data and build comprehensive html reports. Currently, it supports the ClinicalTrials.gov registry and CTTI AACT (Access to Aggregate Content of ClinicalTrials.gov).

aact bioinformatics clinical-data clinical-trials clinicaltrialsgov ctti data data-management medical-informatics r-language r-package trials

Last synced: 28 Oct 2025

https://github.com/ruofeidu/ducrawler

An automatic crawler to mine images from Google and Bing Image search (part of SketchyScene at ECCV 2018)

data google image mining python search

Last synced: 11 Apr 2025

https://github.com/moscarde/foliumtools

A brief guide to using basic Folium tools to generate interactive maps from geographic data, together with Python, Pandas, and Google Maps.

data data-visualization folium googlemaps-api maps pandas phyton

Last synced: 14 Apr 2025

https://github.com/randomfractals/tabular-data-viewer

Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈

charts csv d3fc data data-packages datapackage dsv flat-data large-data perspective remote-data tabular tabulator tsv view viewer vscode

Last synced: 11 Apr 2025

https://github.com/ronin-co/client

Access RONIN via TypeScript.

bun client data javascript js library node nodejs query ts typescript

Last synced: 10 Apr 2025

https://github.com/streamr-dev/datav2

The second incarnation of the DATA token

data ethereum streamr token

Last synced: 13 Jul 2025

https://github.com/warisgill/feddefender

FedDefender is a novel defense mechanism designed to safeguard Federated Learning from poisoning attacks (i.e., backdoor attacks).

backdoor-attacks data differential-testing federated-learning poisoning-attack

Last synced: 21 Mar 2025

https://github.com/dhhruv/stock-price-prediction

A deep learning project in which a model was trained using LSTM layers to predict Tata stock prices, which were then compared with their actual values.

algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal

Last synced: 03 May 2025

https://github.com/kyryl-opens-ml/ml-in-production-practice

Practice for Machine Learning in Production course

data inference-api infrastructure llm ml mlops monitoring pipelines platform

Last synced: 16 Jan 2026

https://github.com/erichenry/swagger-data-gen

Tool to generate random data from a Swagger/OpenAPI spec

cli data generator mock-data openapi openapi-specification random swagger tool

Last synced: 14 Apr 2025

https://github.com/benjamincharity/angular-json-calendar

📅 An AngularJS module that generates calendar data as a JSON object and/or HTML 💪

angularjs calendar data date

Last synced: 22 Mar 2025

https://github.com/tuanai-vireox/gcp-professional-data-engineer

GCP Professional Data Engineer Certification - Learning

data dataengineering gcp professional

Last synced: 22 Aug 2025

https://github.com/viraltux/datawrangler.jl

Data transformation tools for analytics

data transformations wrangling

Last synced: 15 Aug 2025

https://github.com/bst04/tools-for-data-recovery

All tools for Data Recovery

data data-recovery programs recovery tools

Last synced: 29 Jun 2025

https://github.com/kachayev/timely0

Minimalistic implementation of the Naiad paper "A Timely Dataflow System" in Scala

data dataflow dataflow-programming graph timely

Last synced: 12 Apr 2025

https://github.com/nas5w/imdb-data

A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.

data data-science imdb javascript machine-learning

Last synced: 19 Apr 2025

https://github.com/williamtroup/tree.js

🌲 A lightweight JavaScript library that allows you to create responsive and customizable interactive tree diagrams from an array of JS objects.

css3 data grid html5 javascript map tree treemap visualization

Last synced: 15 Apr 2025

https://github.com/ludbek/validatex

A simple yet powerful data validator for JavaScript.

async data javascript schema validator

Last synced: 29 Jul 2025

https://github.com/spatialcurrent/go-simple-serializer

Simple library and command line program for converting between JSON, YAML, TOML, and many more common serialization formats.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/cfpb/cfpb-chart-builder

Charts for the Consumer Financial Protection Bureau

chart data visualization

Last synced: 09 Apr 2025

https://github.com/dimitryzub/ecommerce-scraper-py

Scrape ecommerce websites such as Amazon, eBay, Walmart, Home Depot, Google Shopping from a single module in Python🐍

data datamining ecommerce ecommerce-website python python3 selectolax selenium serpapi webscraper webscraping

Last synced: 03 Sep 2025

https://github.com/the-swarm-corporation/backtesteragent

An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.

ai backtesting data finance finance-agents jpmorgan yahoofinance

Last synced: 13 Oct 2025

https://github.com/mzjp2/kedro-dataframe-dropin

A Kedro plugin that provides pandas drop-in replacements for the pandas datasets (e.g., modin and cuDF)

data gpu-acceleration kedro-catalog kedro-plugin modin rapidsai

Last synced: 09 Apr 2025

https://github.com/purarue/autotui

Quickly create UIs to interactively prompt, validate, and persist Python objects to disk (JSON/YAML) and back using type hints

cli data deserialization json namedtuple serialization tui typehints

Last synced: 16 Mar 2025

https://github.com/haxetink/tink_streams

Streams from the future. With lasers, of course ... whoaaaaa!!!!

data gadt haxe immutable streams tink

Last synced: 27 Jan 2026

https://github.com/jcrodriguez1989/thesimpsons

Package (dataset): The Simpsons episodes dataset

data dataset r simpsons

Last synced: 26 Oct 2025

https://github.com/edgararuiz-zz/datalang

Package to translate R data sets

data rlang translation

Last synced: 21 Apr 2025

https://github.com/ethicnology/blockchain-ekstrakto

Blockchain ekstrakto is a Python program which extracts all Bitcoin blockchain data using Bitcoin Core.

data

Last synced: 11 Oct 2025

https://github.com/abcnews/census-100-people

Census 2016: This is Australia as 100 people

australia census data visualisation

Last synced: 27 Jan 2026

https://github.com/klaudiosinani/avlbinstree

AVL self-balancing binary search trees for ES6

avl balancing binary data es6 search self structure tree typescript

Last synced: 24 Apr 2025

https://github.com/philss/brazil-in-notebooks

A collection of notebooks presenting data about Brazil.

brazil data data-visualization elixir livebook vega-lite

Last synced: 13 Oct 2025

https://github.com/bradlindblad/cheatsheet

A simple package to grab cheat sheets and save them to your local computer

cheatsheets data datascience r

Last synced: 22 Oct 2025

https://github.com/hodur-org/hodur-graphviz-schema

Hodur is a domain modeling approach and collection of libraries for Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.

clojure data modeling schema visualization

Last synced: 12 Dec 2025

https://github.com/huemulsolutions/huemul-bigdatagovernance

Huemul BigDataGovernance is a framework that runs on Spark, Hive, and HDFS. It enables the implementation of a corporate single-source-of-data strategy based on data governance best practices. It lets you implement tables with Primary Key and Foreign Key checks when inserting and updating data through the library, along with validation of nulls, text lengths, minimum/maximum values for numbers and dates, unique values, and default values. It also lets you classify fields by the applicability of ARCO rights, to ease compliance with GDPR-style data protection laws, and identify security levels and whether any kind of encryption is being applied. Additionally, it lets you add more complex validation rules on the same table.

bigdata chile cloudera data data-engineer data-engineering data-governance data-warehouse datamart dataquality gdpr hadoop hive hortonworks huemul huemul-bigdatagovernance parquet spark spark-sql trabaja-sobre-spark

Last synced: 26 Apr 2025

https://github.com/fairdataihub/fairdataihub.org

Website of the FAIR Data Innovations Hub

data fair nextjs organization software typescript

Last synced: 27 Jan 2026

https://github.com/jacquietran/wnblr

An R package containing game stats from the Women's National Basketball League (WNBL).

basketball data r r-package wnbl

Last synced: 06 Oct 2025

https://github.com/CreateAndFake/CreateAndFake

A C# class library that handles mocking, test data generation, and validation.

clone cloning create data equality fake mock mocking mocks random randomization stubs testing tests

Last synced: 26 Apr 2025

https://github.com/hyriver/pydaymet

A part of HyRiver software stack for retrieving and post-processing climate data from the Daymet Webservice.

climate data daymet hydrology python webservice

Last synced: 12 Dec 2025

https://github.com/bfolkens/pandas-datareader-gdax

GDAX data for Pandas in the style of DataReader

bitcoin cryptocurrency data data-analysis dataset finance gdax pandas quant

Last synced: 28 Jan 2026

https://huanglii.github.io/awesome-gis-rs-data/

Awesome GIS RS Data

awesome data gis rs

Last synced: 29 Apr 2025

https://github.com/richardlitt/ebird-ext

Tools for doing stuff with eBird data

birding birds community-science data ebird science

Last synced: 27 Oct 2025

https://github.com/njonsson/structured_io

An Elixir API for consuming structured input streams

binary data elixir io parser stream

Last synced: 12 Oct 2025

https://github.com/hschne/sommerkinos

Calendar of summer cinemas in Vienna

austria calendar data vienna

Last synced: 08 Jan 2026

https://github.com/justintime50/dad-node

Dummy Address Data (DAD) - Retrieve real addresses from all around the world. (Node Client Library)

address addresses dad data dataset dummy dummy-data real retrieving-addresses world

Last synced: 30 Apr 2025

https://github.com/lamm-mit/moleculediffusiontransformer

Molecular generation using diffusion models and autoregressive transformer models

ai chemistry data design dft generative modeling molecular-design quantum-mechanics

Last synced: 13 Apr 2025

https://github.com/aradfarahani/Geoelectricspy

This project presents an interactive 3D visualization of subsurface resistivity using geoelectric data. It utilizes Python and its scientific libraries to process and visualize the data across multiple depths, ranging from 1 meter to 464 meters.

data goelectrics visualization

Last synced: 28 Mar 2025

https://github.com/blackcipher101/spaceye

A tool to decode star spectrum images using OpenCV to make predictions about the star's characteristics.

data data-visualizations hacktoberfest opencv pysimplegui space

Last synced: 20 Mar 2025

https://github.com/buccaneerai/rxjs-stats

Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)

analytics data data-mining data-science observables reactive rxjs statistics

Last synced: 15 Jul 2025

https://github.com/ikramagix/faussaire

Generate ultra-realistic fake data in French and Greek with credible, context-aware, and culturally relevant options.

data faker-generator france french french-language french-speaking french-translation gem rspec rspec-testing rspec-tests ruby ruby-gem ruby-on-rails rubygem rubygems testing yaml

Last synced: 10 Apr 2025

https://github.com/alemidev/dashboard

my custom data collector and visualizer dashboard

dashboard data egui rust timeseries

Last synced: 29 Oct 2025