An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/zoilomora/xiaomi-mi-fit-data-export

Export data from Xiaomi Mi Fit in the most automated way possible

data export mi-band scraping xiaomi

Last synced: 26 Jan 2026

https://github.com/anthonydb/data-wrangling-python-nicar-2017

Materials for the NICAR 2017 Data Wrangling with Python hands-on class

agate csvkit data jupyter nicar-2017 nicar17 python

Last synced: 22 Apr 2025

https://github.com/viraltux/datawrangler.jl

Data transformation tools for analytics

data transformations wrangling

Last synced: 15 Aug 2025

https://github.com/kachayev/timely0

Minimalistic implementation of Naiad paper "A Timely Dataflow System" in Scala

data dataflow dataflow-programming graph timely

Last synced: 12 Apr 2025

https://github.com/sap-samples/salt

Source code and data for the paper "SALT: Sales Autocompletion Linked Business Tables Dataset"

business data dataset linked tables

Last synced: 09 Mar 2026

https://github.com/dimitryzub/ecommerce-scraper-py

Scrape ecommerce websites such as Amazon, eBay, Walmart, Home Depot, Google Shopping from a single module in Python🐍

data datamining ecommerce ecommerce-website python python3 selectolax selenium serpapi webscraper webscraping

Last synced: 03 Sep 2025

https://github.com/purarue/autotui

quickly create UIs to interactively prompt, validate, and persist python objects to disk (JSON/YAML) and back using type hints

cli data deserialization json namedtuple serialization tui typehints

Last synced: 16 Mar 2025

https://github.com/ludbek/validatex

A simple yet powerful data validator for javascript.

async data javascript schema validator

Last synced: 29 Jul 2025

https://github.com/the-swarm-corporation/backtesteragent

An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimization.

ai backtesting data finance finance-agents jpmorgan yahoofinance

Last synced: 13 Oct 2025

https://github.com/cfpb/cfpb-chart-builder

Charts for the Consumer Financial Protection Bureau

chart data visualization

Last synced: 09 Apr 2025

https://github.com/edgararuiz-zz/datalang

Package to translate R data sets

data rlang translation

Last synced: 21 Apr 2025

https://github.com/williamtroup/tree.js

🌲 A lightweight JavaScript library that allows you to create responsive and customizable interactive tree diagrams from an array of JS objects.

css3 data grid html5 javascript map tree treemap visualization

Last synced: 15 Apr 2025

https://github.com/alexanderkasten/use-next-sse

use-next-sse is a lightweight and easy-to-use React hook library for implementing Server-Sent Events (SSE) in Next.js applications, enabling real-time, unidirectional data streaming from server to client.

data events eventsource nextjs react real-time server-sent-events serverless sse streaming

Last synced: 19 Feb 2026

https://github.com/kyryl-opens-ml/ml-in-production-practice

Practice for Machine Learning in Production course

data inference-api infrastructure llm ml mlops monitoring pipelines platform

Last synced: 16 Jan 2026

https://github.com/mzjp2/kedro-dataframe-dropin

A Kedro plugin that provides pandas dropin replacements for the pandas datasets (e.g modin and cuDF)

data gpu-acceleration kedro-catalog kedro-plugin modin rapidsai

Last synced: 09 Mar 2026

https://github.com/bst04/tools-for-data-recovery

All tools for Data Recovery

data data-recovery programs recovery tools

Last synced: 29 Jun 2025

https://github.com/klaudiosinani/avlbinstree

AVL self-balancing binary search trees for ES6

avl balancing binary data es6 search self structure tree typescript

Last synced: 24 Apr 2025

https://github.com/nas5w/imdb-data

A JSON file of 50,000 IMDB movie reviews to be used in machine learning applications.

data data-science imdb javascript machine-learning

Last synced: 19 Apr 2025

https://github.com/fleschutz/base256u

C++ sample implementation of base256 encoding using Unicode characters.

base256 base256u binary data encoding unicode

Last synced: 05 May 2025

https://github.com/moscarde/foliumtools

Um breve guia de como utilizar ferramentas básicas do Folium para gerar mapas interativos através de dados geográficos em conjunto com Python, Pandas e GoogleMaps.

data data-visualization folium googlemaps-api maps pandas phyton

Last synced: 14 Apr 2025

https://github.com/benjamincharity/angular-json-calendar

:calendar: An AngularJS module that generates calendar data as a JSON object and/or HTML :muscle:

angularjs calendar data date

Last synced: 22 Mar 2025

https://github.com/ineelhere/clintrialx

R package to fetch and explore clinical trials data from freely available registries. Fetch data in bulk, customize data and build comprehensive html reports. Currently, it supports the ClinicalTrials.gov registry and CTTI AACT (Access to Aggregate Content of ClinicalTrials.gov).

aact bioinformatics clinical-data clinical-trials clinicaltrialsgov ctti data data-management medical-informatics r-language r-package trials

Last synced: 28 Oct 2025

https://github.com/streamr-dev/datav2

The second incarnation of the DATA token

data ethereum streamr token

Last synced: 13 Jul 2025

https://github.com/ruofeidu/ducrawler

An automatic crawler to mine images from Google and Bing Image search (part of SketchyScene at ECCV 2018)

data google image mining python search

Last synced: 11 Apr 2025

https://github.com/warisgill/feddefender

FedDefender is a novel defense mechanism designed to safeguard Federated Learning from the poisoning attacks (i.e., backdoor attacks).

backdoor-attacks data differential-testing federated-learning poisoning-attack

Last synced: 21 Mar 2025

https://github.com/randomfractals/tabular-data-viewer

Tabular Data Viewer 🀄 VSCode extension for viewing very large local and remote CSV and TSV data files with Tabulator Table, Perspective Datagrid and D3FC Chart Views 📊📈

charts csv d3fc data data-packages datapackage dsv flat-data large-data perspective remote-data tabular tabulator tsv view viewer vscode

Last synced: 11 Apr 2025

https://github.com/erichenry/swagger-data-gen

Tool to generate random data from a Sagger/OpenAPI spec

cli data generator mock-data openapi openapi-specification random swagger tool

Last synced: 14 Apr 2025

https://github.com/jonschlinkert/merge-configs

Find, load and merge JSON and YAML config settings from one or more files, in the specified order.

combine conf config configuration data eslint find jonschlinkert lookup merge namespace node nodejs object package rc runtime-config search store

Last synced: 26 Jun 2025

https://github.com/dhhruv/stock-price-prediction

A deep learning project in which the model was trained using LSTM layers and Tata Stock prices were predicted and compared with thier actual values.

algorithm cli college-project data data-science dataset deep-learning jupyter jupyter-notebook lstm machine-learning prediction science shell stock-price-prediction tata-beverages terminal

Last synced: 03 May 2025

https://github.com/ronin-co/client

Access RONIN via TypeScript.

bun client data javascript js library node nodejs query ts typescript

Last synced: 10 Apr 2025

https://github.com/spatialcurrent/go-simple-serializer

Simple library and command line program for converting between JSON, YAML, TOML, and many more common serialization formats.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/ethicnology/blockchain-ekstrakto

Blockchain ekstrakto is a Python program which extracts all Bitcoin blockchain data using Bitcoin Core.

data

Last synced: 11 Oct 2025

https://github.com/abcnews/census-100-people

Census 2016: This is Australia as 100 people

australia census data visualisation

Last synced: 27 Jan 2026

https://github.com/jcrodriguez1989/thesimpsons

Package (dataset): The Simpsons episodes dataset

data dataset r simpsons

Last synced: 26 Oct 2025

https://github.com/haxetink/tink_streams

Streams from the future. With lasers, of course ... whoaaaaa!!!!

data gadt haxe immutable streams tink

Last synced: 27 Jan 2026

https://github.com/fairdataihub/fairdataihub.org

Website of the FAIR Data Innovations Hub

data fair nextjs organization software typescript

Last synced: 03 May 2026

https://github.com/richardlitt/ebird-ext

Tools for doing stuff with eBird data

birding birds community-science data ebird science

Last synced: 27 Oct 2025

https://huanglii.github.io/awesome-gis-rs-data/

Awesome GIS RS Data

awesome data gis rs

Last synced: 29 Apr 2025

https://github.com/qzcool/cpef

私募基金管理人查询数据接口。Chinese Private Equity Funds APIs.

china crawler data finance fund funds hedge-funds private-equity python python3 scraper scraping-websites spider

Last synced: 26 Feb 2026

https://github.com/jacquietran/wnblr

An R package containing game stats from the Women's National Basketball League (WNBL).

basketball data r r-package wnbl

Last synced: 06 Oct 2025

https://github.com/seancolsen/music-theory-data

A data source containing names and details for musical scales, chords, intervals, and notes.

data music music-theory musical-scales musicology

Last synced: 17 Mar 2026

https://github.com/aoemods/attrib

Age of Empires 4 attribute dump, keeping track of patch changes. Converted with AOEMods.Essence. Join our discord on the website link.

age-of-empires-iv aoe4 data information mods stats

Last synced: 23 Feb 2026

https://github.com/bradlindblad/cheatsheet

A simple package to grab cheat sheets and save them to your local computer

cheatsheets data datascience r

Last synced: 22 Oct 2025

https://github.com/bfolkens/pandas-datareader-gdax

GDAX data for Pandas in the style of DataReader

bitcoin cryptocurrency data data-analysis dataset finance gdax pandas quant

Last synced: 28 Jan 2026

https://github.com/hodur-org/hodur-graphviz-schema

Hodur is a domain modeling approach and collection of libraries to Clojure. By using Hodur you can define your domain model as data, parse and validate it, and then either consume your model via an API or use one of the many plugins to help you achieve mechanical results faster and in a purely functional manner.

clojure data modeling schema visualization

Last synced: 12 Dec 2025

https://github.com/hyriver/pydaymet

A part of HyRiver software stack for retrieving and post-processing climate data from the Daymet Webservice.

climate data daymet hydrology python webservice

Last synced: 12 Dec 2025

https://github.com/njonsson/structured_io

An Elixir API for consuming structured input streams

binary data elixir io parser stream

Last synced: 12 Oct 2025

https://github.com/judehunter/reactivefile

Parse and reactively auto-save JSON, TOML, YAML and any other data file with ease.

data data-structures file filesystem javascript reactive reactive-programming typescript typescript-definitions

Last synced: 15 Apr 2025

https://github.com/kishyassin/goframe

goframe is a Go package inspired by Python's pandas, designed for data manipulation and analysis.

data dataframe go goframe golang good-first-issue package

Last synced: 04 Oct 2025

https://github.com/huemulsolutions/huemul-bigdatagovernance

Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de derechos ARCO para facilitar la implementación de leyes de protección de datos tipo GDPR, identificar los niveles de seguridad y si se está aplicando algún tipo de encriptación. Adicionalmente permite agregar reglas de validación más complejas sobre la misma tabla.

bigdata chile cloudera data data-engineer data-engineering data-governance data-warehouse datamart dataquality gdpr hadoop hive hortonworks huemul huemul-bigdatagovernance parquet spark spark-sql trabaja-sobre-spark

Last synced: 26 Apr 2025

https://github.com/philss/brazil-in-notebooks

A collection of notebooks presenting data about Brazil.

brazil data data-visualization elixir livebook vega-lite

Last synced: 13 Oct 2025

https://github.com/CreateAndFake/CreateAndFake

A C# class library that handles mocking, test data generation, and validation.

clone cloning create data equality fake mock mocking mocks random randomization stubs testing tests

Last synced: 26 Apr 2025

https://github.com/emreyalvac/sulfur

Shaping, Processing, and Transforming Data with the Power of Sulfur with Rust

data data-analysis data-flow database

Last synced: 19 Aug 2025

https://github.com/domaindrivenarchitecture/data-test

framework for data-driven tests

clojure data test test-driven-development

Last synced: 13 Aug 2025

https://github.com/mgroves/sqlservertocouchbase

Library to automatically best effort move and remodel data from relational databases (like SQL Server) to Couchbase

couchbase data json migration sql sql-server tables

Last synced: 26 Sep 2025

https://github.com/dpguthrie/bankfind

Python interface to the FDIC's API for publically available bank data

api api-wrapper banking data finance pandas python united-states

Last synced: 02 Aug 2025

https://github.com/isoverse/isoreader

Read IRMS (Isotope Ratio Mass Spectrometry) data files into R

data ecology geochemistry isotopes r

Last synced: 19 Feb 2026

https://github.com/hschne/sommerkinos

Kalendar für Sommerkinos in Wien

austria calendar data vienna

Last synced: 08 Jan 2026

https://github.com/prioritizr/aoh

Create Area of Habitat Data

conservation data ecology gis-data rstats rstats-package

Last synced: 15 Apr 2025

https://github.com/fwd/reddit

Graph Visualization UI for Reddit.

data data-science datasets worldnews

Last synced: 24 Apr 2025

https://github.com/giacomopiccinini/rush

Swiss-army knife for media inspection and manipulation

cli data data-engineering multimedia rust

Last synced: 09 Mar 2026

https://github.com/durgeshsamariya/100daysofdatascience

A 100 Day DS Challenge to learn and implement DS concepts ranging from the beginner of Data Science to Data Scientist.

100days 100daysofcode 100daysofdscode 100daysofmlcode data data-science

Last synced: 15 Apr 2025

https://github.com/mitevpi/urban-insights-frontend

Winning AEC Hackathon 2019 Silicon Valley Project. AR/VR Application for visualizing proposed buildings on their sites and overlaying environmental and zoning analysis.

a-frame analysis ar architecture city computation data design rhino smart ui vr vue vuejs

Last synced: 08 Aug 2025

https://github.com/alemidev/dashboard

my custom data collector and visualizer dashboard

dashboard data egui rust timeseries

Last synced: 29 Oct 2025

https://github.com/aradfarahani/Geoelectricspy

This project presents an interactive 3D visualization of subsurface resistivity using geoelectric data. It utilizes Python and its scientific libraries to process and visualize the data across multiple depths, ranging from 1 meter to 464 meters.

data goelectrics visualization

Last synced: 28 Mar 2025

https://github.com/ikramagix/faussaire

Generate ultra-realistic fake data in French and Greek with credible, context-aware, and culturally relevant options.

data faker-generator france french french-language french-speaking french-translation gem rspec rspec-testing rspec-tests ruby ruby-gem ruby-on-rails rubygem rubygems testing yaml

Last synced: 10 Apr 2025

https://github.com/daniel-frak/dummy4j

An extensible dummy/fake data generator library for Java

data dummy faker generator java

Last synced: 12 Jan 2026

https://github.com/justintime50/dad-node

Dummy Address Data (DAD) - Retrieve real addresses from all around the world. (Node Client Library)

address addresses dad data dataset dummy dummy-data real retrieving-addresses world

Last synced: 30 Apr 2025

https://github.com/kovah/taboo-data

A data set for Taboo games. Plain JSON files which contain the keyword and some buzzwords like in the original Taboo game

data data-structures dataset datasets game game-resources taboo tabu

Last synced: 14 Apr 2025

https://github.com/tusharnankani/data-analysis-with-python

A complete introduction to data analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data data-analysis data-visualization hacktoberfest jovian jovian-ml jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 07 May 2025

https://github.com/buccaneerai/rxjs-stats

Moved to @bottlenose/rxstats (https://github.com/buccaneerai/bottlenose)

analytics data data-mining data-science observables reactive rxjs statistics

Last synced: 15 Jul 2025

https://github.com/yagoluiz/meuremedio-extracao

[PT-BR] Extração de dados de preço de medicamentos disponibilizados pela ANVISA

data extraction python3

Last synced: 15 Jul 2025

https://github.com/gamemann/xdp-access-last-byte

Repository to store information accessing the last byte of a packet in BPF and XDP.

bpf byte data express last lastbyte network network-programming networking packet path payload processing xdp

Last synced: 18 Mar 2025

https://github.com/lamm-mit/moleculediffusiontransformer

Molecular generation using diffusion models and autoregressive transformer models

ai chemistry data design dft generative modeling molecular-design quantum-mechanics

Last synced: 13 Apr 2025

https://github.com/ismet55555/logging-for-matlab

A flexible message logger for your MATLAB scripts and programs

best-practices class daq data logging logging-library matlab mfiles utility

Last synced: 20 Mar 2025

https://github.com/blackcipher101/spaceye

A tool to decode star spectrum images using OpenCV to make predictions about its charatestics.

data data-visualizations hacktoberfest opencv pysimplegui space

Last synced: 20 Mar 2025

https://github.com/keshiarose/toggl-web-data-connector

A Tableau Web Data Connector that pulls in data from the Detail Reports view from toggl.com

data tableau tableau-desktop toggl toggl-track wdc web-data-connector

Last synced: 24 Dec 2025

https://github.com/akoury/ml-helper

Python library with helpers to speed up and structure machine learning projects.

data data-visualization machine-learning ml python scikit-learn sklearn

Last synced: 24 Oct 2025

https://github.com/contextlab/data-wrangler

Wrangle messy numerical, image, and text data into consistent well-organized formats

data data-analysis data-science data-wrangling hugging-face image-data machine-learning nlp numpy pandas python scikit-learn

Last synced: 10 Apr 2025

https://github.com/cfnptr/pack

Runtime optimized multi-platform data packing library for realtime game resources loading

c c99 compression compressor container cpp cross-platform csharp data library multi-platform pack package packer packing resource resources runtime storage zstd

Last synced: 30 Oct 2025

https://github.com/zhaochenyang20/tsing-answer

为清华大学答疑坊所写的数据分析脚本

data json

Last synced: 02 Sep 2025

https://github.com/john-sandall/maven

Maven provides easy access to open datasets in both raw and model-ready formats.

data etl maven open pipeline python

Last synced: 27 Mar 2026

https://github.com/karanpratapsingh/scale-etl

Partition, Transform, Load, and Search large CSV files

concurrency data etl golang

Last synced: 10 Jul 2025

https://github.com/vin-cento/fakesnake

Do you need to quickly generate a 🎲random dataset to work with? This is the 🔨tool for you! You can generate a quick list of random or quickly populate a 🏬database.

data database python

Last synced: 04 Nov 2025

https://github.com/hinto-janai/someday

Lock-free MVCC primitive

atomic concurrency data lock-free mvcc rust

Last synced: 18 Mar 2025