An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/heikomuller/histore

Library for maintaining snapshots of evolving tabular data sets

data version-control

Last synced: 10 Apr 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin that fetches the JSON data of a public Trello board and stashes the data for each list in a JSON file before your build runs, making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026

https://github.com/elggem/shapeset-generator

Generates varying shapes as training data for neural nets: ShapeSet.

data machine-learning numpy opencv python svg training

Last synced: 11 Apr 2026

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data for digital water meters from the South East Water portal website

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/lemmotresto/migrational

A data migration library

data java migration versioning

Last synced: 30 Oct 2025

https://github.com/Ekey/ER.DATA.Tool

Tool for extracting data archives from the mobile game Earth Revival (Project Arrival)

data earth-revival idx project-arrival

Last synced: 19 May 2026

https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 03 Mar 2025

https://github.com/dmnsgn/airports-data

Airports data: static, dynamic and custom dump.

airport airports cli data database json

Last synced: 30 Apr 2025

https://github.com/kom-senapati/ghw-data-hacks

🌍 Global Hack Week data projects, 📊 focused on exploration, manipulation, and analysis...

data ghw

Last synced: 12 Mar 2025

https://github.com/jorgeatgu/clau

📊 Responsive, reusable charts with d3

charts d3js d3v5 data data-visualization data-viz graphs

Last synced: 12 Sep 2025

https://github.com/joisino/twinpaper

Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)

causal-inference data research science-of-science

Last synced: 21 Mar 2025

https://github.com/miguelgargallo/next13-fetch-data-turbo

Next13 Fetch Data Request

ai data fetch pylar request

Last synced: 27 Jul 2025

https://github.com/jasondrawdy/compendio

Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.

compendium converters cryptography data extensions generators hashing library security utilities validation windows

Last synced: 18 May 2026

https://github.com/royruddle/vizdataquality

Python package for visualizing data quality

data data-science data-visualization jupyter-notebook missing-data python

Last synced: 05 May 2025

https://github.com/robjg/dido

Data In/Data Out in many formats

csv-parser data etl java json-parser

Last synced: 11 Jan 2026

https://github.com/kale-ko/ejcl

An advanced configuration library for Java with support for local files, mysql databases, and more.

config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml

Last synced: 02 Feb 2026

https://github.com/nowosad/cllc

Country-level Land Cover - categories and transitions

data dataset land-cover land-cover-transitions r

Last synced: 04 Apr 2025

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/courtois-neuromod/anat

Anatomical sub-dataset of Courtois-Neuromod project.

data raw

Last synced: 17 Jan 2026

https://github.com/unaygney/js-challenges-data-structures-and-algorithms

Repo of the challenges I'm trying to solve to understand data structures and algorithms.

algorithms-and-data-structures data javascript structure

Last synced: 29 Oct 2025

https://github.com/thekartikeyamishra/data_cleaning_project

Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.

data dataanalysis matplotlib matplotlib-pyplot pandas python

Last synced: 02 May 2026

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/sun-lab-nbb/sl-experiment

A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.

ataraxis data experiment methods neuroscience

Last synced: 07 Mar 2026

https://github.com/woctezuma/download-steam-banners-data

Data consisting of Steam banners.

data steam steam-api

Last synced: 06 Jan 2026

https://github.com/stanford-oval/medxchange

Medical Data Exchange (MedXchange) platform

data ethereum exchange medical medxchange

Last synced: 16 May 2026

https://github.com/radekbednarik/data_generator

Random data generator using Python. Generate data files with random strings, floats, ints, and dates via the console or TOML files.

csv data generator python python3 random test-data-generator

Last synced: 13 Dec 2025

https://github.com/polina-prokofieva/viewjson

A class for convenient visualization of JSON, with some settings.

data data-visualization es5 es6 javascript json

Last synced: 15 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/writetome51/big-dataset-paginator

A TypeScript/JavaScript class for pagination in a real-world web app.

app data javascript pagination paginator typescript

Last synced: 17 May 2026

https://github.com/cherylisabella/statistics--caret

Training Regression and Classification Models using caret

data data-analysis data-mining data-science datascience dataset r statistics

Last synced: 24 Jun 2025

https://github.com/max-tonny8/android_web3

An Android library for calling data from a node on the Ethereum or Solana chain

android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j

Last synced: 27 Mar 2025

https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms

Contains code related to data structures

algorithms cplusplus data data-structures

Last synced: 10 Jul 2025

https://github.com/randomfractals/unfolded-map-renderer

Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.

cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode

Last synced: 21 Mar 2025

https://github.com/null-none/jwlibrary2json

Handler for jwlibrary to json

data json jwlibrary library parser

Last synced: 14 May 2026

https://github.com/kabeech/real-dice

Random number generation based on physical media touched by humans

data dice haskell human-computer-interaction physical random random-generation random-number-generators rng

Last synced: 10 Apr 2025

https://github.com/maskedsyntax/covid-tracker

Qt app to keep track of COVID-19 records for different countries.

coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping

Last synced: 29 Mar 2025

https://github.com/heliomarpm/keyvalues-storage

A simple and robust KeyValues Storage library

app-library config data db json json-db keyvalues localdb settings storage store

Last synced: 12 Apr 2025

https://github.com/reppon97/cryptosnake

Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.

api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics

Last synced: 05 Mar 2025

https://github.com/pjmagee/starwars-data

A Star Wars web app with charts and a full timeline of events!

aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom

Last synced: 07 Mar 2026

https://github.com/jeroenvdmeer/feyod

Open data set of matches played by Feyenoord.

data dataset feyenoord open-data sql

Last synced: 25 Jan 2026

https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 27 Mar 2025

https://github.com/coatless-rpkg/ucimlrepo

An unofficial R port of the Python package for downloading data from the UCI ML repository

data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api

Last synced: 28 Jun 2025

https://github.com/real-veersandhu/scifaa-covid-19-project

📈 COVID-19 Data Science Project (2021 Internship @ SCI-FAA)

covid-19 data data-science data-visualization python

Last synced: 14 May 2026

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 14 May 2026

https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data

This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.

2023 23andme ancestry blog data dna genome karbytes website wordpress

Last synced: 12 May 2025

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/moumouls/data-toulouse-grapql

Partial GraphQL support for the Toulouse Métropole Open Data service

api data graphql open service toulouse

Last synced: 16 May 2026

https://github.com/uttamo/messenger-data-analysis

Analyse Messenger data exports from Facebook

analysis data facebook json messenger pandas parse python

Last synced: 18 Jul 2025

https://github.com/horizom/dto

Data Transfer Objects for all PHP applications.

data dto object transfer

Last synced: 14 Sep 2025

https://github.com/masesgroup/datadistributionmanager

A reliable subsystem to distribute data across multiple datacenters using multiple languages (C/C++, .NET, JVM-enabled languages) over multiple technologies (e.g. Apache Kafka, OpenDDS)

apachekafka availability business-solutions businesscontinuity data durability opendds reliability softwareneverlockdown

Last synced: 14 Apr 2025

https://github.com/eosdis-nasa/earthdata-pub-dashboard

Front-end Dashboard for Earthdata Pub

data earthdata edpub publication

Last synced: 15 Jan 2026

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 06 Nov 2025

https://github.com/anonympins/data-primals-engine

Manage and automate your data at scale 🚀 With data-primals-engine you get workflows, dashboards, alerts, i18n, client integration & AI assistant — all open-source, all MongoDB powered.

api automation data data-analysis data-engineer data-visualization database expressjs low-code mongodb nodejs rest-api

Last synced: 07 Mar 2026

https://github.com/dataopstix/modelt

Modelt (mow·delt) is a modern data integration solution that connects data to data for advanced analytics.

airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization

Last synced: 28 Mar 2025

https://github.com/tuananh/opentravel

✈ A collection of travel related data

data travel

Last synced: 09 Oct 2025

https://github.com/pirhoo/spytula

A Python library that provides a simple and convenient way to build JSON and YAML data structures using a builder pattern.

data dsl json python yaml

Last synced: 13 Apr 2026

https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml

Generate synthetic drilling data that can be used for testing machine learning (ML) models.

classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training

Last synced: 08 Apr 2026

https://github.com/ezra-obiwale/vue-doo

Utility library for VueJS

ajax axios data function http js method store utility vue vuejs

Last synced: 07 May 2025

https://github.com/shysolocup/aepl

A Node.js multi-layered class creation package, inspired by Roblox's Studio API, with built-in parenting systems that let you get info from classes above, better function and property makers for development that is easier to read and understand, and modding support.

aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package

Last synced: 28 Oct 2025

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross-validation and resampling when dealing with imbalanced data: what is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/ahmedkhalf/arabic-keyword-scraper

Stop wasting your time! Obtain Arabic definitions without having to look them up.

arabic data definitions scraper sentences wordsearch

Last synced: 12 Mar 2025

https://github.com/chickennungets/ifsc-data-analysis

An implementation of Elo-MMR ranking in the boulder and lead disciplines.

d3 data elo elo-rating ifsc visualization webscraping

Last synced: 20 Jan 2026

https://github.com/richardschoen/ileaccess

Save file packaging of the IBM i libraries ILEastic, NOXDB, and ILEusion for convenient installation. ILEastic allows HTTP microservices to be created on IBM i. NOXDB makes SQL, JSON, and XML easy on IBM i. ILEusion packages it all up to make IBM i data and program call services easier.

as400 call cl command data database db2 http https ibmi ile ileastic ileusion microservice noxdb program queue rpg sql xmlservice

Last synced: 24 Jul 2025

https://github.com/amethyst-php/address

The place where a person or organization can be found or communicated with. Contains fields such as street, postal code, city, and country. Can be used, for example, as a shipment address or an invoice address.

address amethyst amethyst-package api data laravel

Last synced: 13 Aug 2025

https://github.com/stdlib-js/array-base-copy

Copy the elements of an array-like object to a new generic array.

array copy data generic javascript node node-js nodejs stdlib structure types

Last synced: 30 Oct 2025

https://github.com/hmeleiro/alquilermad

Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid

data data-science data-visualization datascience housing-location-visualization rent renting

Last synced: 13 Sep 2025

https://github.com/ciscorn/tinybufr

A Rust library for decoding the BUFR meteorological observation data format

bufr data meteorology rust weather wmo

Last synced: 11 Jan 2026

https://github.com/antoineaugusti/purchasing-power

Archive daily data about purchasing power parity: how much goods should cost in various countries

archive data purchasing-power-parity

Last synced: 28 Oct 2025

https://github.com/stdlib-js/ndarray-base-dtype2c

Return the C data type associated with a provided data type string.

array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 18 Jun 2025

https://github.com/emrecpp/datapacket-cpp

Send, receive, encrypt, decrypt, and compress data as a Packet and send it over a socket in C++.

compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage

Last synced: 28 Mar 2025

https://github.com/meetyildiz/pandazip

Reduce RAM footprint of Pandas DataFrame without losing information.

compression data data-mining data-science machine-learning pandas utilities

Last synced: 20 Jan 2026

https://github.com/smithsonian/massdigi-tools

Scripts that support Mass Digitization projects by the Digitization Program Office, OCIO

data data-validation digital-files digitization digitization-workflows museums

Last synced: 13 Apr 2025

https://github.com/satyam4229/college-predictor-system

The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.

data data-science jupyter-notebook kaggle prediction python

Last synced: 06 Apr 2026

https://github.com/headless-start/data-augmentation-impact

This repository shows the effect of data augmentation of the training set during model training.

augmented-images cuda data gpu keras matplotlib mnist opencv-python python3 tensorflow training-data

Last synced: 05 Apr 2026

https://github.com/shahiakhilesh1304/dsa

This repository is my personal space for practicing Data Structures and Algorithms. I'll be adding questions, solutions, and explanations as I progress, covering topics from basics to advanced.

algorithms code contribution contributions-and-collaboration contributions-welcome data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice hacktober hacktoberfest hacktoberfest-accepted hacktoberfest-starter practice programming python-3

Last synced: 13 Apr 2025

https://github.com/rrighart/rrighart.github.io

A webpage about data science, programming, statistics and related topics

analyses data data-mining programming statistics

Last synced: 20 Jan 2026

https://github.com/danlsn/causality

A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.

data data-engineering datawarehousing personal-data quantified-self

Last synced: 06 Mar 2026

https://github.com/itzshoaib/hashtegrity

A library for generating hashes, validating data integrity, monitoring file/directory integrity, and checking off-chain data integrity

crypto-hash data data-integrity hacktoberfest hash integrity

Last synced: 07 Mar 2026

https://github.com/ange007/jquery.mydata

jQuery.myData - Small jQuery & Zepto plugin for two-way data binding.

data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs

Last synced: 19 May 2026