An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/max-tonny8/android_web3

This is a library for Android to call data from Node on Ethereum Chain or Solana Chain

android blockchain coroutines coroutines-android data eth-call ethereum kotlin ktx retrofit rpc smart-contracts solana web3 web3j

Last synced: 27 Mar 2025

https://github.com/fiedsch/datamanagement

Data management helpers (PHP-CLI)

csv-data data datamanagement helper php

Last synced: 05 Apr 2025

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/uttamo/messenger-data-analysis

Analyse Messenger data exports from Facebook

analysis data facebook json messenger pandas parse python

Last synced: 18 Jul 2025

https://github.com/woctezuma/download-steam-banners-data

Data consisting of Steam banners.

data steam steam-api

Last synced: 06 Jan 2026

https://github.com/ournet/news-sources

A repository of news sources for every country

data news news-sources sources

Last synced: 11 Jul 2025

https://github.com/heikomuller/histore

Library for maintaining snapshots of evolving tabular data sets

data version-control

Last synced: 10 Apr 2025

https://github.com/gavinr/minor-league-baseball

Data set of minor league baseball teams

baseball data hacktoberfest maps open-data open-datasets sports

Last synced: 06 Apr 2025

https://github.com/real-veersandhu/scifaa-covid-19-project

πŸ“ˆ COVID-19 Data Science Project (2021 Internship @ SCI-FAA)

covid-19 data data-science data-visualization python

Last synced: 14 May 2026

https://github.com/felixklauke/atomizer

Playing around with butter knife, android bindings and rx java.

binding butterknife data java react rx rxjava

Last synced: 15 May 2026

https://github.com/danlsn/causality

A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.

data data-engineering datawarehousing personal-data quantified-self

Last synced: 06 Mar 2026

https://github.com/satyam4229/college-predictor-system

The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.

data data-science jupyter-notebook kaggle prediction python

Last synced: 06 Apr 2026

https://github.com/shysolocup/aepl

A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.

aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package

Last synced: 28 Oct 2025

https://github.com/emrecpp/datapacket-cpp

Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C++.

compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage

Last synced: 28 Mar 2025

https://github.com/antvis/create-antv-demo

A simple CV-dashboard framework for practicing how to use AntV.

antv cv dashboard data resume resume-template resume-website visualization

Last synced: 09 Apr 2025

https://github.com/horizom/dto

Data Transfer Objects for all PHP applications.

data dto object transfer

Last synced: 14 Sep 2025

https://github.com/itzshoaib/hashtegrity

A library for generating hash, validating data integrity, monitoring file/directory integrity, offchain data integrity

crypto-hash data data-integrity hacktoberfest hash integrity

Last synced: 07 Mar 2026

https://github.com/radekbednarik/data_generator

Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..

csv data generator python python3 random test-data-generator

Last synced: 13 Dec 2025

https://github.com/andrei-vataselu/data-science-snippets

🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.

artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization

Last synced: 10 Mar 2026

https://github.com/pjmagee/starwars-data

A Star Wars Web app with Charts and entire Timeline events!

aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom

Last synced: 07 Mar 2026

https://github.com/memair/apps

App Store for Memair

apps appstore data data-science quantified-self

Last synced: 06 Apr 2026

https://github.com/maskedsyntax/covid-tracker

Qt app to keep a track of Covid-19 records of different countries.

coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping

Last synced: 29 Mar 2025

https://github.com/utrechtuniversity/dataprivacysurvey

Code for analysing data from the Data Privacy Survey (2022)

data gdpr open-science privacy rdm research research-data-management survey utrecht-university

Last synced: 16 Jun 2025

https://github.com/richardschoen/ileaccess

Save file packaging of IBM i libraries ILEASTIC, NOXDB and ILEUSION for convenience installation. ILEastic allows HTTP microservices to be created on IBM i. NOXDB makes SQL,JSON and XML made easy for IBM i, ILEusion packages it all up to make IBM i data and program call services easier.

as400 call cl command data database db2 http https ibmi ile ileastic ileusion microservice noxdb program queue rpg sql xmlservice

Last synced: 24 Jul 2025

https://github.com/coatless-rpkg/ucimlrepo

An unofficial R port of the Python package to download data off of the UCI ML repository

data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api

Last synced: 28 Jun 2025

https://github.com/ciscorn/tinybufr

A Rust library for decoding BUFR meteorological observation data format

bufr data meteorology rust weather wmo

Last synced: 11 Jan 2026

https://github.com/cherylisabella/statistics--caret

Training Regression and Classification Models using caret

data data-analysis data-mining data-science datascience dataset r statistics

Last synced: 24 Jun 2025

https://github.com/sun-lab-nbb/sl-experiment

A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.

ataraxis data experiment methods neuroscience

Last synced: 07 Mar 2026

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/marlenezw/speech-to-text

Turn any video or audio recording into a written transcript using python

data data-science python speech speech-recognition speech-synthesis speech-to-text

Last synced: 27 Apr 2026

https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml

Generate synthetic drilling data that can be used for testing machine learning (ML) models.

classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training

Last synced: 08 Apr 2026

https://github.com/minightdev/paperclip

Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.

beaches data database pwn pwned search-engine

Last synced: 22 Mar 2025

https://github.com/sharmadhiraj/free-json-datasets

Collection of free JSON data that are scraped and parsed from different websites.

collection crawler data data-scraping datasets json sports statistics web-scraping

Last synced: 28 Mar 2025

https://github.com/zarr-developers/cookiecutter-zarr-store

Cookiecutter for Zarr store implementations

chunked data n-dimensional zarr

Last synced: 16 Jun 2025

https://github.com/antoineaugusti/purchasing-power

Archive daily data about purchasing power parity: how much goods should cost in various countries

archive data purchasing-power-parity

Last synced: 28 Oct 2025

https://github.com/hmeleiro/alquilermad

Housing rent map in Comunidad de Madrid / Mapa del alquiler en la Comunidad de Madrid

data data-science data-visualization datascience housing-location-visualization rent renting

Last synced: 13 Sep 2025

https://github.com/thekartikeyamishra/data_cleaning_project

Welcome to the Data Cleaning and Visualization project! This repository demonstrates how to clean messy data and create insightful visualizations using Python with Pandas and Matplotlib.

data dataanalysis matplotlib matplotlib-pyplot pandas python

Last synced: 02 May 2026

https://github.com/hyper63/copy

hyper copy tool

copy data hyper

Last synced: 20 Jul 2025

https://github.com/courtois-neuromod/anat

Anatomical sub-dataset of Courtois-Neuromod project.

data raw

Last synced: 17 Jan 2026

https://github.com/ahmedkhalf/arabic-keyword-scraper

Stop wasting your time! And obtain Arabic definitions without having to look it up.

arabic data definitions scraper sentences wordsearch

Last synced: 12 Mar 2025

https://github.com/wfamous/fiv_update-data

This project automates the retrieval, processing, and publishing of digital product data for our Shopify store. It integrates Google Cloud Platform (GCP), Amazon Web Service (AWS), Terraform (Tofu), Python, Bash, Ansible and GitHub Actions to manage data pipelines efficiently.

ansible aws bash data data-analysis data-science devops gcp python pythonpackage shopify terraform tofu

Last synced: 17 Feb 2026

https://github.com/miguelgargallo/next13-fetch-data-turbo

Next13 Fetch Data Request

ai data fetch pylar request

Last synced: 27 Jul 2025

https://github.com/reppon97/cryptosnake

Simple, unofficial python wrapper for Binance API. You'll find this easy-to-use package helpful if you're interested in general market data and cryptocurrency values. You don't need to have a Binance account or API Key since you can't purchase/trade cryptocurrencies using this package.

api binance bitcoin crypto cryptocurrencies cryptocurrency cryptocurrency-exchanges cryptography data ethereum json litecoin python python3 statistics

Last synced: 05 Mar 2025

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/petermeissner/statsgrokse

R-API-binding to stats.grok.se server providing Wikipedia page view statistics for 2008 up to 2015

api binding data pageviews r wikipedia

Last synced: 17 May 2026

https://github.com/eosdis-nasa/earthdata-pub-dashboard

Front-end Dashboard for Earthdata Pub

data earthdata edpub publication

Last synced: 15 Jan 2026

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 14 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/kom-senapati/ghw-data-hacks

🌍 Global Hack Week data projects, πŸ“Š focused on exploration, manipulation, and analysis...

data ghw

Last synced: 12 Mar 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.πŸ”₯

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/dmnsgn/airports-data

Airports data: static, dynamic and custom dump.

airport airports cli data database json

Last synced: 30 Apr 2025

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data from South East Water portal website for digital water meters

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/alexandregazagnes/scikit-res

Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/pirhoo/spytula

A Python library that provides a simple and convenient way to build JSON and YAML data structures using a builder pattern.

data dsl json python yaml

Last synced: 13 Apr 2026

https://github.com/tuananh/opentravel

✈ A collection of travel related data

data travel

Last synced: 09 Oct 2025

https://github.com/unaygney/js-challenges-data-structures-and-algorithms

Repo of the challenges I'm trying to solve to understand data structures and algorithms..

algorithms-and-data-structures data javascript structure

Last synced: 29 Oct 2025

https://github.com/imadsaddik/bodmaghdataset

BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language

arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft

Last synced: 03 Apr 2025

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/joisino/twinpaper

Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)

causal-inference data research science-of-science

Last synced: 21 Mar 2025

https://github.com/stdlib-js/datasets-suthaharan-single-hop-sensor-network

Labeled wireless sensor network data set collected from a simple single-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 03 Mar 2025

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 06 Nov 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025

https://github.com/acaciaman/db-autotest

DB Database test automation. This python package allows to create database object structure and load data from database.

data database test-automation

Last synced: 05 May 2026

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers with an export dataset. Useful when wanting to quit Slack but keep your files with you.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/datafold/vhol-demo

Get hands-on examples of dbt + Datafold CI/CD workflows

data data-engineering datafold dbt diff

Last synced: 28 Dec 2025

https://github.com/thyringer/cast

CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.

csv-converter data data-preprocessing python python3 sql-builder

Last synced: 02 Feb 2026

https://github.com/countervolts/apple-music-stats-calculator

how to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/nononoexe/setariaviridis

🌾 Field-collected data of green foxtail

data data-science dataset rpackage

Last synced: 27 Feb 2026

https://github.com/nixhantb/data-structures-and-algorithms-in-java-

Master Java Programming and Data Structures and Algorithms in Java in an efficient way. Clear concept on Recursion and Sorting

algorithms algorithms-and-data-structures competitive-programming data data-structures java java-8 programming

Last synced: 05 Jul 2025

https://github.com/amethyst-php/customer

A person or an organization that pays for goods or services

amethyst amethyst-package api customer data laravel

Last synced: 11 May 2026

https://github.com/cosmos-loops/cosmos-efcore

Cosmos.EntityFrameworkCore is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of Microsoft.EntityFrameworkCore to improve development efficiency.

cosmos-loops data efcore entityframeworkcore

Last synced: 14 Aug 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-enum

Return the enumeration constant associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 13 Apr 2025

https://github.com/vyahello/fake-employee-api

πŸ‘¨β€πŸ”§ Simple mock employees data parser (responder + heroku + pytest + github/travis CI)

data employee employer mock responder rest-api

Last synced: 09 Jun 2026

https://github.com/jaldekoa/nyfedapi

A Python wrapper to easily retrieve data from the Federal Reserve Bank of New York (FRBoNY) official API in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 08 Feb 2026

https://github.com/ctechhindi/auto-fill-form-data

AUTO FILL AND AUTOCOMPLETE USER DATA WITH KEY NAME

autocomplete chrome-extension data extension

Last synced: 17 Apr 2026

https://github.com/e-candeloro/data-analysis-code-snippets-for-pandas-and-sklearn

These notebooks are useful to learn how to load, understand, clean and classify data using Pandas and Sklearn with Python

analysis big-data classification data datascience datavisualization machine-learning notebook numpy pandas python sklearn

Last synced: 10 Apr 2026