An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/nop-dev/learning-js

Esse repositório contem todas as anotações que fiz enquanto estudava um módulo da trilha Explorer da Rocketseat sobre JavaScript. 🔰

data data-structures functions javascript js

Last synced: 17 Apr 2026

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/bradlindblad/quotableoffice

Repo for the quotable office R Shiny app

data datascience golem-apps r shiny shiny-apps text text-mining

Last synced: 26 May 2026

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/csadorf/pydata-ann-arbor-2018

Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018

data data-management jupyter signac workflow

Last synced: 04 Jun 2026

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/audeering/emodb

Publishes Berlin Database of Emotional Speech with audb

audb data emotion

Last synced: 19 Oct 2025

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers with an export dataset. Useful when wanting to quit Slack but keep your files with you.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/machu-gwu/constant2-project

provide extensive way of managing your constant variable.

configuration constants data developer-tools python

Last synced: 26 May 2026

https://github.com/johnmackintosh/simd2016_tmap

Mapping SIMD with tmap - static & interactive

data data-science data-visualization mapping r visualisation

Last synced: 20 Mar 2025

https://github.com/dhruvldrp9/simpledht

A Python-based Distributed Hash Table (DHT) implementation enabling cross-network key-value storage, automatic node discovery, and data replication with a simple CLI and library interface.

cross-network-node-communation data data-replication data-synchronization dht dht-python distributed-hash-table key-value-storage nat netowork node-discovery peer-to-peer peer-to-peer-network python sha-256 simple udp udp-socket-communication

Last synced: 28 Feb 2026

https://github.com/Ekey/ER.DATA.Tool

Tool for extract data archives from mobile game Earth Revival (Project Arrival)

data earth-revival idx project-arrival

Last synced: 19 May 2026

https://github.com/ange007/jquery.mydata

jQuery.myData - Small jQuery&Zepto plugin for two-ways data binding.

data data-binding jquery jquery-plugin zepto zepto-plugin zeptojs

Last synced: 19 May 2026

https://github.com/exaluc/webhookcatcher

Catch your webhooks like a dream

api catcher data webhook webhook-callbacks webhooks-catcher

Last synced: 14 Apr 2025

https://github.com/johntocci/nullaxe

Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.

data data-analysis data-science datacleaning pandas polars python

Last synced: 06 Apr 2026

https://github.com/joisino/twinpaper

Code for "Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling" (CIKM 2022)

causal-inference data research science-of-science

Last synced: 21 Mar 2025

https://github.com/uttamo/messenger-data-analysis

Analyse Messenger data exports from Facebook

analysis data facebook json messenger pandas parse python

Last synced: 18 Jul 2025

https://github.com/ciscorn/tinybufr

A Rust library for decoding BUFR meteorological observation data format

bufr data meteorology rust weather wmo

Last synced: 11 Jan 2026

https://github.com/utrechtuniversity/dataprivacysurvey

Code for analysing data from the Data Privacy Survey (2022)

data gdpr open-science privacy rdm research research-data-management survey utrecht-university

Last synced: 16 Jun 2025

https://github.com/eliot-akira/png-compressor

Compress and encode data as PNG image

cartridge data gzip png

Last synced: 17 Mar 2025

https://github.com/memair/apps

App Store for Memair

apps appstore data data-science quantified-self

Last synced: 06 Apr 2026

https://github.com/andrei-vataselu/data-science-snippets

🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.

artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization

Last synced: 10 Mar 2026

https://github.com/jasondrawdy/compendio

Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.

compendium converters cryptography data extensions generators hashing library security utilities validation windows

Last synced: 18 May 2026

https://github.com/radekbednarik/data_generator

Random data generator using Python. Generate data files with random string, floats, ints, dates via console or TOML files..

csv data generator python python3 random test-data-generator

Last synced: 13 Dec 2025

https://github.com/kodie/migrate-acf-field-data-to-repeater

A WordPress plugin that migrates field metadata for ACF fields that have been moved inside of a repeater

acf acf-field acf-fields advance-custom-field data data-migration data-migration-tool wordpress wordpress-plugin

Last synced: 19 May 2026

https://github.com/stdlib-js/ndarray-base-dtype2c

Return the C data type associated with a provided data type string.

array base c data dtype javascript multidimensional ndarray node node-js nodejs stdlib type types util utilities utility utils

Last synced: 18 Jun 2025

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/antvis/create-antv-demo

A simple CV-dashboard framework for practicing how to use AntV.

antv cv dashboard data resume resume-template resume-website visualization

Last synced: 09 Apr 2025

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data from South East Water portal website for digital water meters

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/cherylisabella/statistics--caret

Training Regression and Classification Models using caret

data data-analysis data-mining data-science datascience dataset r statistics

Last synced: 24 Jun 2025

https://github.com/nowosad/cllc

Country-level Land Cover - categories and transitions

data dataset land-cover land-cover-transitions r

Last synced: 04 Apr 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/mongoexpuser/synthetic-drilling-data-app-for-sqlite-ml

Generate synthetic drilling data that can be used for testing machine learning (ML) models.

classification data drilling events inference machine-learning ml-models node-sqlite3 nodejs prediction python sqlite3 training

Last synced: 08 Apr 2026

https://github.com/mohammadkarbalaee/introduction-to-data-science-sbu

Reports and full documentation of the introduction to data science course held at SBU

data data-science python shahid-beheshti-university

Last synced: 27 Mar 2025

https://github.com/weisscharlesj/data_scicompforchem

Zipped data for SciCompforChem book for easy download

chemistry chemistry-education data data-visualization python

Last synced: 07 Nov 2025

https://github.com/ugurcanerdogan/cross-validation-with-imbalanced-dataset

BBM467*SDSP - Small Data Science Project - Things to consider in cross validation and resampling when dealing with Imbalanced Data : What is the right way?

bbm467 cross-validation data data-science kfold-cross-validation logistic-regression machine-learning oversampling sdsp smote

Last synced: 21 Jun 2025

https://github.com/shysolocup/aepl

A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.

aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package

Last synced: 28 Oct 2025

https://github.com/stdlib-js/array-base-copy

Copy the elements of an array-like object to a new generic array.

array copy data generic javascript node node-js nodejs stdlib structure types

Last synced: 30 Oct 2025

https://github.com/satyam4229/college-predictor-system

The college predictor system is a Python-based application that utilizes a machine learning model to predict colleges and their corresponding degree programs and branches based on a student's JEE (Joint Entrance Examination) score.

data data-science jupyter-notebook kaggle prediction python

Last synced: 06 Apr 2026

https://github.com/danlsn/causality

A Personal Data Platform and the culmination of years of curiosity and learning in the Data Engineering space.

data data-engineering datawarehousing personal-data quantified-self

Last synced: 06 Mar 2026

https://github.com/rrighart/rrighart.github.io

A webpage about data science, programming, statistics and related topics

analyses data data-mining programming statistics

Last synced: 20 Jan 2026

https://github.com/aaronmeder/social-history

A quick look into your history on social media. Drop in the archives you've downloaded from Facebook and Instagram and see some stats about your time on the networks.

archives data facebook instagram statistics stats

Last synced: 27 Mar 2025

https://github.com/lukanedimovic/table_editor

A simple table data editor, with easily scalable functions and operations & a nice GUI

data data-science formula java parser parsing preprocessing swing tokenizer

Last synced: 04 Apr 2025

https://github.com/felixklauke/atomizer

Playing around with butter knife, android bindings and rx java.

binding butterknife data java react rx rxjava

Last synced: 15 May 2026

https://github.com/minightdev/paperclip

Paperclip is a powerful privacy-focused data breach search engine that empowers users to swiftly and securely investigate breaches using email addresses and phone numbers. Our robust search engine delivers real-time results while prioritizing the privacy and security of user queries.

beaches data database pwn pwned search-engine

Last synced: 22 Mar 2025

https://github.com/fiedsch/datamanagement

Data management helpers (PHP-CLI)

csv-data data datamanagement helper php

Last synced: 05 Apr 2025

https://github.com/sermetpekin/evdscpp

evdscpp is a C++ library for fast, efficient, and user-friendly interaction with the EVDS API Server. Designed with performance in mind, it provides built-in caching, an Excel export option, and an intuitive user interface for configuring and retrieving data. evdscpp can be extended for integration with other C++ projects and offers options for use

cbrt central-bank cpp data edds evds evds-api evdscpp tcmb tcmb-api

Last synced: 07 Sep 2025

https://github.com/heikomuller/histore

Library for maintaining snapshots of evolving tabular data sets

data version-control

Last synced: 10 Apr 2025

https://github.com/miguelgargallo/next13-fetch-data-turbo

Next13 Fetch Data Request

ai data fetch pylar request

Last synced: 27 Jul 2025

https://github.com/kale-ko/ejcl

An advanced configuration library for Java with support for local files, mysql databases, and more.

config configs configuration configuration-files data java java-serialization json mariadb mysql parser serialization serializer smile yaml

Last synced: 02 Feb 2026

https://github.com/sun-lab-nbb/sl-experiment

A Python library that provides tools to acquire, manage, and preprocess scientific data in the Sun (NeuroAI) lab.

ataraxis data experiment methods neuroscience

Last synced: 07 Mar 2026

https://github.com/woctezuma/download-steam-banners-data

Data consisting of Steam banners.

data steam steam-api

Last synced: 06 Jan 2026

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/serhatkacmaz/cpp-datastructuresandalgortihms

Contains codes related to data structures

algorithms cplusplus data data-structures

Last synced: 10 Jul 2025

https://github.com/randomfractals/unfolded-map-renderer

Unfolded Map 🗺️ Notebook 📓 Renderer for VSCode.

cell data flat-data geo geo-location geo-spatial map notebook output renderer unfolded view vscode

Last synced: 21 Mar 2025

https://github.com/polina-prokofieva/viewjson

The class for convenient visualization of json with some settings.

data data-visualization es5 es6 javascript json

Last synced: 15 May 2026

https://github.com/maskedsyntax/covid-tracker

Qt app to keep a track of Covid-19 records of different countries.

coronavirus coronavirus-tracking covid-19 data parsing scraping scraping-websites tracker web-scraping

Last synced: 29 Mar 2025

https://github.com/pjmagee/starwars-data

A Star Wars Web app with Charts and entire Timeline events!

aspire blazor blazor-webassembly data database dataset docker dotnet json starwars starwars-data starwars-fandom

Last synced: 07 Mar 2026

https://github.com/moumouls/data-toulouse-grapql

A (partial) GraphQL Support for the Toulouse Métropole Open Data service

api data graphql open service toulouse

Last synced: 16 May 2026

https://github.com/horizom/dto

Data Transfer Objects for all PHP applications.

data dto object transfer

Last synced: 14 Sep 2025

https://github.com/dataopstix/modelt

Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.

airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization

Last synced: 28 Mar 2025

https://github.com/ezra-obiwale/vue-doo

Utility library for VueJS

ajax axios data function http js method store utility vue vuejs

Last synced: 07 May 2025

https://github.com/emrecpp/datapacket-cpp

Send, recv, encrypt, decrypt, compress data as Packet and send it with socket for C++.

compress data deserialization deserialize deserializer encrypt packet recv send serialization serialize serializer socket storage

Last synced: 28 Mar 2025

https://github.com/meetyildiz/pandazip

Reduce RAM footprint of Pandas DataFrame without losing information.

compression data data-mining data-science machine-learning pandas utilities

Last synced: 20 Jan 2026

https://github.com/sharmadhiraj/free-json-datasets

Collection of free JSON data that are scraped and parsed from different websites.

collection crawler data data-scraping datasets json sports statistics web-scraping

Last synced: 28 Mar 2025

https://github.com/gabrielu3/ori-inverted-list

Inverted List made for a college discipline named Organization and Retrieval of Information

c data data-structures index inverted-index list

Last synced: 24 Feb 2026

https://github.com/godeltech/godeltech.data

.NET library to access data storage with Unit of Work, Repository and Entity classes

data entity repository unitofwork

Last synced: 30 Apr 2025

https://github.com/godeltech/godeltech.data.entityframeworkcore

Library to access database with Unit of Work, Repository and Entity classes for Entity Framework Core.

data entity entity-framework-core repository unitofwork

Last synced: 30 Apr 2025

https://github.com/thealphadollar/messiah

Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity

azure backend data data-analysis flask frontend materialize natural-disasters

Last synced: 19 Jan 2026

https://github.com/gavinr/minor-league-baseball

Data set of minor league baseball teams

baseball data hacktoberfest maps open-data open-datasets sports

Last synced: 06 Apr 2025

https://github.com/bgmp/tesis-german-deuster

Datos estadísticos para tercería de una tésis

charts data r

Last synced: 28 Mar 2025

https://github.com/d2hydro/hydrodashboards

Open Source Dashboards for hydro data

bokeh data geopandas hydrology hydrometrics pandas sheetjs

Last synced: 26 Jul 2025

https://github.com/seregpie/vuefort

The state management for Vue.

data model state store vue

Last synced: 13 Apr 2026

https://github.com/alexandregazagnes/scikit-res

Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/manuelblancovalentin/dynamictable

An easy and user-friendly way to display dynamic data in Python using command-line tables

data deep display dynamic friendly gui interface io keras learning loop machine python python3 pytorch table tensorflow user

Last synced: 20 Jan 2026

https://github.com/schbenedikt/datamining

Heise (https://heise.de) News Crawler

data data-science heise postgresql web-crawler

Last synced: 10 Apr 2025

https://github.com/sheweny/discord-resolve

This module groups together functions to retrieve data from different types of arguments.

data discord discord-js mentions resolver sheweny utility

Last synced: 29 Oct 2025

https://github.com/0xdir/satisfaction_dart

A collection of customer satisfaction scores and metrics used by businesses to measure and assess customer satisfaction.

csat customer data metrics nps

Last synced: 04 Mar 2025

https://github.com/rahulraikwar00/advault

Advault is a adhaar data vault generation tool

aadhaar data hacktoberfest uidai vault

Last synced: 05 Apr 2025

https://github.com/imadsaddik/bodmaghdataset

BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language

arabic-llm arabic-nlp darija-llm darija-nlp data dataset fine-tuning llm nlp sft

Last synced: 03 Apr 2025

https://github.com/philhawksworth/netlify-plugin-trello-lists

A plugin to fetch the JSON data of a public Trello board, and stash the data for each list in a JSON file before your build runs making the data available to your static site generator at build time.

api data eleventy netlify plugin trello

Last synced: 20 Jan 2026