An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/praveenpuglia/css-support

The source of truth for CSS browser support of info

api browser compatibility css data properties selectors support

Last synced: 31 Mar 2025

https://github.com/ingmarboeschen/jatsdecoderevaluation

Evaluation data and code

data evaluation jatsdecoder

Last synced: 04 Feb 2026

https://github.com/vikyw89/usesyncv

a simplistic react global store with pregenerated CRUD, and built in async fetch

data fetch mobx reactjs reactquery redux state state-management store swr zustand

Last synced: 06 Jan 2026

https://github.com/corle-hyz/ucas_ds_lab

中国科学院大学(UCAS)2020年春季学期数据结构课程作业

data ds oj sort structure traffic ucas

Last synced: 16 Mar 2025

https://github.com/vikashpr/18cse301j_ra2011003010737

This website tells the story of a nation's GDP through data visualization, providing insights on global GDP, state-wise GDP, sector-wise GDP, and the vision for India's economy. It includes data sets and sources for further reference.

css3 d3-visualization d3js data data-vizualisation gephi-visualizations html5 indian-economy indian-gdp information-visualization js python-word-cloud python3 storytelling tableau tableau-public threejs wordcloud-visualization

Last synced: 03 May 2026

https://github.com/marcuwynu23/phaddress

Data API of Regions,Provinces, CityMunicipalities, and Barangay of the Philippines

address address-data-api api barangay city data geolocation municipalities provinces

Last synced: 14 Feb 2026

https://github.com/yashika-malhotra/cardioflex-treadmill-analysis-using-descriptive-statistics-probability

Description Analysis and Visualization on CardioFlex Treadmill data to provide insights and recommendations to improve their userbase.

colab-notebook data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025

https://github.com/stdlib-js/array-base-filled-by

Create a filled generic array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map node node-js nodejs stdlib structure types

Last synced: 24 Apr 2025

https://github.com/lindsaygelle/emojipedia

Go application. Simple program that scrapes unicode.org for Emoji content. Parses out HTML into categorically ordered data subsets. Explored from the command line.

cli data data-mining emoji emojipedia encyclopedia go golang golang-application html-scraping unicode-characters

Last synced: 11 Mar 2026

https://github.com/georgetdn/syscppcplinux

Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework linux object persistence serialize sql

Last synced: 12 Feb 2026

https://github.com/yakupzengin/data-structures-and-algortihms

This repo contains implementation of data structures and algorithms using JAVA

algorithms algorithms-and-data-structures data structure

Last synced: 03 Dec 2025

https://github.com/divithraju/divith-raju-searchengine-wikipedia

search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.

algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia

Last synced: 16 May 2026

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/natylaza89/covid19-il

Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★

api covid covid19 covid19-data data israel pandas python

Last synced: 13 Apr 2026

https://github.com/agnosticeng/agx

Query and explore local and remote data with Clickhouse

clickhouse d3 data rust svelte

Last synced: 26 Oct 2025

https://github.com/yaoguangduan/protosync

generate go code from protobuf ,sync proto dirty data

data golang protobuf sync

Last synced: 12 Mar 2026

https://github.com/caelean/twittermap

Map of twitter user's influence as defined on by influencetracker

data google-maps maps sparql twitter visualization

Last synced: 14 Jun 2025

https://github.com/arcticsnow/climatepy

Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)

climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray

Last synced: 05 Feb 2026

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/mystpi/crossings

🌉 A tiny library focused on easily connecting JS to HTML.

connect data frontend html javascript reactive simple small tiny

Last synced: 10 Jun 2026

https://github.com/kaos599/apollo-synthetic-data-generator

Apollo is a Python GUI application designed to simplify the complex process of generating random data based on fixed values. It allows users to generate various types of binary datasets, such as Yes/No type questions, by specifying probabilities.

data data-engineering data-generation data-generator data-science faker-library machine-learning tkinter-gui

Last synced: 22 Jul 2025

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/gauravkoradiya/tensorflow-data-and-deployement

This repository contains usage of data and deployment pipline in tensorflow.

data deployment machine-learning-algorithms pipline tensorflowjs

Last synced: 06 Oct 2025

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Apr 2026

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/pommes-public/pommesdata

A full-featured transparent data preparation routine from raw data to POMMES model inputs

data opensource power raw-data transparent

Last synced: 07 Oct 2025

https://github.com/wonderium/browser-feature-compatibility

This repository contains browser support details for HTML, CSS, JS and SVG features.

browsers compatability css data html js json releases support svg wonderium

Last synced: 27 Jan 2026

https://github.com/StudyResearchProjects/arrbuffstr

Creates Strings from ArrayBuffers and viceversa in NodeJS and the Browser

arraybuffer browser data node string transform

Last synced: 09 Oct 2025

https://github.com/farovictor/mongodbextractor

This project is intended to be used as a data extractor to support ELT pipelines or any kind of process that requires a heavy data dump from MongoDb databases.

data go mongodb pipeline

Last synced: 14 Jan 2026

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/antononcube/raku-data-importers

Various data importing routines with a unified interface (data-import, slurp).

data data-ingestion raku rakulang slurp

Last synced: 23 Feb 2026

https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 10 Oct 2025

https://github.com/xxczaki/parsify-plugin-covid19

Parsify plugin, that adds COVID 19-related variables 🦠

confirmed coronavirus covid19 data deaths fun math parser parsify parsify-plugin plugin variable variables

Last synced: 13 Mar 2026

https://github.com/ndarville/ipa

HTML hex codes for the IPA table.

data ipa linguistics

Last synced: 24 Jan 2026

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/stdlib-js/datasets-cdc-nchs-us-births-1994-2003

US birth data from 1994 to 2003, as provided by the Center for Disease Control and Prevention's National Center for Health Statistics.

america babies births data dataset datasets javascript node node-js nodejs stdlib time-series timeseries united-states us usa

Last synced: 12 Oct 2025

https://github.com/yanpitangui/iteminfoconverter

Application that converts ragnarok legacy data files to iteminfo.lua

data itemdbconf iteminfo luafiles ragnarok

Last synced: 12 Oct 2025

https://github.com/efark/data-receiver

Small web server to receive data through HTTP requests.

data gin-gonic go golang http

Last synced: 23 Feb 2026

https://github.com/datahub-local/datahub-local

DataHub.local is a powerful data platform designed for edge devices, enabling seamless analytics and insights at home

data data-engineering devops kubernetes raspberrypi

Last synced: 21 Jan 2026

https://github.com/tayeva/eia-client-python

EIA Open Data API Client - Python

data open-source python python-3 python3

Last synced: 14 Oct 2025

https://github.com/skywarth/fenrir-wolfpack-simulator

Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.

data data-structures javascript max-heap simulation simulations wolfpack

Last synced: 14 Oct 2025

https://github.com/yorkulibraries/vendorpol

URLs for vendor privacy policies and terms of use.

data libraries privacy-policy

Last synced: 15 Oct 2025

https://github.com/liamross/use-data

A React hook for async fetching of data, data manipulation, and take latest vs take every functionality.

async data hook hooks react

Last synced: 22 Jan 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/a3r0id/lightshot-data-miner

A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.

brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition

Last synced: 19 Oct 2025

https://github.com/plabayo/datapoints.earth

Earth data liberation for and by its citizens.

data foss free scrape

Last synced: 15 Mar 2026

https://github.com/p32929/use-megamind

A simple react hook for managing asynchronous function calls with ease on the client side

async asynchronous-tasks axios client-side-javascript data data-fetching easy fetch generics hooks javascript npm painless promise query react rest simple small typescript

Last synced: 23 Jan 2026

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/mrnazu/eth-data-library

eth-data-library is a Nodejs library that provides tools for accessing and processing data on the Ethereum blockchain.

blockchain data ethereum nodejs smart-contracts web3

Last synced: 28 Jan 2026

https://github.com/d3oxy/country-state-data

A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.

address cities countries currency data dropdown geographical iso json languages location regions states typescript

Last synced: 24 Jan 2026

https://github.com/banbord/data-vis-tornados

This repository includes data files, processing scripts, visualization code, and documentation for our tornado data visualization project. It aims to provide insights into tornado patterns across the United States using interactive and informative visual representations.

d3-visualization d3js data javascript json visualization

Last synced: 24 Feb 2026

https://github.com/peterdavehello/nrd-list-archive

🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨

archive data nrd

Last synced: 17 Mar 2026

https://github.com/dhimmel/het.io-rep-data

Data from Project Rephetio for the het.io website

browser data datatables drug-repurposing rephetio

Last synced: 07 Feb 2026

https://github.com/countervolts/apple-music-stats-calculator

how to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/aleklukanen/chapterhousedb-v1

Allows you to create simple data streaming warehouses written in Golang using Apache Parquet and Arrow.

arrow data database event golang ingestion parquet pipeline processing stream

Last synced: 27 Feb 2026

https://github.com/dewasry/browser-base

A tool to help developer store data ofline on browser

angular borwser data indexeddb nextjs orm query react typescript vite vue

Last synced: 13 Feb 2026

https://github.com/axa-ch/health-insurance-data

Swiss health insurance data

axa data health insurance swiss

Last synced: 19 Mar 2026

https://github.com/mihasm/arso-scraper

Unofficial Python CLI tool for downloading automated sensor weather data from the Slovenian Environment Agency.

api arso cli data historical-data meteorological python slovenia weather

Last synced: 14 Feb 2026

https://github.com/pawelzny/vo

DDD Value Object implementation

data ddd-patterns object python3 value

Last synced: 15 Feb 2026

https://github.com/oliverhennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 16 Apr 2026

https://github.com/uk-ipop/open-data-pipeline

A pipeline for processing, enhancing, and sharing open datasets.

actions automation data python

Last synced: 25 May 2026

https://github.com/csadorf/pydata-ann-arbor-2018

Slides and notebooks demonstrating signac for PyData Ann Arbor Meetup 2018

data data-management jupyter signac workflow

Last synced: 04 Jun 2026

https://github.com/0xdir/relief_web_dart

A Future-based wrapper around the Relief Web API, to retrieve information on humanitarian news, reports, training, jobs, and disasters

api dart data humanitarian jobs

Last synced: 11 Jun 2026

https://github.com/nafisalawalidris/advanced-fraud-detection-with-anomaly-detection

This repository demonstrates how to build a robust fraud detection system that combines supervised learning techniques with anomaly detection models. It provides end-to-end implementation, from data preprocessing and model training to deploying a real-time fraud detection API using FastAPI.

anomaly-detection creditcardfrauddetection data dataanalytics fastapi fraud-detection machinelearning modeldeployment python supervised-machine-learning unsupervised-machine-learning

Last synced: 20 Apr 2026

https://github.com/tomwhite/chernoff

A visual mood indicator. One of the first Java programs I ever wrote.

chernoff-faces data visualization

Last synced: 20 Apr 2026

https://github.com/zgbjgg/quetzal-examples

Examples using Quetzal :rocket: :bird:

analytics dashboard data data-visualization elixir erlang plotly web-app

Last synced: 24 Apr 2026

https://github.com/andrewrporter/my-analytics

Analyzes FireFox browsing history with modern python3 features and libraries

analytics data firefox matplotlib python python3 sqlite3

Last synced: 28 Apr 2026

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 28 Apr 2026

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/nikolaydubina/aws-s3-reader

Efficient Go Reader for large AWS S3 Objects

aws data golang reader s3 streaming

Last synced: 30 Apr 2026

https://github.com/ondata/opensdmx

Python CLI and library for any SDMX 2.1 REST API — Eurostat, ISTAT, OECD, ECB, World Bank and more. AI-ready.

cli data eurostat istat oecd open-data python rest-api sdmx statistics

Last synced: 01 May 2026

https://github.com/banyan-team/banyan-julia-examples

Adventures in massively parallel cloud computing with Banyan Julia!

banyan data data-analytics data-processing data-science julia

Last synced: 02 May 2026

https://github.com/vutran/yahoo-stocks-cli

Fetch stock data from the CLI

cli data finance stocks yahoo

Last synced: 08 Jun 2026

https://github.com/rastmob/wordpress-llms-output-plugin

A WordPress plugin to export posts, pages, and custom post types as JSON for training Language Models (LLMs).

ai data llm llms training training-data wordpress wordpress-development wordpress-plugin

Last synced: 03 May 2026

https://github.com/vyahello/fake-employee-api

👨‍🔧 Simple mock employees data parser (responder + heroku + pytest + github/travis CI)

data employee employer mock responder rest-api

Last synced: 09 Jun 2026

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/quin1sue/priceguidesph-bettergov

an economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026