An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/fforres/webpack-plugin-dx-metrics

Webpack plugin to track webpack behaviour in datadog

data datadog developer-experience typescript visualization webpack

Last synced: 13 Feb 2026

https://github.com/sungchun12/sqlmesh-demos

SQLMesh project for live demos - provides instructions so you can run this on your own!

data data-engineering sql sqlmesh

Last synced: 24 Oct 2025

https://github.com/anicolaspp/mapr-data-gen

Data generator for MapR Data Platform

data mapr mapr-db mapr-es mapr-streams maprdb parquet scala spark

Last synced: 29 Apr 2026

https://github.com/0xdir/htcds_dart

Human Trafficking Case Data Standard (HTCDS v0.2) objects, for easy creation, storage and transmission of case data related to human trafficking.

data humanitarian schema standards

Last synced: 24 Oct 2025

https://github.com/qeeqbox/data-compliance

Data compliance is the process of following various regulations and standards to ensure that sensitive digital assets (data) are guarded against loss, theft, and misuse

compliance data data-compliance infosecsimplified qeeqbox

Last synced: 19 Mar 2026

https://github.com/juliapsychometrics/psychometrictests.jl

Efficient data structures for psychometric modeling in Julia

data julia psychometrics

Last synced: 11 Jun 2025

https://github.com/codewell/data-kale

The Simple Data Lake - Data Kale

data data-lake python

Last synced: 25 May 2026

https://github.com/tommasoazz/collaborative-location-activity-recommendations

Project for the course Scalable and Cloud Programming

data map mapreduce scala spark

Last synced: 16 Apr 2026

https://github.com/lilingxi01/bloark

Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.

architecture bloark data revision-based

Last synced: 05 Apr 2026

https://github.com/flexiodata/functions-covid-19-feed

Import Covid-19 data from Johns Hopkins University into Microsoft Excel and Google Sheets.

covid-19 data excel google-sheets import johns-hopkins-csse johns-hopkins-university spreadsheet

Last synced: 10 Mar 2025

https://github.com/onaio/gisida-react

React Dashboard library for Gisida.

dashboard data gisida map react visualization

Last synced: 28 Apr 2025

https://github.com/josephbarbierdarnal/data-matplotlib-journey

datasets for matplotlib-journey.com

data matplotlib

Last synced: 11 Mar 2026

https://github.com/ryanmorr/fastmap

Accelerated hash maps

data hashmap javascript map performance

Last synced: 10 Oct 2025

https://github.com/vaibhavpandeyvpz/cbse-scraper

This script scrapes information about schools affiliated with CBSE for a given state.

cbse crawler data schools scraper

Last synced: 12 Jul 2025

https://github.com/d8a-tech/d8a

A data collection service fully compatible with GA4 tracking protocols. Ingest into ClickHouse or BigQuery database while maintaining complete control over your data.

bigquery clickhouse data ga4 tracker

Last synced: 10 Apr 2026

https://github.com/ggreen/data-orchestration-with-scdf-showcase

data-orchestration-with-scdf-showcase

data orchestration scdf spring

Last synced: 14 Jan 2026

https://github.com/smolsoftboi/php-faker-providers

Faker providers that generate fake data for you.

data faker faker-generator faker-provider generator php

Last synced: 22 Apr 2025

https://github.com/geopython/pygeoapi-examples

Example pygeoapi deployment patterns and configurations

api data geospatial ogc ogc-api osgeo pygeoapi

Last synced: 11 Oct 2025

https://github.com/gadenbuie/crantrack

Hourly snapshots of CRAN's incoming packages folder

cran data r-packages

Last synced: 12 Mar 2026

https://github.com/wireservice/lookupr

Fetch common lookup tables and join them to your data. (A port of agate-lookup to R.)

data dplyr lookup r tables tidyverse

Last synced: 04 Oct 2025

https://github.com/build-on-aws/the-grad-project

This is the repository where you can download and run notebooks in SageMaker Studio Lab, in partnership with webinars taking place on YouTube.

data data-science python sql

Last synced: 16 May 2025

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/wonderium/browser-feature-compatibility

This repository contains browser support details for HTML, CSS, JS and SVG features.

browsers compatability css data html js json releases support svg wonderium

Last synced: 27 Jan 2026

https://github.com/quin1sue/priceguidesph-bettergov

an economic and financial data platform project under bettergov.ph

bettergovph cloudflare data hacktoberfest nextjs priceguides

Last synced: 05 May 2026

https://github.com/stefen-taime/real-time-data-pipeline-snake-game

Dynamic Snake Game: Unleashing Real-Time Streaming Analytics with Redis, Kafka, Flink, ClickHouse & Chart.js in an Online Snake Game via Flask API

chartjs clickhouse confluent-cloud data flask kafka-streams pipeline redis

Last synced: 04 May 2026

https://github.com/mark-summerfield/uxf

Uniform eXchange Format (uxf) is a plain text human readable optionally typed storage format that supports custom types. It may serve as a convenient alternative to csv, ini, json, sqlite, toml, xml, or yaml.

data ini json parser pretty-printer sqlite storage-engine toml xml yaml

Last synced: 08 Oct 2025

https://github.com/financejs/discord-bot

A Discord Bot Used In Financejs Discord Server

data discord discord-bot discordjs-bot finance financejs financial

Last synced: 13 Apr 2026

https://github.com/ozanarkancan/sailx

This repo contains the code for generating artificial navigational instruction following data.

data grounded-language-learning

Last synced: 08 Jan 2026

https://github.com/luminati-io/Pinterest-dataset-samples

Two sample datasets of over 1000 Pinterest profiles and posts, extracted using the Bright Data API, ideal for market research, influencer marketing, and product development.

data data-extraction data-mining database datasets pinterest pinterest-api structured-data web-scraping

Last synced: 09 Apr 2025

https://github.com/assem-elqersh/creativa-data-science-bootcamp

Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.

data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier

Last synced: 03 May 2026

https://github.com/natylaza89/covid19-il

Python package which brings a "Facade" interface for the client for using official covid 19 data of israeli data gov. ★19K+ Downloads★

api covid covid19 covid19-data data israel pandas python

Last synced: 13 Apr 2026

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/kshitij1235/boxdb

This a database managment lib made for python, which works like any Libraries and is very lite no aditional setup require but there is some procedure to create a project is very easy.

boxdb data database library python

Last synced: 14 Jan 2026

https://github.com/bdpedigo/neuropull

A (soon to be) lightweight Python package for accessing single-cell connectome networks with metadata.

connectome connectomes connectomics data dataset networks networks-biology

Last synced: 05 Oct 2025

https://github.com/cbartram/advancedai

AdvancedAI Selection Option for Command and Conquer Generals Zero Hour

data games java streams

Last synced: 30 May 2026

https://github.com/oliverhennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 16 Apr 2026

https://github.com/ashwinpn/visualization

Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.

analysis data data-analysis data-science data-visualization graphs plots python python3 visualization

Last synced: 13 Apr 2026

https://github.com/mystpi/crossings

🌉 A tiny library focused on easily connecting JS to HTML.

connect data frontend html javascript reactive simple small tiny

Last synced: 10 Jun 2026

https://github.com/frnt-end/weather-app-react

:atom_symbol: React project - Fetch and Toggle display of current weather in Berlin, Paris, New York & London (tabs) - using axios for API fetch. Watch DEMO 🌞 https://Frnt-End.github.io/Weather-App-React 👈

api axios axios-react background card current-weather data fetch gh-pages react reactjs tabs toggle ui usestate usestate-hook weather weather-app weather-information weatherapp

Last synced: 18 Feb 2026

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 03 Feb 2026

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/machu-gwu/constant2-project

provide extensive way of managing your constant variable.

configuration constants data developer-tools python

Last synced: 26 May 2026

https://github.com/codewithmide/solexplorer

Next-generation AI powered Solana data explorer

dashboard data explorer solana svm

Last synced: 14 Feb 2026

https://github.com/joaocarmo/react-very-simple-data-table

When all you want is a table

data react simple table

Last synced: 06 Mar 2025

https://github.com/rikvdh/zabuffer

Zero-Allocation buffer handling in C

buffer c clib data embedded memory string zero-allocation

Last synced: 03 Mar 2025

https://github.com/y0hnn/slack-file-downloader

Download files from Slack servers with an export dataset. Useful when wanting to quit Slack but keep your files with you.

channels data export gdpr privacy slack

Last synced: 27 Apr 2026

https://github.com/guslovesmath/top_tech_sp_500_forecasting

Forecasting the stock market is difficult. I sought to observe the relationship between Apple's stock price and others in the S&P500. In doing this, I was able to conclude that stocks in the tech industry can help predict a trend in Apple's Percent change.

arima-forecasting arima-model data data-science forecasting vector-autoregression

Last synced: 14 Mar 2025

https://github.com/lxcoding06/e-gereja

Website CRUD untuk Gereja, untuk mengatur data jemaat, data kematian, data pernikahan dan data baptis

data data-gereja e-gereja gereja gereja-online jemaat kematian pernikahan

Last synced: 15 May 2025

https://github.com/mollybeach/cherryether

CherryEther: Typescript Staking Deposits Ethereum Transactions

blockchain data data-science ethereum typescripts

Last synced: 21 May 2026

https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python

This Python project is made by me, Python project for improving python skills.

card data data-generator employee python

Last synced: 03 Feb 2026

https://github.com/countervolts/apple-music-stats-calculator

how to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/bkamapantula/india-pc-nfhs4

Parliamentary constituency factsheet for indicators of nutrition, health, and development in India using NFHS4 data.

data government health india nfhs nfhs4

Last synced: 19 Mar 2026

https://github.com/mahmoud-saeed-mahmoud/loading_state_handler

The StateHandlerWidget manages different UI states—loading, error, empty, and normal—allowing you to customize the displayed widgets for each state.

dart data error flutter flutter-package flutter-widget loading state

Last synced: 10 Mar 2026

https://github.com/karashiiro/lodestone-id-time

Data scraper, formula and reference implementation for the estimated creation time of a FFXIV character given its Lodestone ID.

data ffxiv ffxiv-character lodestone

Last synced: 30 Jun 2025

https://github.com/rn0x/aliexpress_product_data

استخراج بيانات المنتج من موقع علي إكسبريس

aliexpress aliexpress-api aliexpress-bot aliexpress-data aliexpress-json api data dropshipping express json nodejs

Last synced: 03 Oct 2025

https://github.com/keosariel/nairagazer-clustered-news

Providing clustered News data specifically Nigeria news. In hindsight this repo contain nigeria news and it's coverage. Data is from Nairagazer

ai data data-science news nigeria nigerian-data python

Last synced: 30 Aug 2025

https://github.com/stdlib-js/array-ones-like

Create an array filled with ones and having the same length and data type as a provided array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 05 Jan 2026

https://github.com/erwan-simon/aws-data-platform-framework

A unified framework to industrialize data ingestion, transformation and pipeline execution on AWS using Terraform, from infrastructure provisioning to runtime execution, designed as a reusable and standalone data platform.

aws data data-framework datalake docker iceberg python spark step-functions terraform terraform-module

Last synced: 23 May 2026

https://github.com/peterdavehello/nrd-list-archive

🌐📂 A collection of past NRD lists to explore—perfect for fun, research, or just plain curiosity! 🎉🔍✨

archive data nrd

Last synced: 17 Mar 2026

https://github.com/asirihewage/simplest-xpath-web-scraper

Simplest web scraper created using Python3 and MongoDB

data data-mining python3 scraper web webscrping

Last synced: 29 Jan 2026

https://github.com/mohasarc/treeviz

The best tree data-structures visualization tool

data structures visualization visualization-tools

Last synced: 25 Apr 2026

https://github.com/d3oxy/country-state-data

A comprehensive JSON dataset containing countries, states, cities, regions, and languages with TypeScript support. Perfect for building location-based dropdowns, address forms, and geographical applications.

address cities countries currency data dropdown geographical iso json languages location regions states typescript

Last synced: 24 Jan 2026

https://github.com/pjt3591oo/exchange-crawler

업비트, 코인원 크롤러

crawler data exchange python

Last synced: 27 Oct 2025

https://github.com/sdhutchins/jxn-open-data-api

Access Jackson, MS open government data using a python API wrapper.

api data jackson jxn mississippi open-gov

Last synced: 08 Apr 2025

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/rdmpage/checklist-of-the-freshwater-snails-of-sabah

Data from A preliminary checklist of the freshwater snails of Sabah (Malaysian Borneo) deposited in the BORNEENSIS collection, Universiti Malaysia Sabah https://doi.org/10.3897/zookeys.673.12544

checklist data gbif google-earth kmz sabah

Last synced: 09 Mar 2026

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/praveenpuglia/css-support

The source of truth for CSS browser support of info

api browser compatibility css data properties selectors support

Last synced: 31 Mar 2025

https://github.com/zoo-js/zoo-data

🍩 The data for zoo-js.

actions data js json nodejs workflow

Last synced: 22 Apr 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-enum

Return the enumeration constant associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 13 Apr 2025

https://github.com/a3r0id/lightshot-data-miner

A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.

brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition

Last synced: 19 Oct 2025

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/steelcake/cherry-pipelines

A collection of pipelines built with cherry

blockchain clickhouse data pipeline pyhton

Last synced: 09 Mar 2026

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/norton120/dfmock

Python Pandas DataFrame mock generator. You need mock'd data in a dataframe? this is what you need.

data mock pandas pandas-dataframe python python37

Last synced: 19 Jan 2026

https://github.com/stdlib-js/datasets-suthaharan-multi-hop-sensor-network

Labeled wireless sensor network data set collected from a multi-hop wireless sensor network deployment using TelosB motes.

data dataset datasets javascript labeled machine-learning ml mote motes network node node-js nodejs outlier outliers sample sensor statistics stats stdlib

Last synced: 10 Oct 2025

https://github.com/priyanka7411/customer-segmentation-churn-dashboard

📊 Streamlit + Plotly dashboard for customer segmentation, RFM analysis, and churn prediction using machine learning.

churn data machine-learning pandas prediction python rfm rfm-analysis streamlit visualization

Last synced: 14 Apr 2026

https://github.com/thyringer/cast

CLI tool for reading strings or complex data sets from CSV files to output them in other text formats.

csv-converter data data-preprocessing python python3 sql-builder

Last synced: 02 Feb 2026

https://github.com/zalweny26/tools

Just a bunch of tools made in TypeScript.

algorithms data dimensionality distances helpers reduction sortings structures tools utils

Last synced: 03 Feb 2026