An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/theseus-rs/rsql

Command line SQL interface for relational databases and common data file formats

cockroachdb command-line csv data database duckdb json mariadb mysql parquet postgres postgresql redshift snowflake sql sqlite sqlite3 sqlserver

Last synced: 16 May 2025

https://github.com/MLWhiz/data_science_blogs

A repository to keep track of all the code that I end up writing for my blog posts.

blogging chatbot data datascience gan graphs machine-learning mcmc python spark streamlit time-series xgboost

Last synced: 05 May 2025

https://github.com/artus9033/chartjs-plugin-dragdata

Draggable data points plugin for Chart.js

chartjs data drag plugin

Last synced: 12 Apr 2025

https://github.com/dat-ecosystem-archive/datBase

Open data sharing powered by Dat [ DEPRECATED - More info on active projects and modules at https://dat-ecosystem.org/ ]

dat data datproject p2p registry search sharing

Last synced: 03 Apr 2025

https://github.com/jldbc/coffee-quality-database

Building the Coffee Quality Institute Database

agriculture coffee data data-science dataset

Last synced: 09 Apr 2025

https://github.com/thiagokimo/faker

Provides fake data to your Android apps :)

android data faker mock mocking

Last synced: 22 Aug 2025

https://github.com/awslabs/amazon-s3-find-and-forget

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

amazon-s3 aws big-data ccpa data data-erasure data-lake gdpr parquet privacy right-to-be-forgotten s3

Last synced: 04 Apr 2025

https://github.com/OpenIntroStat/openintro

📦 R package for data and supplemental functions for OpenIntro resources

data openintro rstats rstats-package

Last synced: 30 Jul 2025

https://github.com/www-zerocode-net-cn/ERD-Online

ERD Online is an online collaborative data warehouse design tool. It requires no local installation and lets you operate databases online. It is an excellent alternative to desktop data modeling tools.

bigdata collaborative data database design erd java lowcode metadata nocode online sql

Last synced: 24 Mar 2025

https://github.com/Synthoid/ExportSheetData

Add-on for Google Sheets that allows sheets to be exported as JSON or XML.

data esd google-sheets json tools xml

Last synced: 14 Mar 2025

https://github.com/anaconda/anaconda-project

Tool for encapsulating, running, and reproducing data science projects

anaconda conda-environment data datascience encapsulation reproducibility running

Last synced: 11 Dec 2025

https://github.com/data-dot-all/dataall

A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.

aws aws-glue aws-lake-formation aws-s3 data data-science etl-framework lakeformation lakehouse redshift

Last synced: 29 Jul 2025

https://github.com/hugovk/top-pypi-packages

A regular dump of the most-downloaded packages from PyPI

data dump json pypi python

Last synced: 12 Apr 2025

https://github.com/koldlight/curso-python-analisis-datos

Basic Python course oriented toward data analysis, in Spanish

course data data-analysis folium hacktoberfest numpy pandas python requests seaborn spanish

Last synced: 12 Apr 2025

https://github.com/RTradeLtd/Temporal

☄️ Temporal is an easy-to-use, enterprise-grade interface into distributed and decentralized storage

data ethereum ethereum-swarm golang i2p infrastructure ipfs ipfs-cluster ipns pinning storage swarm temporal

Last synced: 06 Apr 2025

https://github.com/harinij/100daysofcode

#100DaysOfCode - Learn by developing 100 unique apps to explore exciting tech stacks

100daysofcode ai appdev coding-challenge data developer-challenge learning-by-doing machine-learning opensource reactjs

Last synced: 17 Jun 2025

https://github.com/vinyzu/chrome-fingerprints

A collection of 10,000 Windows Chrome fingerprints. Usable through an easy-to-use API and available as compressed (LZMA) or full-size JSON (see Releases). It's just 1.4 MB in compressed form and fast to read.

automation botright browser data dataset fingerprinting fingerprints fingerprints-generator playwright

Last synced: 16 May 2025

https://github.com/owent/libatbus

A cross-platform framework library for building high-performance, fully asynchronous, tree-structured BUS messaging systems

bus channel cpp cxx data ip ipv4 ipv6 linux macos message osx performance queue shared-memory shm socket tcp transfer windows

Last synced: 12 Apr 2025

https://github.com/bcapathshala/dsa-supreme-2-0-notes

Notes on data structures using C++

algorithms cpp data data-structures dsa

Last synced: 12 Apr 2025

https://github.com/datadesk/california-coronavirus-data

The Los Angeles Times' open-source archive of California coronavirus data

altair binder coronavirus covid csv data data-journalism journalism jupyter news pandas python

Last synced: 06 Apr 2025

https://github.com/borisflesch/vue-good-table-next

An easy-to-use, powerful data table for Vue 3.x with advanced customizations including sorting, column filtering, pagination, grouping, etc. Based on Vue-good-table (Vue 2.x).

data datatable table vue vue3 vuejs vuejs3

Last synced: 02 Aug 2025

https://github.com/nycdb/nycdb

Database of NYC Housing Data

civic-data data database housing nyc open-data psql python3

Last synced: 15 May 2025

https://github.com/ronellsicat/DxR

DXR is a Unity package for rapid prototyping of immersive data visualizations in augmented, mixed, and virtual reality (AR, MR, VR) or XR for short.

ar augmented charts data graphs hololens immersive mixed mixed-reality reality unity virtual visualization vr

Last synced: 01 Apr 2025

https://github.com/hiyali/vue-smooth-picker

🏄🏼 A SmoothPicker for Vue 2 (like native datetime picker of iOS)

awesome data datetime picker smooth vue vue-picker vue2

Last synced: 13 Apr 2025

https://github.com/mendableai/firecrawl-app-examples

🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.

ai ai-scraping data examples html-to-markdown llm markdown rag scrapers templates web-crawler

Last synced: 13 Apr 2025

https://github.com/umitkaanusta/reddit-detective

Play detective on Reddit: Discover political disinformation campaigns, secret influencers and more

analysis analytics api data database elt etl graph graph-database neo4j network politics reddit social social-media social-network

Last synced: 06 Apr 2025

https://github.com/telefonicaid/fiware-orion

Context Broker and CEF building block for context data management, providing NGSI interfaces.

context-information-management data fiware fiware-ngsi fiware-orion orion-context-broker

Last synced: 04 Apr 2025

https://github.com/dataplane-app/dataplane

Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows

Last synced: 27 Dec 2025

https://github.com/iterative/vscode-dvc

Machine learning experiment tracking and data versioning with DVC extension for VS Code

data data-science dvc machine-learning python visual-studio-code vscode vscode-extension

Last synced: 18 Jun 2025

https://github.com/robmsmt/ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

asr audio-data data speech speech-activities speech-recognition speech-to-text

Last synced: 11 Mar 2025

https://github.com/felikcat/unlimited-hotspot

Removes speed restrictions on your hotspot internet (iOS, iPadOS, Android, Quectel) and allows hotspots on any plan (rooted Android and Quectel only).

android bypass-throttling cellphone data hotspot hotspot-wifi internet ios linux macos mobile phone qualcomm quectel tablet tether tethering unlimited unlimited-data windows

Last synced: 05 Apr 2025

https://github.com/reubano/csv2ofx

A Python library and command line tool for converting csv to ofx and qif files

cli csv data featured finance library ofx qif

Last synced: 12 Apr 2025

https://github.com/ropensci-archive/opendata

⛔ ARCHIVED ⛔

cran data opendata r task-view

Last synced: 30 Jul 2025

https://github.com/robmarkcole/hass-data-detective

Explore and analyse your Home Assistant data

data data-science home home-assistant home-automation

Last synced: 16 May 2025

https://github.com/Swirrl/grafter

Linked Data & RDF Manufacturing Tools in Clojure

clojure data etl grafter linked-data rdf semantic-web

Last synced: 02 Apr 2025

https://github.com/mirador/mirador

Tool for visual exploration of complex data.

data exploratory-data-analysis tabular-data visualization

Last synced: 17 Jan 2026

https://github.com/teaxyz/chai

tea’s package dataset

data packages

Last synced: 16 May 2025

https://github.com/igorkamyshev/farfetched

The advanced data fetching tool for web applications

async data data-fetching effector fetch

Last synced: 07 Apr 2025

https://github.com/apache/texera

Collaborative Machine-Learning-Centric Data Analytics Using Workflows

artificial-intelligence data data-analytics data-science machine-learning texera workflow

Last synced: 15 Dec 2025

https://github.com/easystats/datawizard

Magic potions to clean and transform your data 🧙

data dplyr hacktoberfest janitor manipulation r-package reshape rstats tidyr wrangling

Last synced: 04 Apr 2025

https://github.com/streamthoughts/azkarra-streams

🚀 Azkarra is a lightweight Java framework that makes it easy to develop, deploy, and manage cloud-native streaming microservices based on Apache Kafka Streams.

apache-kafka azkarra-streams cloud-native data interactive-queries java kafka kafka-streams micro-framework microservices webui

Last synced: 19 Aug 2025

https://github.com/criccomini/awesome-infra

A curated list of infrastructure projects and companies.

ai awesome awesome-list data database infrastructure ml stream-processing streaming workflow

Last synced: 24 Jul 2025

https://github.com/taleshape-com/shaper

Build Data Dashboards all in SQL. Powered by DuckDB.

analytics dashboards data duckdb

Last synced: 12 Jan 2026

https://github.com/HubSpot/general-store

Simple, flexible store implementation for Flux. #hubspot-open-source

data dispatcher flux hubspot javascript react store

Last synced: 31 Mar 2025

https://github.com/stestagg/pytubes

A module for getting data into python from large data sources

cpp cpp11 cython data numpy python

Last synced: 06 Apr 2025

https://github.com/triggerdotdev/apihero

Make every API you use faster and more reliable with one line of code ⚡️

api data gateway http http-client observability proxy rest typescript

Last synced: 15 Apr 2025

https://github.com/unytics/airbyte_serverless

Airbyte made simple (no UI, no database, no cluster)

airbyte bigquery data data-analysis data-engineering data-warehouse elt etl pipeline

Last synced: 16 May 2025

https://github.com/looker/lookerbot

Lookerbot lets you access all your Looker data from Slack! Super fun!

chat chatbot data data-visualization looker slack slash-commands

Last synced: 01 Apr 2025

https://github.com/esri/geodev-hackerlabs

A place to learn how to build geo apps with the ArcGIS Platform.

arcgis-js-api arcgis-online arcgis-platform data design geodev javascript

Last synced: 18 Jul 2025

https://github.com/gambolputty/wikitable2csv

A web tool to convert Wiki tables to CSV 📈

converter csv data table wikipedia

Last synced: 16 Jan 2026

https://github.com/apis-is/apis

Making data readily available to anyone interested

api data iceland javascript node public-data

Last synced: 13 May 2025

https://github.com/ropensci/rgbif

Interface to the Global Biodiversity Information Facility API

api biodiversity data gbif lifewatch oscibio r r-package rstats species spocc

Last synced: 12 Apr 2025

https://github.com/jonschlinkert/data-store

Easily get, set and persist config data. Fast. Supports dot-notation in keys. No dependencies.

cache conf config configstore data javascript json nodejs persist store stort

Last synced: 04 Apr 2025

https://github.com/dagster-io/mdsfest-opensource-mds

Demo Project for Open Source MDS

dagster data duckdb mds modern stack

Last synced: 29 Dec 2025

https://github.com/luojilab/datatranshub

Cross-platform Android/iOS component for reporting massive volumes of data, built on and improving Xlog to address its pain points.

android data data-report ios logger xlog

Last synced: 21 Aug 2025

https://github.com/censusreporter/census-api

The home for the API that powers the Census Reporter project.

census data

Last synced: 27 Jul 2025

https://github.com/oxinabox/DataDeps.jl

reproducible data setup for reproducible science

data data-science open-science

Last synced: 13 Nov 2025

https://github.com/ropensci/dataspice

🌶️ Create lightweight schema.org descriptions of your datasets

data dataset metadata r r-package rstats schema-org unconf unconf18

Last synced: 13 Jul 2025

https://github.com/bacinger/f1-circuits

A repository of Formula 1™ circuits in GeoJSON format.

data data-repository f1-circuits formula-1 geojson geojson-data geojson-format

Last synced: 15 Apr 2025

https://github.com/edwindj/daff

Diff, patch and merge for data.frames, see http://paulfitz.github.io/daff/

daff data diff r

Last synced: 14 May 2025

https://github.com/datafusion-contrib/datafusion-dft

Batteries included CLI, TUI, and server implementations for DataFusion.

arrow cli data database datafusion tui

Last synced: 10 May 2025

https://github.com/kedarvj/mysql-random-data-generator

This is the easiest MySQL random test data generator tool. Load the procedure and execute to auto detect column types and load data.

data dataset dummy-data dummy-data-generator mysql procedure random-generation testdata testdatabuilder

Last synced: 16 Mar 2025

https://github.com/pydap/pydap

A Python library implementing the Data Access Protocol (DAP, aka OPeNDAP).

dap data dods opendap science

Last synced: 21 Oct 2025

https://github.com/asad70/wallstreetbets-sentiment-analysis

This program finds the most-mentioned tickers on r/wallstreetbets and uses the VADER SentimentIntensityAnalyzer to score their sentiment.

algotrading analysis data docker-container reddit sentiment-analysis trading vader-sentimentintensityanalyzer wallstreetbets wallstreetbets-sentiment-analysis

Last synced: 23 Oct 2025

https://github.com/Appsilon/data.validator

validate your data and create nice reports straight from R

data r reporting rhinoverse rstudio validation

Last synced: 06 May 2025

https://github.com/Tauffer-Consulting/domino

User friendly and open source platform for workflow creation and monitoring

ai airflow containers data gui kubernetes open-source python workflows

Last synced: 22 Apr 2025

https://github.com/ingoscholtes/pathpy

pathpy is an open-source Python package for the modeling and analysis of pathways and temporal networks using higher-order and multi-order graphical models

analysis data data-mining graph graphical-models machine-learning model-selection multi-order network-analysis networks pathways python sequential-data temporal-correlations temporal-networks

Last synced: 25 Sep 2025

https://github.com/leinelissen/aeon

📡 Scan the internet for your personal information and modify or remove it

data electron gdpr git

Last synced: 05 Apr 2025

https://github.com/robjhyndman/fpp3

All data sets required for the examples and exercises in the book "Forecasting: principles and practice" (3rd ed, 2020) by Rob J Hyndman and George Athanasopoulos <http://OTexts.org/fpp3/>. All packages required to run the examples are also loaded.

cran data forecasting r

Last synced: 19 Jul 2025

https://github.com/join-monster/join-monster-graphql-tools-adapter

Use Join Monster to fetch your data with Apollo Server.

apollo batch data graphql join schema sql

Last synced: 11 Jul 2025