An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bluegreen-labs/oneflux_containers

Containerized (docker) versions of the ONEFlux processing pipeline

data ecosystem fluxes micrometeorology processing

Last synced: 07 Oct 2025

https://github.com/justfairdev/web-stack-query

🤖 Powerful asynchronous state management, server-state utilities and data fetching for the web. TS/JS, React Query, Solid Query, Svelte Query and Vue Query.

async cache data fetch graphql hooks query react resources rest

Last synced: 09 May 2026

https://github.com/datalayer/desktop

Ξ 🖥️ Datalayer Destkop.

ai data data-analysis data-science datalayer desktop electron

Last synced: 25 Oct 2025

https://github.com/anthonykrivonos/ts-algo-masterclass

👾 Giant TypeScript algorithm and data structure masterclass to be constantly updated with important CS concepts.

algorithm class-project computer concepts data data-structures fundamentals giant library masterclass science structures typescript

Last synced: 11 May 2026

https://github.com/plurid/deon

DeObject Notation Format

data file-format

Last synced: 12 Apr 2025

https://github.com/arda-guler/binsonograph

Encode any binary file into an audio file. Sister project of https://github.com/arda-guler/binGallery

audio converter data encoder proof-of-concept sonification sound

Last synced: 21 Jun 2025

https://github.com/devtin/duckfficer

Zero-dependencies light-weight library for modeling, validating and sanitizing data 🦆 🐵 👁

coercion data duck-typing json parsing schema validation

Last synced: 01 Mar 2025

https://github.com/lilingxi01/bloark

Blocks Architecture (BloArk) project package for building Blocks-0 dataset and way beyond.

architecture bloark data revision-based

Last synced: 05 Apr 2026

https://github.com/yezz123/awsflowutils

Improve your data workflow with enhanced simplicity and robustness in handling common data tasks ✨

aws data redshift s3 s3-bucket workflow

Last synced: 07 Jan 2026

https://github.com/latex-lsp/latex-completion-data

A data set that can be used to provide code completion for LaTeX

completion data latex

Last synced: 10 Jun 2025

https://github.com/gematik/spec-isik-connect

The ISiK module "Security" (Sicherheit) provides FHIR resources as a specification and an implementation guide. The security module includes two use cases that allow authentication and authorization of different actors within the hospital so that secured data can be read from or stored in an ISiK FHIR endpoint.

data fhir isik specification

Last synced: 19 Apr 2025

https://github.com/mrlynn/30-min-data-web-form

30 Minutes to a Data Enabled Web Form with MongoDB

beginner data html html-form javascript mongodb mongodb-atlas mongodb-database web webforms

Last synced: 15 Apr 2026

https://github.com/knightchaser/mitreattackscrapper

A simple scrapper for MITRE ATT&CK information written in Python3.

cti data json package pypi scrapper

Last synced: 07 May 2025

https://github.com/gematik/spec-ISiK-Connect

The ISiK module "Security" (Sicherheit) provides FHIR resources as a specification and an implementation guide. The security module includes two use cases that allow authentication and authorization of different actors within the hospital so that secured data can be read from or stored in an ISiK FHIR endpoint.

data fhir isik specification

Last synced: 16 Apr 2025

https://github.com/snowplow-archive/iglu-ruby-client

Ruby and JRuby client for Iglu

data iglu json json-schema snowplow validation

Last synced: 20 Apr 2025

https://github.com/ejfox/election-helpers

A collection of resources, tools, and patterns for election data analysis and viz

data elections helpers

Last synced: 15 Apr 2025

https://github.com/definetlynotai/llm_data

A bunch of very famous repos source code's in python as pure localdocs all in this repo to train CODE AI

c code-examples cpp cuda data data-dum jupyter-notebook llm llm-code llm-datasets programming-data programming-data-sets python3

Last synced: 08 Oct 2025

https://github.com/ravi-prakash1907/caterapp

A Quick & Secured Data Sharing Application!

application cater caterapp data data-sharing pip python

Last synced: 06 Sep 2025

https://github.com/maltzsama/sumeh

Sumeh — Unified Data Quality Framework Sumeh is a unified data quality validation framework supporting multiple backends (PySpark, Dask, Polars, DuckDB, Pandas) with centralized rule configuration.

dask-dataframes data data-quality data-quality-analysis data-quality-assessment data-quality-checks data-quality-framework data-quality-measurement data-quality-report duckdb duckdb-extension pandas pandas-library polars polars-dataframe polars-extensions pyspark pyspark-dataframes

Last synced: 09 Mar 2026

https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021

This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.

data data-analysis data-visualization powerbi sql-server

Last synced: 25 Feb 2026

https://github.com/baked-libs/bstats-discord-integration

A simple program which queries https://bstats.org/ and presents this data in a highly customizable discord webhook

bstats data discord discord-webhook javascript minecraft notifications paper plugin spigot statistic stats typescript webhook

Last synced: 28 Apr 2025

https://github.com/thitlwincoder/browser_data

Dart package to retrieve browser's data.

bookmark browser dart data flutter history package

Last synced: 23 Feb 2026

https://github.com/corentinb/txtoredis

:fire: Push each line of a text file, to a Redis set

data datascience dataset go golang redis set

Last synced: 24 Apr 2026

https://github.com/skhg/weather-station

🌥 ESP8266-based personal weather station project

air-quality arduino cpp data electronics environment esp8266 weather

Last synced: 16 Jan 2026

https://github.com/mutasim77/dbt-analytics

🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.

big-query data data-analysis dbt warehouse

Last synced: 25 Feb 2026

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 17 Apr 2025

https://github.com/djthorpe/data

Data extraction, transformation, processing and visualisation

canvas csv data data-extraction data-transformation dom golang svg visualization

Last synced: 07 Sep 2025

https://github.com/phelipe-sempreboni/data-engineering

Repository for tutorials, information, notes and projects about data engineering.

data dataengineering engine engineering enviroment etl etl-pipeline pipeline project python

Last synced: 04 Oct 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 13 Jun 2025

https://github.com/arcticsnow/climatepy

Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)

climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray

Last synced: 05 Feb 2026

https://github.com/synthead/timex-datalink-assembler

Toebes' Timex Datalink WristApp assembler wrapped in a Docker image with Wine

150 150s 6800 6805 assembler compiler data data-link datalink docker link timex toebes wine wristapp

Last synced: 10 May 2026

https://github.com/junkwaxhero/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 24 Apr 2025

https://github.com/figuran04/big-data

📃 Praktikum Big Data

anaconda big data hadoop hive mongodb pig spark

Last synced: 21 Jan 2026

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 04 Apr 2026

https://github.com/efark/data-receiver

Small web server to receive data through HTTP requests.

data gin-gonic go golang http

Last synced: 23 Feb 2026

https://github.com/mmaithani/loan-approvel-ml-model-with-insights

This project will approved or reject the loan applications. Public api, data insights and predictive models for loan prediction project are also provided

data data-science loan-prediction-analysis machine-learning visualization

Last synced: 16 Aug 2025

https://github.com/OliverHennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 30 Jul 2025

https://github.com/secret-guest/file_organizer

Files Organizer is a versatile tool for sorting and organizing files efficiently, ideal for managing recovered data.

c c-development data data-recovery file-management file-manager files sorting sorting-algorithms subdirectories subdirectory

Last synced: 10 Jun 2026

https://github.com/xtrendence/comp2001-coursework

Grade: 98%. COMP2001 Coursework by Khodadad (Adrian) Nouchin. A RESTful authentication API, and a linked data application.

api asp-net csharp data dataset linked-data php restful restful-api

Last synced: 13 Apr 2026

https://github.com/fabriziosalmi/phpapi

A public, free and simple interface to get data.

api data php public service

Last synced: 07 Apr 2025

https://github.com/debdutto/algorhythm

Algorithmic music driven by data and / or algorithms

algorithm data music nodejs

Last synced: 18 Apr 2026

https://github.com/alexandregazagnes/unilasalle-public-resources

UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python

data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization

Last synced: 28 Apr 2026

https://github.com/ballerina-platform/module-ballerina-data.csv

The Ballerina CSV Data Library is a comprehensive toolkit designed to facilitate the handling and manipulation of CSV data within Ballerina applications. It streamlines the process of converting CSV data to native Ballerina data types, enabling developers to work with CSV content seamlessly and efficiently.

ballerina ballerina-csv csv csv-data data

Last synced: 29 Jan 2026

https://github.com/olegegoism/datagenerator

Django web application for managing database connections and generating test data.

app application big-data csv data database dataset db django fake generator schema teable work

Last synced: 26 Oct 2025

https://github.com/a3r0id/lightshot-data-miner

A random idea I had a while back to make a data miner for lightshot. Never released this but after a friend sent me a post about lightshot's transparency I figured it'd be a good time to release this. I've included some output from a run before making the repo. I am not responsible for the imagery or it's contents.

brute-force bruteforce data dataset face-recognition image-processing lightshot mining scraper scraping text-recognition

Last synced: 19 Oct 2025

https://github.com/ium101/files-and-folders-lister-z

Files and Folders Lister Z is a utility for listing the contents of directories on your computer. It provides both a command-line and a graphical user interface (GUI) for easy use.

application application-code brasil brazil cmd command data database databases exe filemanagement filesystem linux lowcode macos python sh tool utility windows

Last synced: 09 Oct 2025

https://github.com/ssiarhei115/customer-classification

Developing ML model predicting bank' customer inclination to open a deposit

big-data big-data-analytics data data-science data-visualization mashine-learning

Last synced: 09 Apr 2025

https://github.com/plabayo/datapoints.earth

Earth data liberation for and by its citizens.

data foss free scrape

Last synced: 15 Mar 2026

https://github.com/mystpi/crossings

🌉 A tiny library focused on easily connecting JS to HTML.

connect data frontend html javascript reactive simple small tiny

Last synced: 10 Jun 2026

https://github.com/georgetdn/syscppcplinux

Store Linux C++ class data in a file ( persistence ) and manipulate it programmatically or using Small SQL (included)

class data framework linux object persistence serialize sql

Last synced: 12 Feb 2026

https://github.com/pawelzny/vo

DDD Value Object implementation

data ddd-patterns object python3 value

Last synced: 15 Feb 2026

https://github.com/espoirmur/balobi_nini

An End to End Data Science Project, where I used Tweepy and Airflow to collect tweets related to the DRC and topic modeling technics to discover which topics Congolese are talking about on Twitter.

data nlp nlp-machine-learning

Last synced: 24 Aug 2025

https://github.com/imtiaz-emu/exploratory-data-analysis-with-r

Data Transformation, Descriptive statistics, data visualization, Linear regression using R

data dplyr ggplot2 r rstudio visualization

Last synced: 15 Mar 2025

https://github.com/camara94/data-visualization-with-python

Data visualization and some of the best practices when creating plots and visuals. The history and architecture of Matplotlib, and how to do basic plotting with Matplotlib. Generating different visualization tools using Matplotlib such as line plots, area plots, histograms, bar charts, box plots, and pie charts. Seaborn, another data visualization library in Python, and how to use it to create attractive statistical graphics. Folium, and how to use to create maps and visualize geospatial data.

data data-science data-structures data-visualization python3

Last synced: 16 May 2026

https://github.com/countervolts/apple-music-stats-calculator

how to get your most streamed songs/artists

apple apple-music applemusic calculator data

Last synced: 11 Feb 2026

https://github.com/pjt3591oo/exchange-crawler

업비트, 코인원 크롤러

crawler data exchange python

Last synced: 27 Oct 2025

https://github.com/dantesc03/uberpool-case-study

This project was designed to understand the statistical effects of longer wait times on uber rides. Particularly on the user and driver experience with the Uber Pool System.

analysis data excel jupyter jupyternotebooks learn python seaborn statistics t-tests uber visualization

Last synced: 16 Apr 2026

https://github.com/slipke/eurlex-model-go

This projects implements the EUR-Lex XML data model in Golang. For more information see README.md

data datamodel eur-lex eurlex webservice

Last synced: 09 Mar 2026

https://github.com/bastgau/snow-revoke-privileges

Script designed to simplify the management of permissions in your Snowflake databases.

data database dba dev-container python snowflake

Last synced: 20 Apr 2025

https://github.com/chaitanyac22/hr_policy_query_resolution_with_retrieval_augmented_generation_rag

This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.

artificial-intelligence data hr large-language-models llm mistral-7b nlp pipeline prompt-engineering quantization rag retrieval-augmented-generation

Last synced: 12 Feb 2026

https://github.com/jinsyin/datalink

⚡ 数据集成 | DataLink is a lightweight data integration framework build on top of DataX, Spark and Flink

batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming

Last synced: 19 Jul 2025

https://github.com/blakedrumm/scvmm-scripts-and-sql

The Scripts provided here are compatible with System Center Virtual Machine Manager

collector data powershell scripts scvmm sql

Last synced: 11 May 2025

https://github.com/mitranim/gt

Short for "Go Types". Important data types missing from the Go standard library.

data date datetime go golang interval json null-safety sql time url uuid

Last synced: 10 May 2026

https://github.com/squareslab/probabilisticmodel_saner2018

Paper and supporting materials of the Probabilistic Model paper Accepted to SANER 2018

code data mausotog published replication

Last synced: 26 Oct 2025

https://github.com/tusharnankani/analysis-2.0

An Exhaustive WhatsApp Chat Data Analysis 2.0

analysis data data-science plots trends visualization

Last synced: 31 Mar 2025

https://github.com/andrey-tech/data-storage-php

Простое хранилище данных в виде ключ-значение в JSON-файлах с разделяемой блокировкой на чтение и эксклюзивной блокировкой на запись.

data data-storage files json php php7 storage

Last synced: 29 Apr 2026

https://github.com/mrsaeeddev/data-science-roadmap-for-beginners

📈 A minimal and easy road map for beginners who want to dive into the field of Data Science

data data-science datascience python

Last synced: 29 Jun 2025

https://github.com/socketsupply/dynavolt

A highly opinionated DynamoDB client for aws-sdk v3 using esm.

aws data database dynamo dynamodb key-value kvstore

Last synced: 05 May 2026

https://github.com/skywarth/fenrir-wolfpack-simulator

Simulating wolfpack behaviours and future of the pack in an environment using Javascript and data trees.

data data-structures javascript max-heap simulation simulations wolfpack

Last synced: 14 Oct 2025

https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation

we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.

algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning

Last synced: 05 Feb 2026

https://github.com/jesusgraterol/bitcoin-blockchain-dataset-builder

The dataset builder script extracts all the relevant block information from the Bitcoin Blockchain through Mempool.space's public API. The data is stored in a .csv file, facilitating its use in data science and machine learning projects.

bitcoin blockchain blockchain-technology data datascience datascience-machinelearning dataset dataset-generation machine-learning

Last synced: 06 May 2026

https://github.com/openpeeps/zxc-nim

Bindings to the ZXC compression library, a LZ77-based compressor optimized for high decompression speed

archive compression compressor data decompression game-assets lossless lossless-compression lz77 nim nim-bindings nim-package nim-wrapper openpeeps zxc

Last synced: 07 Jun 2026

https://github.com/oliverhennhoefer/shiny-template-interactive-table

Example of interactively adding rows / deleting rows by selecting directly in a data.table (DT) in Shiny

button data delete dt r select selection server shiny shiny-applications shiny-apps shiny-r shinyapps table ui userinterface

Last synced: 16 Apr 2026

https://github.com/tayeva/eia-client-python

EIA Open Data API Client - Python

data open-source python python-3 python3

Last synced: 14 Oct 2025

https://github.com/lukekim/demo

Luke's Spice.ai demo app

ai data web3

Last synced: 18 Jan 2026

https://github.com/nononoexe/setariaviridis

🌾 Field-collected data of green foxtail

data data-science dataset rpackage

Last synced: 27 Feb 2026

https://github.com/techbureau/zaifdata

:blue_book: Data Reader for zaif Exchange

bitcoin blockchain cryptocurrency data exchange nem token trading xem zaif

Last synced: 19 Apr 2026

https://github.com/cptpiepmatz/tabledatamerge

🔀 Merge plain text tables together.

cli data format latex table tdm

Last synced: 24 Feb 2026

https://github.com/codecentric/reedelk-bookingintegrationservice

Example service for the blog post series about Reedelk

api api-gateway data integration integration-flow

Last synced: 16 Oct 2025

https://github.com/automators-com/datamaker-js

The official Node.js / Typescript library for the DataMaker API

data javascript nodejs typescript

Last synced: 11 Oct 2025

https://github.com/subnwa/sql

A starter code for creating a SQL database.

base data database master microsoft sql

Last synced: 06 Mar 2025