An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/rbruinier/mysqlbulkimportbenchmark

Benchmarks of several methods for importing large data sets into MySQL tables

benchmark data database mysql php

Last synced: 02 May 2026

https://github.com/tushar2704/insurance-cross-sell

This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.

data datascience h20automl machine-learning mlflow python streamlit-tushar2704

Last synced: 08 Oct 2025

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 10 Apr 2026

https://github.com/dominhduy09/my-links

All the links and websites I have created, saved in one place

data database link linked-list linktree list save storage website

Last synced: 25 Jun 2026

https://github.com/lunastev/wson-rust

WSON data serialization parser

data parser serialization

Last synced: 07 Apr 2025

https://github.com/woctezuma/download-steam-screenshots-data

Data consisting of Steam screenshots.

data steam steam-api

Last synced: 19 Feb 2026

https://github.com/jackokring/www

Generic www flask server with phinka module

compression data flask phinka python

Last synced: 16 Jan 2026

https://github.com/real-veersandhu/cia-country-comparison

Data analysis system on the CIA World Factbook

data

Last synced: 25 Feb 2025

https://github.com/marcelo-earth/h5n8-data

🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.

csv data h5n8 h5n8-cases h5n8-virus russia

Last synced: 19 Jan 2026

https://github.com/ishaansathaye/data40x-1_2_3

Fall 2025 Cal Poly Data 401 Data Science Process and Ethics, 402 Mathematical Foundations of Data Science, 403 Projects Lab

capstone-prep data data-science ethics lab python

Last synced: 04 May 2026

https://github.com/ybelenko/openapi-data-mocker-server-middleware

PSR-15 HTTP Server Middleware to create mock responses from OpenAPI Schemas(OAS 3.0).

data fake faker middleware mock mocker oas oas3 openapi psr-15 swagger

Last synced: 15 Jun 2025

https://github.com/perceptronv/miscellaneous

A huge variety of materials, mostly training data for AI. Not a lot of source code yet.

data gan machine-learning nlp text-generation

Last synced: 04 May 2026

https://github.com/kucingkode/dmerge

Small JavaScript library that helps you merge identically formatted data in a string

cithak data data-merge javascript library lightweight lightweight-javascript-library merge open-source

Last synced: 04 May 2026

https://github.com/avahoffman/dataplay

🤸‍♂️ Load data to play with

data data-package r r-package rstats

Last synced: 25 Mar 2025

https://github.com/issacto/animmender

Deployed Web App

angularjs anime data

Last synced: 05 May 2026

https://github.com/stdlib-js/ndarray-base-output-policy-str2enum

Return the enumeration constant associated with an output ndarray data type policy string.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils

Last synced: 15 Apr 2026

https://github.com/thiagopanini/datadelivery

An open source Terraform module that provides a complete infrastructure toolkit so that users can begin exploring Analytics services on AWS.

analytics athena aws catalog crawler data datamesh glue s3 terraform

Last synced: 29 Nov 2025

https://github.com/igorskyflyer/npm-adblock-header-extract

✂️ Parse and extract ad-block filter list headers with ease. Works on strings or files, trims whitespace, and returns clean metadata for tooling and automation. 📃

adblock back-end biome data filter header igorskyflyer javascript js metadata node nodejs npm string ts typescript utility

Last synced: 11 Mar 2026

https://github.com/spiceai/datasets

Spice AI curated dataset definitions for Spice.ai

ai bitcoin blockchain data ethereum polygon

Last synced: 20 Apr 2026

https://github.com/goncaloperes/datavisualization

Here I will share some of my data visualizations using a variety of datasets, technologies and tools.

d3js data dataset datavisualization dataviz ggplot matplotlib rawgraphs seaborn tableau visualization yellowbrick

Last synced: 04 Feb 2026

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2

Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.

data data-analysis deloitte excel forensic-analysis

Last synced: 06 Feb 2026

https://github.com/jinsyin/datagovernance

WeChat official account: 「数据之道」 (The Way of Data)

data data-governance datagovernance governance

Last synced: 30 Jan 2026

https://github.com/rayenfathallah/students_analysis

This project contains an analysis of the different factors affecting students' performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.

analysis d3 data education javascript python students

Last synced: 12 Apr 2026

https://github.com/bilalmehrban/data-log-monitor

A simple yet elegant desktop C# application based on a 3-tier architecture, designed for viewing logs stored in a database by NLog or other logging frameworks.

csharp data desktop-app logging

Last synced: 14 Mar 2025

https://github.com/scienxlab/datasets

Some small datasets for demos, courses, testing, etc.

data open-data sample-data teaching-resources

Last synced: 09 Oct 2025

https://github.com/cliffano/birthmap

Mapping birth places of groups of prominent people

birthmap data maps

Last synced: 22 Jun 2026

https://github.com/zituocn/dean

Task flow framework for data processing

data golang task

Last synced: 18 Jan 2026

https://github.com/kamal-singh22/ai-driven-emotional-sentiments-analysis

This project leverages machine learning to analyze and classify the emotional sentiment of textual data. The goal is to accurately identify and categorize emotions, aiding applications in customer feedback analysis, social media sentiment analysis, and mental health monitoring.

analysis artificial-intelligence data emotion nlp-machine-learning python sentiment-analysis streamlit text-classification

Last synced: 14 Apr 2026

https://github.com/tbrowder/classfactory

Provides tools to create a data collection with classes to manipulate the persistent data.

class data persistent raku

Last synced: 04 Apr 2025

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/kirkalyn13/portfolio-dashboard-site

Portfolio Site; Initially a Service Provider Metrics Dashboard using React.

dashboard data data-visualization react

Last synced: 15 Apr 2026

https://github.com/lookininward/data-formatter-demo

You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.

csv data demo files json ndjson python txt unittest

Last synced: 27 Apr 2026

https://github.com/grycap/cdmi-client-go

A basic Go library to perform CDMI core operations

cdmi cloud data go

Last synced: 21 Jan 2026

https://github.com/neelravi/fairtool

A CLI tool for FAIR processing of computational materials science data.

computational data data-analytics fair management materials physics python science

Last synced: 14 Jan 2026

https://github.com/stdlib-js/strided-base-dtype-str2enum

Return the enumeration constant associated with a strided array data type string.

array data dtype dtypes enum javascript multidimensional node node-js nodejs stdlib strided types util utilities utility utils

Last synced: 30 Apr 2025

https://github.com/tupizz/data-processing-pipeline-aws

This project is a serverless application built with the Serverless Framework, TypeScript, and AWS services. It provides an enrichment service that processes contact information and enriches it with additional data.

aws data pipeline serverless typescript

Last synced: 13 May 2026

https://github.com/diegoperea20/own_dataset_segmentation_yolov8

Object segmentation and detection with a custom dataset using YOLOv8; the dataset consists of images of a 2023 Colombian 200-peso coin.

coins colombia data opencv own python segmentation tensorflow yolov8

Last synced: 12 Apr 2026

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package built on StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/aniketkkajania/wassupanalyzer

WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.

data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat

Last synced: 25 Feb 2026

https://github.com/stdlib-js/ndarray-base-to-reversed

Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view

Last synced: 12 Apr 2026

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/jub0t/eso

An application to manage all your Encryption & Decryption keys and other related tools.

data encryption encryption-decryption hacking hacking-tool keys pgp privacy private

Last synced: 07 Feb 2026

https://github.com/fredhutch/gdscnsoilsites

Homepage for BioDIGS Project. Learn about the project and download data.

biodigs data metagenomics student-research

Last synced: 25 Mar 2025

https://github.com/rafaelfloressouza/Covid-19-Dashboard

Python web application that displays worldwide COVID-19 data using Plotly and Dash

bootstrap covid-19 css data datavisualization plotly-dash python3

Last synced: 10 Mar 2025

https://github.com/giladbarnea/to

A simple CLI tool to convert and diff between JSON, YAML, TOML, JSON5 and Python collections.

conversion data data-conversion json json5 parser script terminal toml yaml

Last synced: 08 Feb 2026

https://github.com/bastianolea/comisarias_chile

Database of police stations (comisarías), outposts (retenes), substations (tenencias), and other Carabineros facilities

chile data estado social

Last synced: 23 Jun 2025

https://github.com/spine-tools/metreload

Python application for downloading meteorological reanalysis data

data python reanalysis

Last synced: 01 Jul 2025

https://github.com/stdlib-js/array-base-none-by

Test whether all elements in an array fail a test implemented by a predicate function.

all array data every generic javascript node node-js nodejs predicate stdlib structure test types validate

Last synced: 15 Apr 2026
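
The check described in the array-base-none-by entry above can be illustrated with a short sketch. This is written in Python rather than the package's JavaScript and does not reproduce its API; the names `none_by` and `is_negative` are hypothetical and exist only for illustration.

```python
from typing import Callable, Iterable, TypeVar

T = TypeVar("T")


def none_by(items: Iterable[T], predicate: Callable[[T], bool]) -> bool:
    """Return True when every element fails the predicate (hypothetical helper)."""
    # `any` stops at the first element that passes, so the scan short-circuits.
    return not any(predicate(item) for item in items)


def is_negative(value: float) -> bool:
    """Example predicate: passes only for values below zero."""
    return value < 0
```

With these definitions, `none_by([1, 2, 3], is_negative)` is true because no element passes the predicate, while a list containing a negative number yields false. An empty input returns true, since no element passes.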

https://github.com/xdrokra/road-accident-analytics

A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023

analytics data design italy javascript

Last synced: 30 Aug 2025

https://github.com/aranfononi/h4x0r-news-section-17-project

A SwiftUI-powered app that displays top stories from Hacker News. Users can open articles directly within the app, utilizing SwiftUI’s NavigationLink and custom WebView integration.

app-development data data-binding data-binding-library ios swift swiftui xcode

Last synced: 18 May 2026

https://github.com/priyanshubiswas-tech/aws-etl-pipeline-on-cloud-using-glue-athena-lambda-and-redshift

Serverless ETL pipeline on AWS using Glue, Lambda, Athena, and Redshift — automates data ingestion, transformation, and analytics with scalable, event-driven architecture.

athena aws aws-glue data data-engineering etl etl-pipeline lambda redshift

Last synced: 02 May 2026

https://github.com/ajityadav2621/datadoom

Backend work is in progress. The user interaction part is finished, so the project has been updated and deployed for reference. More features will be added.

ai data

Last synced: 09 Feb 2026

https://github.com/lmuffato/project-ting-trybe

Ting project - Trybe graded project for Block 37: Data Structures II: Lists, Queues and Stacks

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/mitevpi/vue-d3-bar-chart

Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.

d3 data data-visualization frontend interactive svg vue web

Last synced: 18 May 2026

https://github.com/ginga1402/chinook_database

Microsoft SQL Server Management Studio

business-query data sql-server

Last synced: 30 Mar 2025

https://github.com/SAP-archive/signavio-qualtrics-di

Set up an SAP Data Intelligence data pipeline to connect Qualtrics survey data to SAP Signavio Process Intelligence via the Ingestion API.

data intelligence process-intelligence qualtrics sample sap-data-intelligence sap-signavio-process-intelligence signavio

Last synced: 09 May 2025

https://github.com/definetlynotai/vulnscan_data

Logicytics VulnScan Module's Training Data and old model archive

ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data

Last synced: 11 Oct 2025

https://github.com/gbv/cocoda-mappings

concordances, mappings and conversion scripts to create JSKOS mappings

coli-conc data jskos

Last synced: 28 Oct 2025

https://github.com/canelmas/data-producer

Fake data producer for Kafka, console and http endpoints

data fake-content fake-data fakerjs kafka kafka-producer

Last synced: 05 Apr 2025

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg, a progressive fork of Grafana 7.5.17, continues to be available under the open source Apache 2.0 License.

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 19 Jan 2026

https://github.com/mikezange/laravel-encryptable

A simple encryptable trait for encrypting model fields in Laravel

data encrypt field gdpr laravel model trait

Last synced: 16 May 2026

https://github.com/inc44/raqua

Raqua 💧, a set of Python scripts and a Rust program, is designed to scan an ocean of disk copies and retrieve files lacking conventional signatures by creating an overflowing cache

cli console data data-recovery files linux macos python python3 recovery rust search terminal tool windows

Last synced: 11 Apr 2026

https://github.com/jhpoelen/bats

Self-documenting data publication on bat (Chiroptera) specimens

biodiversity data natural-history-collections provenance specimen

Last synced: 18 Mar 2026

https://github.com/masu-baumgartner/dbsync.net

A C# MySQL model sync library

cshap data library mysql

Last synced: 13 May 2026

https://github.com/gmersy/data-carbon

Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data

carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software

Last synced: 31 Mar 2025

https://github.com/stefanpietrusky/facts

Repository for the article in the online magazine Data Science Collective.

ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver

Last synced: 09 May 2026

https://github.com/edugmenes/azure-data-engineering

This repository contains my first end-to-end Data Engineering project, built using Microsoft Azure Cloud and Azure Databricks with PySpark.

azure cloud data data-engineering data-lakehouse data-structures databricks delta-lake etl-pipelines lakehouse lakehouse-architectures medallion-architecture microsoft-azure pyspark spark

Last synced: 29 Jan 2026

https://github.com/willdev12/rjson

Encryptable JSON file format for .NET projects!

csharp csharp-library data dotnet json json-data json-plugin variables vbdotnet vbnet

Last synced: 11 Apr 2026

https://github.com/rohancyberops/rp1

This project performs an analysis of Starbucks (SBUX) stock returns using R. The analysis includes both simple returns and continuously compounded returns (CC returns) for a period of one month. It also calculates the growth of $1 invested in SBUX and provides visual insights through various plots.

analysis cc data r rlanguage sbux

Last synced: 15 Mar 2025
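
The two kinds of return named in the rp1 entry above are related by a logarithm, which a short sketch can make concrete. The project itself uses R; this Python version, its function names, and any prices passed to it are assumptions for illustration and are not taken from the repository.

```python
import math
from typing import List


def simple_returns(prices: List[float]) -> List[float]:
    """Simple return per period: P_t / P_{t-1} - 1."""
    return [curr / prev - 1.0 for prev, curr in zip(prices, prices[1:])]


def cc_returns(prices: List[float]) -> List[float]:
    """Continuously compounded return per period: ln(P_t / P_{t-1})."""
    return [math.log(curr / prev) for prev, curr in zip(prices, prices[1:])]


def growth_of_one(prices: List[float]) -> List[float]:
    """Value over time of 1 unit of currency invested at the first price."""
    growth = [1.0]
    for r in simple_returns(prices):
        # Each period multiplies the running value by (1 + simple return).
        growth.append(growth[-1] * (1.0 + r))
    return growth
```

Continuously compounded returns add across periods, so their sum equals the log of the final growth factor, whereas simple returns compound multiplicatively.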

https://github.com/ferhatgec/tuc

TinyURL CLI: generate short links from the terminal.

data little python3 request script

Last synced: 18 Feb 2026

https://github.com/maxnowack/elastic-sync

Connector to sync MongoDB documents into an Elasticsearch index

data elasticsearch mongodb sync

Last synced: 20 Jan 2026

https://github.com/agavitalis/sample-c-codes

A collection of small projects I carried out on Arduino as an electronic engineering student, despite falling in love with website development.

ageteller atm binary data gpcalculator logging

Last synced: 09 Apr 2025

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/bastianolea/campamentos_chile

Data from the 2024 national registry of informal settlements (Catastro de campamentos nacional 2024), from the Ministry of Housing and Urbanism (Ministerio de Vivienda y Urbanismo)

chile comunas data pobreza social

Last synced: 24 Aug 2025

https://github.com/flowsynx/plugin-csv

FlowSynx plugin that reads and writes CSV files, enabling easy batch data import/export operations and integration with spreadsheet-based data workflows.

comma-separated-values csv data data-platform flowsynx

Last synced: 10 Mar 2026

https://github.com/undistraction/grid-model

A small API for creating a grid and accessing the positions of the cells, rows and columns within it.

2d calculations cells data grid layout model

Last synced: 04 Aug 2025

https://github.com/stdlib-js/array-base-assert-is-real-floating-point-data-type

Test if an input value is a supported array real-valued floating-point data type.

array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate

Last synced: 12 Oct 2025