An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/LisaKey/convert-csv-to-sav

We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 03 Mar 2025

https://github.com/2kabhishek/pokemon-stats

Gotta stat 'em all 🖲🐭

d3 data emoji pokemon rollup statistics

Last synced: 14 May 2026

https://github.com/jackokring/www

Generic www flask server with phinka module

compression data flask phinka python

Last synced: 16 Jan 2026

https://github.com/stdlib-js/array-base-filled4d-by

Create a filled four-dimensional nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 07 Sep 2025

https://github.com/horisystems/uk_ev_data_analysis

Analysis of Electric Vehicle charging infrastructure in the United Kingdom.

data data-science electric-vehicles ev python uk united-kingdom

Last synced: 12 Jan 2026

https://github.com/sbdk-dev/sbdk.dev

A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.

ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer

Last synced: 27 May 2026

https://github.com/sergkash7/fdc-facade

Facade for The FoodData Central API.

api center data food usda

Last synced: 15 May 2026

https://github.com/umbaji/yodi

This is the official repository for Yodi, the speech recognition model for 8 words, in Ewè. The yodi package is also useful for rapid inference inference on speech data, especially on the mini_speech datasets.

data data-visualization keras python3 speech-recognition tensorflow

Last synced: 12 Jan 2026

https://github.com/mmaithani/singapore-residents-data-eda

The data contains Population by ethnicity, age and gender for the country of Singapore from the year 1957 to 2018

data data-visualization ethnicity kaggle-dataset python singapore singapore-residents-data

Last synced: 16 Apr 2026

https://github.com/dbrennand/rm-content

A Python 3.7 script to remove a specific string from all files and repos (owned by the user).

content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content

Last synced: 29 Mar 2025

https://github.com/exoticknight/juhe

simple way to analyze complex data in one chain call

aggregation aggregator analysis data statistic typescript

Last synced: 21 May 2026

https://github.com/clabe45/kaz

Minimalistic local storage cli

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/cosmos-loops/cosmos-dapper

Cosmos.Dapper is a part of Cosmos.Data, a inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.

dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver

Last synced: 11 Apr 2026

https://github.com/margostino/job-pulse

PoC to analyse the hiring market

data golang mongodb visualization

Last synced: 16 May 2026

https://github.com/xpotify/scraper

Scraper designed for Xpotify's client to gather information from websites🌟

axios cheerio data javascript scraper webscraper

Last synced: 07 Jul 2025

https://github.com/millengustavo/salarios-data-science

Aplicativo Streamlit de exploração dos dados da Pesquisa de mercado de Data Science feita pelo Data Hackers

brasil brazil ciencia-de-dados data data-science heroku salarios salary

Last synced: 07 Oct 2025

https://github.com/stdlib-js/ndarray-slice-assign

Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.

assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view

Last synced: 11 Apr 2025

https://github.com/puzzlef/hybrid-csr

Comparing space usage of regular vs hybrid CSR.

csr data graph hybrid regular space structure usage

Last synced: 06 Apr 2025

https://github.com/cobluestars/dataherd-raika

"Dataherd-Raika is a library designed to simulate large-scale user behavior datasets. It takes a single user event (like a click or keyword input) and, by applying simple probability distributions and custom variables, expands it into a vast dataset."

big-data data data-generation data-generator data-science front-end javascript machine-learning npm-package simulator statistics typescript user-behavior user-experience

Last synced: 02 Jan 2026

https://github.com/davidgamero/gatech-covid-chart

Line chart showing COVID19 cases per day at Georgia Tech

covid covid19 data gatech

Last synced: 04 Jul 2026

https://github.com/derstimmler/aokexporter

Exporter for data from the statutory health insurance company AOK

aok cocona console csharp data dotnet export polly

Last synced: 15 May 2026

https://github.com/cqllum/schema2dwh

⚡ Automatically produce a data model on your database using its information schema using GenAI.

ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design

Last synced: 13 Mar 2025

https://github.com/bredalis/datastructure

📚 Estructuras de Datos en Python

algorithms data data-structure python

Last synced: 12 Apr 2026

https://github.com/nitsc/spell-from-threebodytrilogy

Implemented the process of extrapolating from Gaia stellar data, to 3D visualizations, to three-views, to three-view signals, to three-view audio of signals, and even their inversions. This project proves the feasibility of the Logic (Luoji)'s “spell” from “The Three Body Problem” trilogy.

3d 3d-graphics astronomy astronomy-astrophysics audio audio-processing data data-science data-visualization gaia graph information-technology information-visualization numpy python python-3 python3 signal signal-processing visiualization

Last synced: 02 May 2026

https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau

Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.

data dataanalysis datavisualization sql strategy tableau

Last synced: 31 Mar 2025

https://github.com/grycap/cdmi-client-go

A basic Go library to perform CDMI core operations

cdmi cloud data go

Last synced: 02 Jul 2026

https://github.com/castelao/bufr

BUFR binary data format from WMO

binary data format meteorology oceanography wmo

Last synced: 13 Jul 2025

https://github.com/ayush585/fireducksblog

BLOG: Unlocking AI Efficiency: How FireDucks Revolutionizes Data Preprocessing

data processing

Last synced: 28 Apr 2026

https://github.com/tsvikas/covid-19-israel-data

Unofficial Github with the data published by The Israel Ministry of Health, regarding The Coronavirus disease

coronavirus-disease covid-19 csv daily-reports data health israel

Last synced: 05 Jan 2026

https://github.com/wamphlett/smart-data-objects

An easy solution for capturing and validating data into usable DTO's

data dto forms php php7 validation

Last synced: 17 May 2026

https://github.com/reiiyuki/once-data-manager

Once Data Manager is temporary data management utility kit for Unity.

data manager playerprefs preference scene temporary unity

Last synced: 17 May 2026

https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard

An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊

dashboard data data-analysis data-science data-visualization tableau tableau-public

Last synced: 17 Feb 2026

https://github.com/ttitcombe/timekeep

Defensive timeseries analysis in python

data data-science sklearn time-series time-series-analysis timeseries

Last synced: 05 Jan 2026

https://github.com/lmuffato/project-ting-trybe

Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas

data data-analysis python queue read-file stack trybe trybe-projects

Last synced: 12 Jun 2025

https://github.com/priyanka7411/customer-flight-prediction-app-mlflow

A comprehensive project predicting flight prices and customer satisfaction using machine learning models, deployed through interactive Streamlit apps.

classification customer-satisfaction data data-cleaning data-visualization feature-engineering flight-price-prediction machine-learning mlflow python regression streamlit

Last synced: 12 May 2026

https://github.com/hughrawlinson/github-data-scripts

Scripts to grab data about repos of interest to compare

data github-graphql github-repo-organizer graphql scripts typescript

Last synced: 09 Jul 2025

https://github.com/bacross/datamunger

python package for handling nan's and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 17 May 2026

https://github.com/marians/tour-tracker

Track the general classification development of the Tour De France, stage over stage

cycling data sports statistics

Last synced: 24 Jun 2025

https://github.com/rustytake-off/datasets

Various datasets for 🤗 HuggingFace

data datasets docs huggingface

Last synced: 27 Mar 2025

https://github.com/lmuffato/project-job-insights-trybe

Projeto job insights - Projeto avaliativo da Trybe do Bloco 32: Introdução à Python

data data-science data-transformation filter python

Last synced: 12 Jun 2025

https://github.com/bastianolea/fonasa_beneficiarios

Datos de beneficiarios del Fondo Nacional de Salud, por tramo del sistema, edad, tramo de edad, sexo, y comuna.

chile comunas data estado genero salud social

Last synced: 27 Feb 2026

https://github.com/denisecase/nw-network-data-analytics

Network for those earning a NW Masters of Applied Data Science

analytics data

Last synced: 02 Feb 2026

https://github.com/denko5/sales-analysis

A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.

africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql

Last synced: 24 Jan 2026

https://github.com/0xleif/onionstash

Store Onions 🧅

data swift

Last synced: 05 Apr 2025

https://github.com/bileljegham/api-sport-cli

Cli for https://api-sports.io/ Retreive data and convert to sql file

cli data database match nodejs sports sports-analytics

Last synced: 08 May 2026

https://github.com/nia-cloud-official/datascript

DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.

data data-scripting scripting-language

Last synced: 22 Jun 2025

https://github.com/jayantur13/kountry

Node module variant of the Country API

api data jsdelivr kountry nodejs npm npm-module npm-package unpkg yarn

Last synced: 26 Jan 2026

https://github.com/shgysk8zer0/schema

A PHP implementation of schema.org structured data objects

data microdata schema seo structured-data

Last synced: 24 Jun 2025

https://github.com/dostuffthatmatters/circadian-scp-upload

Resumable, interruptible, SCP upload client for any files or directories generated day by day

checksum daily data directories files library python scp ssh synchronization time-series upload utilities

Last synced: 24 Jun 2025

https://github.com/dennyglee/open-covid19-public

A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.

covid-19 data data-analytics data-engineering data-science nlp

Last synced: 22 Jun 2025

https://github.com/kockarevicivan/dot-net-snippets

Set of .NET code snippets: algorithms, data structures, graph searches etc, created for demonstration purposes.

algorithms binary c-sharp data generics graphs-pathfinding list structures

Last synced: 27 Mar 2025

https://github.com/bayer-group/cmc-ontologies

This is a submodule of cmc-knowledge-graph-setup. It contains ontologies and relevant data graph files

data ontologies owl turtle

Last synced: 16 Jun 2025

https://github.com/giscience/measures-rest-sparql

A SPARQL endpoint for the Measures REST OSHDB App framework.

data osm quality semantics sparql sparql-endpoints

Last synced: 24 Jun 2025

https://github.com/tsiarokhin/student_bsu_by

Tool for parsing various BSU student information from student.bsu.by website.

belarus bsu data grades python students study university

Last synced: 28 May 2026

https://github.com/muhammad-fiaz/ason

ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.

adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3

Last synced: 02 Feb 2026

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/lucaaszsx/spyder

A powerful schema-based web scraping library for Node.js built for fast, structured, and reliable data extraction.

cheerio crawler data dom dom-manipulation html json json-ld parser scraper web xml

Last synced: 11 Jun 2026

https://github.com/tushar2704/interview-quest

Interview-Quest is comprehensive collection of interview questions and answers that can help you prepare for technical interviews. Whether you're a seasoned developer looking to brush up on your skills or a job seeker preparing for your next big opportunity, this repository aims to provide valuable resources to enhance your interview readiness.

artificial-intelligence data data-science interview interview-questions machine-learning

Last synced: 23 Jan 2026

https://github.com/andygeiss/pipeline

Build your own data pipeline to gather, organize and transform data by using protobuf as an intermediate format.

data data-pipeline data-science go golang machine-learning protobuf protobuf-compiler

Last synced: 31 Mar 2025

https://github.com/chrisru/f1stats

🗄️ Speedy API for Formula 1 statistics

api data fast formula1

Last synced: 20 Mar 2025

https://github.com/exponea/exponea-python-sdk

⚠️ DEPRECATED Python SDK for Exponea Data and Tracking API

api data exponea python sdk

Last synced: 09 Apr 2026

https://github.com/yazeed44/reform-api

A platform that harnesses the power of multiple data streams including satellite imagery and drone photos to visualize multiple urban planning indices and provide descriptive analytics that will empower local Saudi authorities to make data-driven decision that contribute to neighborhood quality of life.

data geojson python

Last synced: 18 May 2026

https://github.com/dataship/beam

Get collimate'd data into Frame, in Node or the Browser

column-store data data-science

Last synced: 27 Apr 2026

https://github.com/hmeleiro/r_dataviz

Data visualization projects with R / Proyectos de visualización de datos con R

data dataviz r rmd-files social-science survey-data

Last synced: 21 Jun 2026

https://github.com/stdlib-js/array-base-every

Test whether all elements in an array are truthy.

all array data every generic javascript node node-js nodejs stdlib structure test types validate

Last synced: 07 May 2025

https://github.com/greatwoman23/market-basket-analysis

Unlock the power of data-driven sales optimization with Market Basket Analysis. Explore frequent itemsets and association rules to strategically enhance product placement, design targeted promotions, and adapt to seasonal trends. Elevate your business strategy with insights tailored for boosting sales and engaging customers effectively.

analysis analytics analytics-product data data-science jupyter medium-articles notebook-jupyter python

Last synced: 28 Apr 2026

https://github.com/parzibyte/cifrar-descifrar-php

Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios

crypto data decrypt encryption password php security

Last synced: 20 May 2026

https://github.com/elvis-not-presley-one/lostcassowary

LostCassowary is an Minecraft data miner that searches region files/.MCA files for data from the game, this one can search for banners, signs, biomes, blocks

data data-mining data-science dataminer minecraft nbt nbt-parser scraper

Last synced: 12 Apr 2025

https://github.com/nafisalawalidris/elfeenah

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and blockchain converge.

artificial-intelligence bitcoin blockchain config data data-science-portfolio data-science-projects datascience datascientist deep-learning github-config machinelearning

Last synced: 11 Sep 2025

https://github.com/themost-framework/memory

MOST Web Framework in-memory data adapter for testing environments

adapter data orm

Last synced: 01 Jul 2026

https://github.com/panda-official/driftcli

CLI Client for Drift Platform

cli click command-line data

Last synced: 17 Feb 2026

https://github.com/davidgamero/gatech-covid-data-scraper

Utility for scraping GATech Exposure Alert Information into a CSV file with automated case number extraction and aggregation

covid data gatech georgia scraper

Last synced: 31 Mar 2025

https://github.com/inzhenerka/scooters_data_uploader

Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/kingtous/bots_task_result

Result of the Barcelona OpenMP Tasks Suite (BOTS) using ompTG

data openmp

Last synced: 09 Jul 2025

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/viisix/corecat

Core repository of DanceCats project.

data lightweight python3

Last synced: 25 May 2026

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/bluecolor/lauda

Cross database data transfer tool

data database etl extract jdbc load

Last synced: 02 May 2026

https://github.com/emomaxd/flog

header-only logging library

c-plus-plus data files formatting logging stdout

Last synced: 20 Mar 2025