An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/williamwutq/bllist

Durable, crash-safe, checksummed block-based linked list allocators stored in a single file

data data-storage data-structure database file-based linkedlist

Last synced: 25 Jun 2026

https://github.com/williamwutq/bblock

Persistent checksummed blocks built on top of bstack's allocators

allocation binary block data data-structures database rust rust-crate rust-library serialization

Last synced: 25 Jun 2026

https://github.com/diddypod/crop-data-converter

A Python script to convert crop data from .txt to .xlsx format

converter crop data openpyxl python

Last synced: 29 Jun 2026

https://github.com/anuveyatsu/cloudflare-data-fabric

Cloudflare Data Fabric: Use Cloudflare's global infrastructure to build a flexible, resilient framework for data solutions.

cloudflare data data-lake fabric lakehouse mesh

Last synced: 29 Jun 2026

https://github.com/codeforafrica/ckanext-followy

[ARCHIVED] A CKAN extension to show the datasets a user is following.

ckan ckan-extension ckanext-followy data dataset followy-extension open-data

Last synced: 29 Jun 2026

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/stdlib-js/array-base-to-reversed

Return a new array with elements in reverse order.

array data generic javascript node node-js nodejs rev reverse stdlib structure swap types

Last synced: 11 Apr 2025

https://github.com/aikuyun/flinkx

flinkx 一些修改

data flink

Last synced: 04 Apr 2025

https://github.com/cainmi/easy-pull-from-repository

A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now

data html javascript python schema

Last synced: 04 Apr 2025

https://github.com/erictleung/2017-new-coder-survey

:beginner: Code to help clean and format the 2017 New Coder Survey by freeCodeCamp

coder-survey data data-cleaning dplyr freecodecamp

Last synced: 03 Apr 2025

https://github.com/finnspartronics/orpheus

A took for looking at FRC (First Robotics Competition) scouting data

data first-robotics-competition scouting scouting-data spartronics

Last synced: 28 Mar 2025

https://github.com/mmaithani/singapore-residents-data-eda

The data contains Population by ethnicity, age and gender for the country of Singapore from the year 1957 to 2018

data data-visualization ethnicity kaggle-dataset python singapore singapore-residents-data

Last synced: 16 Apr 2026

https://github.com/margostino/job-pulse

PoC to analyse the hiring market

data golang mongodb visualization

Last synced: 16 May 2026

https://github.com/jvrck/australianpayphones

Get Australian payphone data in GeoJSON format.

australia data geojson geojson-data scraper

Last synced: 04 Apr 2025

https://github.com/stdlib-js/ndarray-slice-assign

Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.

assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view

Last synced: 11 Apr 2025

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/alhonaut/quant-assigment

Code for quant analyz Morpho Markets and simulation reallocation process in MetaMorpho

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/rustytake-off/datasets

Various datasets for 🤗 HuggingFace

data datasets docs huggingface

Last synced: 27 Mar 2025

https://github.com/kockarevicivan/dot-net-snippets

Set of .NET code snippets: algorithms, data structures, graph searches etc, created for demonstration purposes.

algorithms binary c-sharp data generics graphs-pathfinding list structures

Last synced: 27 Mar 2025

https://github.com/chrisru/f1stats

🗄️ Speedy API for Formula 1 statistics

api data fast formula1

Last synced: 20 Mar 2025

https://github.com/parzibyte/cifrar-descifrar-php

Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios

crypto data decrypt encryption password php security

Last synced: 20 May 2026

https://github.com/elvis-not-presley-one/lostcassowary

LostCassowary is an Minecraft data miner that searches region files/.MCA files for data from the game, this one can search for banners, signs, biomes, blocks

data data-mining data-science dataminer minecraft nbt nbt-parser scraper

Last synced: 12 Apr 2025

https://github.com/stdlib-js/array-base-every

Test whether all elements in an array are truthy.

all array data every generic javascript node node-js nodejs stdlib structure test types validate

Last synced: 07 May 2025

https://github.com/emomaxd/flog

header-only logging library

c-plus-plus data files formatting logging stdout

Last synced: 20 Mar 2025

https://github.com/codedotjs/indiaartfairy

:beetle: Data & More - India Art Fair • 2018 - 2024

csv data python scraper

Last synced: 16 Jun 2025

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/camilajaviera91/dbt-transformations-sql-mock-data

This repository contains the transformations and documentation for the data model generated in sql-mock-data.

data dbt postgresql sql

Last synced: 02 Feb 2026

https://github.com/yogaprasadk/dbms_course_a_to_z

it is a repository for complete lecture of Database Management Systems taught by riti kumari

acidproperties btree data database dbms filesystem normalform normalization sql

Last synced: 20 Mar 2025

https://github.com/FAIMS/OpenDataPresentation

Brian Ballsun-Stanton's presentation

context data presentation

Last synced: 03 Apr 2025

https://github.com/lagden/injection

Inject data into file

data file inject nodejs

Last synced: 24 Apr 2026

https://github.com/eugenedakin/steganography-pictures

Add and remove a picture-in-a-picture with steganography

compare data steganography steganography-tools xojo

Last synced: 12 Feb 2026

https://github.com/danieljdufour/fast-bin

Quickly Convert an Array of Numbers into their Minimal Binary Representations

array binarize binary bits data nbits numbers unbinarize

Last synced: 13 Apr 2025

https://github.com/gbburleigh/quick-seeders

Generate realistic test data quickly with Quick-Seeders, a Python library offering a wide range of data types and schema definitions. Control data variance, probabilities, and output formats, including SQL. Simplify your data seeding process and improve testing efficiency.

data dataset faker generator python seeder sql test

Last synced: 03 Apr 2025

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/lisakey/convert-csv-to-sav

We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 08 May 2026

https://github.com/divithraju/divith-raju-data-mining

This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.

algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark

Last synced: 06 Mar 2026

https://github.com/yasir13001/moonai_api

This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.

ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python

Last synced: 20 Jun 2025

https://github.com/utkarshverma439/simple-sms-spam-detector

Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.

data data-science data-visualization spam-detection

Last synced: 20 Jun 2025

https://github.com/alireza29675/goudi

GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.

analysis data goudi visualization

Last synced: 11 Jul 2025

https://github.com/harmonydata/harmony_examples

Example Jupyter notebook and R scripts using Harmony in real research problems

data data-harmonisation data-harmonization harmonisation psychology python r research

Last synced: 11 Jul 2025

https://github.com/lunastev/reflectlm

ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.

ai data language-model llm model open-source ts web

Last synced: 22 Jun 2025

https://github.com/dennyglee/open-covid19-public

A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.

covid-19 data data-analytics data-engineering data-science nlp

Last synced: 22 Jun 2025

https://github.com/nia-cloud-official/datascript

DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.

data data-scripting scripting-language

Last synced: 22 Jun 2025

https://github.com/pbinkley/tweets-libraries-covid19

A twarc harvest of tweets related to libraries during the COVID-19 outbreak, starting 2020-03-02

data social

Last synced: 06 Mar 2026

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/flownrecords/flightTracker

A mobile app built to record essential flight data for post-flight review and debriefing.

aviation data gps tracking

Last synced: 23 Jun 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/nichtich/wikidata-taxonomy-examples

Extract classifications from Wikidata

coli-conc data knowledge-organization wikidata

Last synced: 12 Jul 2025

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 13 Jul 2025

https://github.com/dineshpinto/geist-finance-subgraph

Subgraph for the Geist Finance protocol on the Fantom blockchain.

assemblyscript blockchain data fantom graphql typescript

Last synced: 17 May 2026

https://github.com/plurid/deserve

Own Your Data · Control The Code

data owner

Last synced: 16 Jul 2025

https://github.com/shuklayash02/complete_data_analysis_project

A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/clabe45/kaz

Minimalistic local storage cli

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/luminati-io/crunchbase-dataset-samples

A sample of 1001 Crunchbase companies with key data points, extracted using the Bright Data API.

crunchbase crunchbase-api crunchbase-scraper data database datasets webscraper-api webscraping

Last synced: 17 Mar 2025

https://github.com/prioritizr/prioritizrdata

Conservation planning data sets

data r spatial-data

Last synced: 19 Jul 2025

https://github.com/snegovoy98/data-storage

This is test version of data storage

data of storage test version

Last synced: 19 Jul 2025

https://github.com/abhaysingh71/india-censes-data-analysis

This repo is a india censes data analysis in many domains

data data-science data-visualization dataanalysis streamlit

Last synced: 15 May 2026

https://github.com/ate329/nsl-kdd-feature-extractor

Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.

cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset

Last synced: 30 Oct 2025

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/mundra-ankur/msw_ai_pipeline

Municipal solid waste (MSW) characterization, AI and Data pipeline to charcterize solid waste in real time into diffrent buckets using Yolo

artificial-intelligence data datapipeline solid-waste-segregation yolo

Last synced: 11 Apr 2025

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/inzhenerka/scooters_data_uploader

Загрузка данных в PostgreSQL в рамках курса по dbt от Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/erinaldi/bmn2-lattice

Data analysis of lattice Monte Carlo simulations of quantum matrix models.

data data-science data-visualisation lattice

Last synced: 27 Mar 2025

https://github.com/LisaKey/convert-csv-to-sav

We used python 🐍 to convert a csv file into a sav file with all the modifications needed to open it in IBM spss and be able to analyse our data.

analysis chardet convert csv data databases ibm os pandas pyreadstat python sav spss sys transformations

Last synced: 03 Mar 2025

https://github.com/sergkash7/fdc-facade

Facade for The FoodData Central API.

api center data food usda

Last synced: 15 May 2026

https://github.com/muhammad-fiaz/ason

ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.

adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3

Last synced: 02 Feb 2026

https://github.com/derstimmler/aokexporter

Exporter for data from the statutory health insurance company AOK

aok cocona console csharp data dotnet export polly

Last synced: 15 May 2026