An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/aikuyun/flinkx

Some modifications to flinkx

data flink

Last synced: 04 Apr 2025

https://github.com/cainmi/easy-pull-from-repository

A repository to pull code and files from. It may be used to store page data links, code, etc.; mainly used for Python for now.

data html javascript python schema

Last synced: 04 Apr 2025

https://github.com/jvrck/australianpayphones

Get Australian payphone data in GeoJSON format.

australia data geojson geojson-data scraper

Last synced: 04 Apr 2025

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/alhonaut/quant-assigment

Code for quantitative analysis of Morpho Markets and simulation of the reallocation process in MetaMorpho

analysis data defi quantitative-finance

Last synced: 16 May 2026

https://github.com/elvis-not-presley-one/lostcassowary

LostCassowary is a Minecraft data miner that searches region files (.MCA files) for game data; this one can search for banners, signs, biomes, and blocks.

data data-mining data-science dataminer minecraft nbt nbt-parser scraper

Last synced: 12 Apr 2025

https://github.com/stdlib-js/array-base-every

Test whether all elements in an array are truthy.

all array data every generic javascript node node-js nodejs stdlib structure test types validate

Last synced: 07 May 2025
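
The "all elements are truthy" test described in this entry can be sketched in a few lines. This is a minimal illustration in Python, not the stdlib-js implementation: the function name `every` is an assumption, and it follows Python's truthiness rules, which differ from JavaScript's (for example, an empty list is falsy in Python, while an empty array is truthy in JavaScript).

```python
from typing import Any, Sequence


def every(items: Sequence[Any]) -> bool:
    """Return True when every element of `items` is truthy (Python rules)."""
    for item in items:
        if not item:
            # Stop at the first falsy element; later elements are not inspected.
            return False
    # No falsy element was found (this includes the empty-sequence case).
    return True
```

In this sketch an empty sequence yields `True`, since no element fails the test.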

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/danieljdufour/fast-bin

Quickly Convert an Array of Numbers into their Minimal Binary Representations

array binarize binary bits data nbits numbers unbinarize

Last synced: 13 Apr 2025
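
One way to picture "minimal binary representations" is fixed-width bit packing: pick the smallest bit width that holds the largest value, then concatenate the values into bytes. The sketch below is written under that assumption and is not fast-bin's API; the names `min_bits`, `pack`, and `unpack` are hypothetical, and it only handles non-negative integers.

```python
from typing import List, Sequence


def min_bits(values: Sequence[int]) -> int:
    """Smallest bit width that can hold every value (at least 1)."""
    return max(1, max(values, default=0).bit_length())


def pack(values: Sequence[int], nbits: int) -> bytes:
    """Concatenate each value as an nbits-wide field, most significant bit first."""
    acc = 0
    for v in values:
        if v < 0 or v >= (1 << nbits):
            raise ValueError(f"{v} does not fit in {nbits} bits")
        acc = (acc << nbits) | v
    total_bits = nbits * len(values)
    # Zero bits appended so the stream ends on a byte boundary.
    pad = (-total_bits) % 8
    acc <<= pad
    return acc.to_bytes((total_bits + pad) // 8, "big")


def unpack(data: bytes, nbits: int, count: int) -> List[int]:
    """Recover `count` values of width `nbits` from bytes produced by `pack`."""
    acc = int.from_bytes(data, "big")
    total_bits = len(data) * 8
    mask = (1 << nbits) - 1
    out = []
    for i in range(count):
        # Field i starts nbits * i bits from the most significant end.
        shift = total_bits - nbits * (i + 1)
        out.append((acc >> shift) & mask)
    return out
```

The caller has to keep `nbits` and the element count alongside the packed bytes, because the padding bits are indistinguishable from data.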

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/divithraju/divith-raju-data-mining

This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.

algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark

Last synced: 06 Mar 2026

https://github.com/yasir13001/moonai_api

MoonAI is an API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.

ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python

Last synced: 20 Jun 2025

https://github.com/utkarshverma439/simple-sms-spam-detector

Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.

data data-science data-visualization spam-detection

Last synced: 20 Jun 2025

https://github.com/alireza29675/goudi

GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.

analysis data goudi visualization

Last synced: 11 Jul 2025

https://github.com/harmonydata/harmony_examples

Example Jupyter notebook and R scripts using Harmony in real research problems

data data-harmonisation data-harmonization harmonisation psychology python r research

Last synced: 11 Jul 2025

https://github.com/lunastev/reflectlm

ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.

ai data language-model llm model open-source ts web

Last synced: 22 Jun 2025

https://github.com/dennyglee/open-covid19-public

A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.

covid-19 data data-analytics data-engineering data-science nlp

Last synced: 22 Jun 2025

https://github.com/nia-cloud-official/datascript

DataScript: A Hypothetical Data Scripting Language. DataScript is designed to simplify data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.

data data-scripting scripting-language

Last synced: 22 Jun 2025

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/flownrecords/flightTracker

A mobile app built to record essential flight data for post-flight review and debriefing.

aviation data gps tracking

Last synced: 23 Jun 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/nichtich/wikidata-taxonomy-examples

Extract classifications from Wikidata

coli-conc data knowledge-organization wikidata

Last synced: 12 Jul 2025

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 13 Jul 2025

https://github.com/dineshpinto/geist-finance-subgraph

Subgraph for the Geist Finance protocol on the Fantom blockchain.

assemblyscript blockchain data fantom graphql typescript

Last synced: 17 May 2026

https://github.com/plurid/deserve

Own Your Data · Control The Code

data owner

Last synced: 16 Jul 2025

https://github.com/shuklayash02/complete_data_analysis_project

A full data analysis project in which sales data is taken through the ask, prepare, process, analyze, share, and act phases of the data analysis process.

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/clabe45/kaz

Minimalistic local storage cli

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google BigQuery, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively.

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in a Docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more.

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/prioritizr/prioritizrdata

Conservation planning data sets

data r spatial-data

Last synced: 19 Jul 2025

https://github.com/snegovoy98/data-storage

This is a test version of data storage

data of storage test version

Last synced: 19 Jul 2025

https://github.com/ate329/nsl-kdd-feature-extractor

Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.

cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset

Last synced: 30 Oct 2025

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/inzhenerka/scooters_data_uploader

Loading data into PostgreSQL as part of the dbt course from Инженерка.Тех

data dbt postgresql

Last synced: 04 May 2026

https://github.com/muhammad-fiaz/ason

ASON: Adaptive Structured Object Notation - Python library for dynamic data serialization, providing flexibility and simplicity.

adaptive-structure-object-notation api ason cli client data file file-format file-sharing file-upload json json-data json-parser open-source opensource parser parsing python python3

Last synced: 02 Feb 2026

https://github.com/0xleif/onionstash

Store Onions 🧅

data swift

Last synced: 05 Apr 2025

https://github.com/denko5/sales-analysis

A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.

africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql

Last synced: 24 Jan 2026

https://github.com/bacross/datamunger

Python package for handling NaNs and outliers

data data-frame datamunger knn nan outliers python scikit-learn

Last synced: 17 May 2026

https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard

An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊

dashboard data data-analysis data-science data-visualization tableau tableau-public

Last synced: 17 Feb 2026

https://github.com/marians/tour-tracker

Track the general classification development of the Tour de France, stage over stage

cycling data sports statistics

Last synced: 24 Jun 2025

https://github.com/shgysk8zer0/schema

A PHP implementation of schema.org structured data objects

data microdata schema seo structured-data

Last synced: 24 Jun 2025

https://github.com/panda-official/driftcli

CLI Client for Drift Platform

cli click command-line data

Last synced: 17 Feb 2026

https://github.com/giscience/measures-rest-sparql

A SPARQL endpoint for the Measures REST OSHDB App framework.

data osm quality semantics sparql sparql-endpoints

Last synced: 24 Jun 2025

https://github.com/stdlib-js/ndarray-base-dtype-resolve-str

Return the data type string associated with a supported ndarray data type value.

array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs stdlib types util utilities utility utils

Last synced: 06 Mar 2026

https://github.com/ibnz36/arrowpipe

Build complex pipelines easily

cargo crate data pipe rust

Last synced: 13 Apr 2025

https://github.com/dbrennand/rm-content

A Python 3.7 script to remove a specific string from all files and repos (owned by the user).

content data erase eraser privacy privacy-protection privacy-tools remove remover rm-content

Last synced: 29 Mar 2025

https://github.com/junkwaxdata/cardlists

Sports Card set lists in easily consumable JSON Format for databases, apps, websites, and more!

baseball baseball-cards baseball-data bowman data dataset datasets donruss fleer json json-schema panini topps upper-deck

Last synced: 13 Mar 2025

https://github.com/randomfractals/chicago-transport

Exploratory data analysis of public Chicago transportation datasets.

chicago data data-tools duckdb sql transportation

Last synced: 01 May 2026

https://github.com/amyflo/cs448b

Exploring r/LoveLetters

d3-visualization d3js data react reactjs visualization

Last synced: 18 May 2026

https://github.com/pythongiant/data-analytics-wolfram-alpha

A data analysis program using Wolfram Alpha

analytics api data wolfram-alpha

Last synced: 04 Apr 2025

https://github.com/sambacha/yearn-finance-data

data repo for proposed YIP-DATA

cryptocurrency data erc20 ethereum exchange yearn yip yyip

Last synced: 18 May 2026

https://github.com/bastianolea/censo_viviendas

Housing census (Censo de Viviendas) processed with R to make it available with the codes/names of communes and regions and with labels for its variables. Provided in the original format (6.5 million rows) and as counts per commune.

chile comunas data poblacion rural

Last synced: 30 Oct 2025

https://github.com/glassflow/pipelines-push-action

This GitHub Action lets you automate GlassFlow pipeline deployments as code

data data-processing datastreaming deployment github-actions glassflow python real-time stream-processing

Last synced: 19 May 2026

https://github.com/diddypod/crop-data-comparer

A Python script to compare crop data over years

comparison crop data openpyxl python

Last synced: 28 Jun 2026

https://github.com/stdlib-js/ndarray-base-reverse-dimension

Return a view of an input ndarray in which the order of elements along a specified dimension is reversed.

base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure types vector view

Last synced: 07 Mar 2026
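
Reversing a dimension as a view, rather than a copy, is commonly done by negating that dimension's stride and moving the start offset to the last element along it. The following is a minimal sketch of that idea in Python, not the stdlib-js implementation; the `StridedView` class and `reverse_dimension` function are hypothetical names, and no bounds checking is performed.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class StridedView:
    buffer: List[float]        # shared flat storage, never copied
    shape: Tuple[int, ...]     # number of elements along each dimension
    strides: Tuple[int, ...]   # step in buffer elements per index along each dimension
    offset: int = 0            # buffer position of the element at index (0, ..., 0)

    def get(self, *index: int) -> float:
        """Read one element by translating an index into a buffer position."""
        pos = self.offset + sum(i * s for i, s in zip(index, self.strides))
        return self.buffer[pos]


def reverse_dimension(view: StridedView, dim: int) -> StridedView:
    """Return a new view over the same buffer with dimension `dim` reversed."""
    strides = list(view.strides)
    # Start from the last element along `dim` and walk backwards.
    offset = view.offset + (view.shape[dim] - 1) * strides[dim]
    strides[dim] = -strides[dim]
    return StridedView(view.buffer, view.shape, tuple(strides), offset)
```

Because both views share one buffer, the operation costs the same regardless of array size, and a write to the buffer is visible through either view.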

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 29 Jun 2026

https://github.com/gcoronelc/ucv_gdi-1_202302-b2

Data and Information Management I workshop with Gustavo Coronel.

data data-science data-structures database databases online oracle query relational-databases security sql sql-server

Last synced: 19 May 2026

https://github.com/viveknathani/maketest

A command line tool to generate test data. 📊

command-line data golang testing-tools

Last synced: 08 Jun 2026

https://github.com/coral/ddp

Distributed Display Protocol (DDP) in Go

data ddp distributed golang led pixel protocol wled

Last synced: 26 Jun 2025

https://github.com/lmuffato/project-mysql-one-for-all-trybe

MySQL One For All project - graded Trybe project for Block 21: Database Normalization and Modeling

back-end data database database-modeling mysql mysqlworkbench query sql trybe-projects

Last synced: 08 May 2026

https://github.com/marxmit7/kaggle

Kaggle competitions

data kaggle kaggle-competition

Last synced: 19 May 2026

https://github.com/frequentlymisseddeadlines/chessfessor

Command line tool to extract game data from Lichess.org and Chess.com

chess data extract lichess pgn

Last synced: 19 May 2026

https://github.com/jbdesbas/custom-scripts

Custom SQL functions or scripts

data database sql

Last synced: 28 Jun 2026

https://github.com/cliffano/volothamp

Random D&D stuffs my son and I dabble with

data dungeons-and-dragons info little-godzilla

Last synced: 06 Apr 2025