An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/jeroenvdmeer/feyod

Open data set of matches played by Feyenoord.

data dataset feyenoord open-data sql

Last synced: 25 Jan 2026

https://github.com/shysolocup/aepl

A Node.JS multi-layered class creation package with built-in parenting systems that let you get info from classes above as well as better function and property makers for easier to read and understand development and modding support inspired by Roblox's Studio API.

aepl backend classes data framework game-development game-framework javascript js js-class js-framework lightweight nodejs package

Last synced: 28 Oct 2025

https://github.com/hariprashad-ravikumar/ai-datascience-lab

AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.

ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn

Last synced: 02 Aug 2025

https://github.com/karlinarayberinger/karbytes_23andme_ancestry_data

This repository contains web page source code and media files which constitute (some of) the intellectual property encapsulated by the website named Karbytes For Life Blog dot WordPress dot Com.

2023 23andme ancestry blog data dna genome karbytes website wordpress

Last synced: 12 May 2025

https://github.com/jasondrawdy/compendio

Collection of common and noteworthy extension methods, security tools, and filesystem functions generally found in most applications; focusing on extensibility and portability.

compendium converters cryptography data extensions generators hashing library security utilities validation windows

Last synced: 18 May 2026

https://github.com/polina-prokofieva/viewjson

The class for convenient visualization of json with some settings.

data data-visualization es5 es6 javascript json

Last synced: 15 May 2026

https://github.com/nazar-pc/fixed-size-multiplexer

A tiny library for multiplexing data chunks into blocks of fixed size and vice versa

chunk data demultiplex demux fixed multiplex mux size

Last synced: 31 Oct 2025

https://github.com/marcosvidolin/firestore-bulk-loader

A simple tool to load data to Cloud Firestore.🔥

bulk-loader cloud data database firebase firestore import load loader tools

Last synced: 23 Jun 2025

https://github.com/woctezuma/download-steam-banners-data

Data consisting of Steam banners.

data steam steam-api

Last synced: 06 Jan 2026

https://github.com/longzheng/southeastwater-usage-scraper

Extract hourly water usage data from South East Water portal website for digital water meters

australia data iot playwright southeastwater victoria water

Last synced: 06 Feb 2026

https://github.com/coatless-rpkg/ucimlrepo

An unofficial R port of the Python package to download data off of the UCI ML repository

data data-science machine-learning rstats statistics uci-machine-learning uci-machine-learning-repository web-api

Last synced: 28 Jun 2025

https://github.com/sharmadhiraj/free-json-datasets

Collection of free JSON data that are scraped and parsed from different websites.

collection crawler data data-scraping datasets json sports statistics web-scraping

Last synced: 28 Mar 2025

https://github.com/alexandregazagnes/scikit-res

Very Basic package to store results of ML models Grid search results are hard to exploit. This package aims to store them in a more convenient way.

data machine-learning mlops mlops-workflow results scikit-learn

Last synced: 20 Jan 2026

https://github.com/petermeissner/statsgrokse

R-API-binding to stats.grok.se server providing Wikipedia page view statistics for 2008 up to 2015

api binding data pageviews r wikipedia

Last synced: 17 May 2026

https://github.com/mzazakeith/puppetmaster

Puppeteer & Crawl4AI microservice for web automation, scraping, and AI processing with Bull queues

agent ai automation bull bullmq chrome crawl4ai crawler data data-extraction extraction gemini llm llms openai playwright puppeteer web-automation

Last synced: 13 May 2025

https://github.com/heikomuller/histore

Library for maintaining snapshots of evolving tabular data sets

data version-control

Last synced: 10 Apr 2025

https://github.com/alexandregazagnes/ghisa

ghisa - Github Import Statistic Analyzer is a free and open-source software, app and python package that helps you to analyze the import statistics of your github repositories.

analytics data dependencies git github github-api import package pypi python skills tool

Last synced: 27 Jun 2025

https://github.com/geo-c/oct-ckan

The Open City Toolkit (more information about the project: http://geo-c.eu)

cities collaboration data open participation transparency

Last synced: 16 May 2026

https://github.com/d-ganchar/thedus

Thedus is a lightweight migration tool for Clickhouse

cli clickhouse data database migration migrations python

Last synced: 12 Apr 2025

https://github.com/jvrck/australianpayphones

Get Australian payphone data in GeoJSON format.

australia data geojson geojson-data scraper

Last synced: 04 Apr 2025

https://github.com/cainmi/easy-pull-from-repository

A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now

data html javascript python schema

Last synced: 04 Apr 2025

https://github.com/danieljdufour/fast-bin

Quickly Convert an Array of Numbers into their Minimal Binary Representations

array binarize binary bits data nbits numbers unbinarize

Last synced: 13 Apr 2025

https://github.com/aikuyun/flinkx

flinkx 一些修改

data flink

Last synced: 04 Apr 2025

https://github.com/stdlib-js/array-base-to-reversed

Return a new array with elements in reverse order.

array data generic javascript node node-js nodejs rev reverse stdlib structure swap types

Last synced: 11 Apr 2025

https://github.com/benji-lewis/archivord

An archival bot for Discord servers designed to retain as much data as possible to show future generations how we communicated.

archive data data-mining discord discord-bot typescript

Last synced: 16 May 2026

https://github.com/divithraju/divith-raju-data-mining

This project focuses on customer segmentation using data mining techniques, specifically K-Means clustering, to classify customers into distinct groups based on their purchasing behaviors. The goal is to analyze customer data and segment them into clusters for targeted marketing strategies and better customer relationship management.

algorthims analytics apache business client connector data dataarchitecture database dataengineering datamining datascience hadoop k-means-clustering mysql project project-repository pyspark python3 spark

Last synced: 06 Mar 2026

https://github.com/yasir13001/moonai_api

This MoonAI API service built with FastAPI that calculates and provides detailed Moon and Sun astronomical data based on user input such as date, latitude, longitude, elevation, and timezone.

ai almanac api astro-ai astronomy data data-science fastapi fastapi-api gemini groq-api hilal-detection html islamic-calenda llama llm-integration moon python

Last synced: 20 Jun 2025

https://github.com/utkarshverma439/simple-sms-spam-detector

Built a Python text classification model for spam detection in SMS. Explored data, preprocessed text, utilized TF-IDF, trained a classifier, and addressed visualization challenges, yielding practical insights.

data data-science data-visualization spam-detection

Last synced: 20 Jun 2025

https://github.com/alireza29675/goudi

GOUDI is a multi-layer data visualization application, inspired by mind maps and some other thinking and describing methods.

analysis data goudi visualization

Last synced: 11 Jul 2025

https://github.com/harmonydata/harmony_examples

Example Jupyter notebook and R scripts using Harmony in real research problems

data data-harmonisation data-harmonization harmonisation psychology python r research

Last synced: 11 Jul 2025

https://github.com/lunastev/reflectlm

ReflectLM is a self-reflective, language-structure-only AI model that learns exclusively through interaction. It starts with zero factual knowledge but can engage in dialogue, evaluate its own responses, and remember conversations for future learning.

ai data language-model llm model open-source ts web

Last synced: 22 Jun 2025

https://github.com/dennyglee/open-covid19-public

A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.

covid-19 data data-analytics data-engineering data-science nlp

Last synced: 22 Jun 2025

https://github.com/nia-cloud-official/datascript

DataScript: A Hypothetical Data Scripting Language, DataScript is designed for simplifying data manipulation and analysis tasks. It serves as a scripting language tailored specifically for handling various data operations efficiently.

data data-scripting scripting-language

Last synced: 22 Jun 2025

https://github.com/erictleung/2017-new-coder-survey

:beginner: Code to help clean and format the 2017 New Coder Survey by freeCodeCamp

coder-survey data data-cleaning dplyr freecodecamp

Last synced: 03 Apr 2025

https://github.com/finnspartronics/orpheus

A took for looking at FRC (First Robotics Competition) scouting data

data first-robotics-competition scouting scouting-data spartronics

Last synced: 28 Mar 2025

https://github.com/evoluteur/madeleinology

Playing with data science by taking a look at the proportions of flour, sugar, butter, and eggs in 147 Madeleine recipes (the traditional French sponge cake).

baking cake cooking cooking-recipes data data-science data-visualization dessert exploratory-analysis exploratory-data-analysis exploratory-data-visualizations food histogram longtail madeleine recipe visualization

Last synced: 23 Jun 2025

https://github.com/flownrecords/flightTracker

A mobile app built to record essential flight data for post-flight review and debriefing.

aviation data gps tracking

Last synced: 23 Jun 2025

https://github.com/elazar/pycopyql

Exports a subset of data from a relational database.

data database export relational tool utility

Last synced: 16 May 2026

https://github.com/nichtich/wikidata-taxonomy-examples

Extract classifications from Wikidata

coli-conc data knowledge-organization wikidata

Last synced: 12 Jul 2025

https://github.com/mmaithani/singapore-residents-data-eda

The data contains Population by ethnicity, age and gender for the country of Singapore from the year 1957 to 2018

data data-visualization ethnicity kaggle-dataset python singapore singapore-residents-data

Last synced: 16 Apr 2026

https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing

Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.

data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates

Last synced: 13 Jul 2025

https://github.com/margostino/job-pulse

PoC to analyse the hiring market

data golang mongodb visualization

Last synced: 16 May 2026

https://github.com/stdlib-js/ndarray-slice-assign

Assign element values from a broadcasted input ndarray to corresponding elements in an output ndarray view.

assign assignment copy data javascript matrix ndarray node node-js nodejs set setitem slice stdlib structure types vector view

Last synced: 11 Apr 2025

https://github.com/stonecharioteer/renfield

Synchronize and Search through Hard Drives

catalogue data search storage synchronization

Last synced: 09 Feb 2026

https://github.com/dineshpinto/geist-finance-subgraph

Subgraph for the Geist Finance protocol on the Fantom blockchain.

assemblyscript blockchain data fantom graphql typescript

Last synced: 17 May 2026

https://github.com/plurid/deserve

Own Your Data · Control The Code

data owner

Last synced: 16 Jul 2025

https://github.com/shuklayash02/complete_data_analysis_project

A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/ouverz/governed_arr

A dbt project showing end-to-end ARR definitions, compute, transformation, validation and governance

arr data data-modeling dbt-core governance semantic-layer snowflake

Last synced: 02 Jul 2026

https://github.com/grycap/cdmi-client-go

A basic Go library to perform CDMI core operations

cdmi cloud data go

Last synced: 02 Jul 2026

https://github.com/clabe45/kaz

Minimalistic local storage cli

cli data minimalistic storage utility

Last synced: 17 Jul 2025

https://github.com/mustika-putri-m/-tableu-laporan-data-karyawan-growian

I am currently pursuing a data analysis certification at GROWIA, where I've learned to use tools such as Python, SQL, Google Big Query, Google Data Studio, Advanced Microsoft Excel, and Tableau. This course has enhanced my ability to analyze data using KPIs and business metrics, enabling me to solve business problems more effectively

data data-visualization tableau

Last synced: 17 Feb 2026

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/desilinguist/hanukkah-of-data-2022

My solutions to Hanukkah of Data 2022

2022 data hanukkah pandas python

Last synced: 17 May 2026

https://github.com/am-i-groot/summer-intern-iitguwahati-spml

Developed an automated Water Quality Monitoring System (WQMS) at IIT Guwahati, using the pH-W218 sensor and K-Means Clustering to assess water potability. The project enhances water quality evaluation through machine learning-based classification.

algorithm data data-visualization kmeans-clustering machine-learning python report sensor signal-processing

Last synced: 17 May 2026

https://github.com/bytraembedded/Laptop-Price-Prediction-with-Machine-Learning

The Laptop Price Prediction with Machine Learning project provides a system to predict the price of laptops based on various features such as processor type, RAM size, storage capacity, and more/

airflow data data-science data-visualization fastapi heroku-deployment machine-learning-algorithms matplotlib-pyplot numpy pandas python reactjs seaborn

Last synced: 30 Dec 2025

https://github.com/rustytake-off/datasets

Various datasets for 🤗 HuggingFace

data datasets docs huggingface

Last synced: 27 Mar 2025

https://github.com/patrikmasiar/algorythm-of-the-night

Awesome list of algorithms that help you 🚀 Feel free to contribute 👨🏻‍💻

algorithms data interview-questions logic logic-programming math mathematics science

Last synced: 02 Jul 2026

https://github.com/kockarevicivan/dot-net-snippets

Set of .NET code snippets: algorithms, data structures, graph searches etc, created for demonstration purposes.

algorithms binary c-sharp data generics graphs-pathfinding list structures

Last synced: 27 Mar 2025

https://github.com/chrisru/f1stats

🗄️ Speedy API for Formula 1 statistics

api data fast formula1

Last synced: 20 Mar 2025

https://github.com/parzibyte/cifrar-descifrar-php

Cifrar y descifrar datos con PHP usando la librería php-encryption; cifrar con clave general o con claves generadas por contraseñas de usuarios

crypto data decrypt encryption password php security

Last synced: 20 May 2026

https://github.com/fritzrehde/asciibar

A cli tool to print percentages as ascii bar charts

cli data percentage visualization

Last synced: 02 Jul 2026

https://github.com/prioritizr/prioritizrdata

Conservation planning data sets

data r spatial-data

Last synced: 19 Jul 2025

https://github.com/snegovoy98/data-storage

This is test version of data storage

data of storage test version

Last synced: 19 Jul 2025

https://github.com/emomaxd/flog

header-only logging library

c-plus-plus data files formatting logging stdout

Last synced: 20 Mar 2025

https://github.com/timxor/bitcoind-data-ingestion

crypto payments bitcoind data ingestion

bitcoind data ingestion

Last synced: 02 Jul 2026

https://github.com/codedotjs/indiaartfairy

:beetle: Data & More - India Art Fair • 2018 - 2024

csv data python scraper

Last synced: 16 Jun 2025

https://github.com/ate329/nsl-kdd-feature-extractor

Python-based tool designed to process network traffic packets and extract features compliant with the NSL-KDD dataset format.

cyber-security cybersecurity data data-science extractor feature-extraction machine-learning network-analysis nsl-kdd nsl-kdd-dataset

Last synced: 30 Oct 2025

https://github.com/camilajaviera91/dbt-transformations-sql-mock-data

This repository contains the transformations and documentation for the data model generated in sql-mock-data.

data dbt postgresql sql

Last synced: 02 Feb 2026

https://github.com/DataHerb/dataherb-flora

DataHerb Flora: The core of DataHerb

data data-mining data-science datascience dataset datasets

Last synced: 08 May 2025

https://github.com/cont-limno/lagosus-reservoir

Data module classifying lakes as natural lakes or reservoirs in the conterminous U.S.

data module

Last synced: 17 Jan 2026

https://github.com/yogaprasadk/dbms_course_a_to_z

it is a repository for complete lecture of Database Management Systems taught by riti kumari

acidproperties btree data database dbms filesystem normalform normalization sql

Last synced: 20 Mar 2025

https://github.com/fjc0k/vue-merge-data

Intelligently merge data for Vue render functions.

data merge-data render-functions vue

Last synced: 17 May 2026

https://github.com/mikebairdrocks/fluky

[floo-kee]: obtained by chance rather than skill.

data framework mock netcore netstandard nuget random vscode

Last synced: 17 May 2026

https://github.com/FAIMS/OpenDataPresentation

Brian Ballsun-Stanton's presentation

context data presentation

Last synced: 03 Apr 2025