An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kevinsames/microsoft-fabric-data-platform-template

A GitHub starter repository for building modern Data Engineering, ML, and AI solutions on Microsoft Fabric. Includes medallion architecture (Bronze → Silver → Gold), Spark Notebooks, dbt, MLflow, GitHub Actions CI/CD, and arc42-based documentation.

data dbt fabric microsoft python spark

Last synced: 29 Apr 2026

https://github.com/divanny/academixbackend

🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.

backend csharp data net webapi

Last synced: 07 Jun 2026

https://github.com/henrylin03/china-gdp

Analysis and visualisation of China GDP data using Python.

data data-analysis data-visualisation dataset kaggle pandas

Last synced: 01 May 2026

https://github.com/liuliqiang/laueagle

YAML/JSON Lints and Converters

converter data formater json linter python serialization yaml

Last synced: 02 May 2026

https://github.com/walderlansena/datastructureinc

:battery: Algoritmos de Estrutura de Dados em C++

c cplusplus data fila list lista pilha stack struct structure structured-data

Last synced: 03 May 2026

https://github.com/shogunbanik18/budgetify

End-to-End Budget Analysis enables effective budgeting through detailed analysis and strategic planning

analysis data data-engineering data-exploration databricks databricks-notebooks etl etl-process python3

Last synced: 09 Jun 2026

https://github.com/perceptronv/miscellaneous

A huge variety of materials, mostly training data for AI. Not a lot of source code yet.

data gan machine-learning nlp text-generation

Last synced: 04 May 2026

https://github.com/kenmwaura1/nuvo-data-cleaning-functions

Collection of scripts and functions to clean and preprocess data using Nuvo SDK.

data nuvo react

Last synced: 04 May 2026

https://github.com/issacto/animmender

Deployed Web App

angularjs anime data

Last synced: 05 May 2026

https://github.com/bredalis/numpy

✨ Library to work with arrays ✨

arrays data matrix numpy numpy-arrays numpy-library python

Last synced: 06 May 2026

https://github.com/satur-io/estoraje

Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, high available, lightweight, scalable and gives a good performance.

data database distributed go golang key-value performance training

Last synced: 07 May 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/favarettorm/bd_universidade

BD_UNIVERSIDADE V01 - Banco de dados fictício de uma universidade para fins didáticos

data database dataset mariadb mariadb-database mariadb-mysql mysql mysql-database scripts sql university

Last synced: 08 May 2026

https://github.com/chompfoods/stub-jaxrs-resteasy

JAX-RS RESTEasy server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jax-rs-server nutrition raw recipe-api recipes resteasy server server-stub stub stub-server

Last synced: 08 May 2026

https://github.com/raynardj/r_notes

Learning notebooks of R

data docker guru99 jupyter learning r

Last synced: 09 May 2026

https://github.com/n0nag0n/flee-intercom

For those of you who like to keep your money after Intercom jacks up the prices year after year, but want to keep an export of your data.

again-and-again api data database export exporter flee high-prices intercom mysql php price run save saver year-over-year

Last synced: 09 May 2026

https://github.com/alechash/rndmzr

Randomizer is a random data generator.

data data-science random random-generation random-number-generators

Last synced: 10 Jun 2026

https://github.com/lmuffato/project-mysql-vocabulary-booster-trybe

Projeto mysql vocabulary booster - Projeto avaliativo da Trybe do Bloco 20: Funções SQL, Joins e Subqueries

back-end crud data database mysql mysqlworkbench query sql trybe-projects

Last synced: 10 May 2026

https://github.com/782e616c6d/covid-d.a

Academic project, using Apache Spark for ETL and Data Studio for data analysis.

academic analytics automation cluster covid-19 data database etl python spark sql

Last synced: 10 May 2026

https://github.com/carlossilva2/pybase

An easy to use Database using Python and JSON

data database json python3 storage

Last synced: 11 May 2026

https://github.com/masu-baumgartner/dbsync.net

A c# mysql model sync library

cshap data library mysql

Last synced: 13 May 2026

https://github.com/oneblack333/pizza_sales_analysis

The project involves transforming raw pizza sales data into actionable business intelligence through analysis and visualization. This enables pizza business owners to make data-driven decisions on inventory, staffing, and marketing, ultimately improving performance and profitability.

data data-structures data-visualization excel mysql powerbi

Last synced: 20 Jun 2026

https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo

An Easy-to-Use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com .

api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database

Last synced: 21 Jun 2026

https://github.com/matusf/glasgow_wifi

Script that plots wifi access points to map and labels them by their protection

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/williamwutq/bblock

Persistent checksummed blocks built on top of bstack's allocators

allocation binary block data data-structures database rust rust-crate rust-library serialization

Last synced: 25 Jun 2026

https://github.com/cracko298/planet-life-save-converter

Convert your Planet-Life Saves To and From Base64 & *.planet files.

base64 base64-decoding base64-encoding data python python-script python3 save-converter save-data save-files

Last synced: 15 Mar 2025

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/abirsaha111/ipl-2022-analysis

The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.

data dataana dataanalytics datavi matplotlib python

Last synced: 07 Jun 2026

https://github.com/purarue/HPI-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 30 Mar 2025

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/openwashdata/ugabore

Borehole repair data from central Uganda associated with a project report completed by Joseph Lwere for the “data science for openwashdata” course

analysis borehole data open-data r uganda wash water

Last synced: 17 Jan 2026

https://github.com/patrickdavies100/pipeline38

An application to automate the creation and execution of SQL queries.

data pandas-dataframe pipeline postgresql psycopg2 sqlalchemy

Last synced: 30 Apr 2026

https://github.com/ohspc89/better_call_jin

A repository containing mentoring materials for a Ph.D. student in Neuroscience

data matlab spss-statistics visualization visualization-tools wrangling-data

Last synced: 08 Oct 2025

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/muhamedlabs/muhamed_onedrive

Muhamed_OneDrive - це надійне і зручне хмарне сховище для файлів, розроблене для безпечного зберігання і легкого обміну даними.

data html5 onedrive programming style

Last synced: 04 Jan 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is a end-to-end encryption chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/danieljdufour/fast-b64

Quickly Convert between B64 and Binary Strings

b64 base64 base64-decoding base64-encoding binary bits compression data

Last synced: 08 Oct 2025

https://github.com/istinnew/cook-me-up

[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.

bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning

Last synced: 16 May 2026

https://github.com/rahul1582/bank-loan-classification

Classifying whether a person is taking personal loan or not using all the Classification Algorithms.

algorithm analysis classi data

Last synced: 08 Oct 2025

https://github.com/guilyx/airplane-booking

Simple airline ticket reservation program.

algorithms data linked-list

Last synced: 25 Jun 2025

https://github.com/fatihilhan42/olympics-data-analysis-with-python

I will examine the Data Analysis of the Olympics between 1896-2016, which we have done on Python.

data data-science dataanalysis datavisualization jupyter-notebook olympics python

Last synced: 30 Apr 2026

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/shubhamsoni98/classification-with-random-forest-1

To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.

algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization

Last synced: 18 Jan 2026

https://github.com/mnz1365/saving-record-time-text

date saving in text file with python

data python txt-files writefile

Last synced: 18 Jul 2025

https://github.com/leevilaukka/alkometriikka

Tool to search Alko database and see some fun stats about different beverages

data gh-pages svelte typescript xlsx

Last synced: 18 May 2026

https://github.com/wlgs/got-dialogues-data-stats

Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.

data game-of-thrones got r statistics stats

Last synced: 22 May 2026

https://github.com/flexiui-labs/flexi-grid

Flexi Grid is an advanced, lightweight, and customizable Angular 19 data grid component

angular data filter grid search select sort table

Last synced: 14 May 2026

https://github.com/sharoonjoseph321/insurance_fraud_detection

Fraud Detection using machine learning algorithm-KN Neighbors .Data exploration using Pyspark and matplotlib.

analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics

Last synced: 23 Mar 2025

https://github.com/cityofnewyork/nyco-wp-open-data-transients

Interface for saving Open Data endpoints as WordPress Transients. Maintained by @NYCOpportunity

civic-tech composer data nycopportunity open-data plugin transients wordpress

Last synced: 10 Apr 2026

https://github.com/benmizrahi/reactivejs

microservices event bus for async/sync communications

data microservices nodejs

Last synced: 01 May 2026

https://github.com/ttozatto/sparkify

Churn Prediction for music streaming app with PySpark

analysis churn data learning machine predictive pyspark science spark

Last synced: 16 Jan 2026

https://github.com/anarya22/e-commerce_analysis

E-Commerce_Analysis is a data analysis project performed on the Superstore_USA dataset. It explores various aspects of e-commerce performance, including sales trends, customer demographics, product categories, and regional performance. The analysis includes data cleaning, visualizations, and insights on factors influencing sales and profitability.

analysis analytics cleaning-data data

Last synced: 09 Oct 2025

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/cburmeister/disc-golf-courses

All the disc golf courses i've played at. Maintained with http://geojson.io/.

data geojson

Last synced: 21 Jan 2026

https://github.com/ahmed-naserelden/astro-success-analytics

This project analyzes key factors influencing success in the Space Race using data science techniques. It includes data collection, machine learning modeling, and insightful visualizations to predict mission outcomes.

data dataanalysis python

Last synced: 01 May 2026

https://github.com/team-hydrogen/2025-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 11 Apr 2025

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/nel-zi/zipco_foods

Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.

airflow apache-spark data dataengineering etl pyspark wsl

Last synced: 03 May 2026

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/gman-au/white-knight-neo4j

Neo4j implementation of White Knight data abstraction library

abstractions data datastore dotnet neo4j repository-pattern specification-pattern

Last synced: 20 Jan 2026

https://github.com/bilgehangecici/datatypeconverter

Converting integer and floating numbers to appropriate bit-level representation.

data datatypeconverter java machine-level variables

Last synced: 30 Mar 2025

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/theleopard65/isa-imitation

This repository contains a simple C++ implementation of a Von-Neumann architecture simulator. The program mimics the behavior of a basic computer architecture that uses a single memory space for both instructions and data. Users can load programs, execute them, and view the current state of the memory and registers.

32-bit 64-bit ac architecture c-plus-plus data executable explained implementation ir isa mar mdr memory pc registers simulation von-neumann x64 x86

Last synced: 18 Mar 2025

https://github.com/lucasnbsb/data-structures-and-algorithms

Studying data structures and algorithms, mostly on leetcode

algorithms data structures

Last synced: 29 Aug 2025

https://github.com/darshjasani/insurance-claim-analysis

This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.

analysis data kaggle sql

Last synced: 27 Aug 2025

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a java for the automatic crawling of model's information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/quetz-al/quetzal-openapi-client

Autogenerated Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 10 Oct 2025

https://github.com/rod-persky/sungrowdatacollector

Data collector for a SunGrow SG8.0RT Inverter

data opentelemetry sungrow

Last synced: 19 Jan 2026

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/redgoose-dev/baguni

이미지를 보관하고 탐색하는 웹 프로그램

data explorer file management upload

Last synced: 14 Apr 2026

https://github.com/theopenwebjp/theopenweb-data-loader

Package for loading data to local project

data downloader import javascript typings

Last synced: 10 Oct 2025

https://github.com/j-sephb-lt-n/joes_giant_toolbox

A large collection of general python functions and classes that I use in my daily work

ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping

Last synced: 10 Oct 2025

https://github.com/h4fide/politicalcompassbot

This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!

bot data greedy-algorithms politics python python3 sql telegram

Last synced: 19 Aug 2025

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/loggdme/kyro

Collection of utilities and examples for creating efficient data pipelines in go with parallel queues and, rate limitiers and much more.

data package

Last synced: 14 Jan 2026

https://github.com/chowington/bg-counter-tools

A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.

bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics

Last synced: 10 Oct 2025

https://github.com/ikcede/hinge-data-ts-wrapper

Typescript wrapper for exported Hinge data

data hinge typescript

Last synced: 10 Oct 2025

https://github.com/rijkvanzanten/ds-fa-1

The first final assignment for the data structures class

assignment data final map now parsons structures thenewschool

Last synced: 04 Oct 2025

https://github.com/seqeralabs/ffq-api

A minimal wrapper to make ffq searches available via a REST API.

api data fastq fetch-fastq ffq genomics

Last synced: 15 Aug 2025