An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/sjg/my-search-story

My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI

data docker generative-ai google-cloud-platform google-cloud-run nodejs

Last synced: 04 May 2026

https://github.com/jigyasag18/movie-recommendation-system-project

This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.

data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp

Last synced: 28 May 2026

https://github.com/stefen-taime/llm-rag-mtl-public-hospital

Ce projet développe un modèle de type Retrieve-Augment-Generate (RAG) pour répondre aux questions en utilisant les données publiques des avis laissés sur Google pour des hôpitaux à Montréal

data google-reviews hopital hospital hub ia llm montreal open-source quebec rag

Last synced: 21 Apr 2026

https://github.com/rysteq/abstract-data-structures

This repository contains two programs written in C about the stack and queue ADT's

abstract-data-structures c data queue stack

Last synced: 06 Oct 2025

https://github.com/abdullahashfaqvirk/earth-engine-data-scraper

A Python based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research, and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 10 May 2026

https://github.com/infinitode/pyautoplot

PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.

analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python

Last synced: 16 Mar 2025

https://github.com/ryanve/i11

CSS named colors list

colors css data dataset

Last synced: 07 Oct 2025

https://github.com/shahsuvarli/election-voters-data-analysis-pandas

Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.

cleaning data grouping matplotlib numpy pandas python visualization

Last synced: 12 Apr 2026

https://github.com/sauravsrivastav/githubreposearcher

GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈

data data-export excel github-api python repository-searcher streamlit webapp

Last synced: 20 Jan 2026

https://github.com/cpietsch/breitband

developer repo of breitband-berlin

d3js data threejs visualization

Last synced: 02 May 2026

https://github.com/petzi53/repair

R Datasets of the Open Repair Alliance (ORA).

data r repair repair-cafe

Last synced: 19 May 2026

https://github.com/prajjwol09/sql_retail_analysis_project

This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.

data data-analysis datacleaning dataexploration pgadmin4 sql

Last synced: 15 Feb 2026

https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021

In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.

data data-analysis data-science data-visualization

Last synced: 23 Mar 2025

https://github.com/iankitnegi/tableautales

"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."

data data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/vck9521/traffic-accidents

In this project, we analyze the effects of various factors that correlate to traffic fatalities in the United States. Logistic regression is used, with the y variable being Fatality Rate (coded 0 for Survived, 1 for Fatality).

analysis data fatalities r regression rstudio traffic visualization

Last synced: 05 Jun 2026

https://github.com/richardlitt/bird-watching

My birdwatching list and repo

birding data ebird

Last synced: 26 Jan 2026

https://github.com/aiwithqasim/project_allocation_system

Project Allocation System (PAS) automates and simplifies the process of Allocating projects to students. Teachers can simply add details on prompting for input and perform a number of operation modules including Adding Projects, Updating Projects, Searching Projects , Deleting Projects and Display All Projects

algorithms-and-data-structures algorthims c-plus-plus data data-structures linked-list

Last synced: 08 Oct 2025

https://github.com/doppelgunner/baby

A program for storing data just for fun

data doppelgunner java note storing

Last synced: 12 Jun 2026

https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project

Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.

data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface

Last synced: 08 Oct 2025

https://github.com/slavos1/covid_data

Use Public Health England daily data to show daily cases.

covid covid-19 covid19 data pandas python3

Last synced: 14 Apr 2026

https://github.com/heitang/fcu-classid

逢甲大學:學院 ID 、 系所 ID 和班級 ID

data fcu project

Last synced: 30 Mar 2025

https://github.com/danieljdufour/fast-b64

Quickly Convert between B64 and Binary Strings

b64 base64 base64-decoding base64-encoding binary bits compression data

Last synced: 08 Oct 2025

https://github.com/andypicke/ev_station_explorer

Shiny App to visualize electric-vehicle charging station data

data electric-vehicles r shiny-apps visualization

Last synced: 29 Jul 2025

https://github.com/shubhamsoni98/classification-with-random-forest-1

To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.

algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization

Last synced: 18 Jan 2026

https://github.com/leevilaukka/alkometriikka

Tool to search Alko database and see some fun stats about different beverages

data gh-pages svelte typescript xlsx

Last synced: 18 May 2026

https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model

With this Python notebook algorithm, you can use SPSS Model notebook to build machine learning pipelines that you can use to iterate rapidly during the model building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research that's easily understood by any member of your team with Hypothesis definition.

anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test

Last synced: 22 Apr 2026

https://github.com/rbcavi/factorio-mod-data

The modpacke data for factorio-viewer

data factorio factorio-data factorio-mod-data

Last synced: 23 Apr 2026

https://github.com/0xHericles/SpamDetector

:email: A Simple Python Spam Detector with Scikit-Learn

data ham machine-learning python sklearn spam

Last synced: 24 Mar 2025

https://github.com/abhijeetdasbakshi/ecommerce-insights

A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce

airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn

Last synced: 04 Apr 2026

https://github.com/preritdas/covidactnow

A wrapper for the Covid Act Now database of live COVID-19 state-based statistics.

api covid covid-19 data python python3 science wrapper

Last synced: 09 Oct 2025

https://github.com/cracko298/planet-life-save-converter

Convert your Planet-Life Saves To and From Base64 & *.planet files.

base64 base64-decoding base64-encoding data python python-script python3 save-converter save-data save-files

Last synced: 15 Mar 2025

https://github.com/chompfoods/stub-scala-akka-http-server

Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server

Last synced: 15 Apr 2026

https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer

This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.

cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark

Last synced: 07 May 2026

https://github.com/s-babaeizadeh/next-mini-app

nextjs mini application

css data nextjs reactjs

Last synced: 11 Apr 2026

https://github.com/shadeglare/genum

The ES Next tools to process data in a LINQ manner

data linq processing typescript

Last synced: 13 Apr 2026

https://github.com/opdev1004/totjs

Not totally new but a file format for managing human readable data in a file. JS version.

data data-storage data-store database database-management hacktoberfest hactoberfest-accepted nodejs

Last synced: 31 Jan 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/dms-codes/scrape-kesaintblanc-id

Kesaintblanc Data Scraper This Python script is designed to scrape product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.

data kesaintblanc python webscraper

Last synced: 27 May 2026

https://github.com/aidan-zamfir/the-iliad

Data analysis & relationship network for the characters of Homers Iliad

data data-analysis dataframes networks networkx python selenium spacy webscraping

Last synced: 08 May 2026

https://github.com/purarue/HPI-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 30 Mar 2025

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/ashu3291/blinkit-app-store-

conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction and inventory distribution to improve the sales performance.

cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql

Last synced: 05 Jan 2026

https://github.com/ms140569/loki-example-store

Testdata for loki password manager

data

Last synced: 26 Feb 2026

https://github.com/mehmetkahya0/earthquake-tracker

Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.

ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web

Last synced: 15 Apr 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/randomfractals/unfolded-map-snippets

Html, CSS, JavaScript, and Python 🐍 vscode snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/q-aware-labs/bias-insights

Bias detection project for the Chicago Face Database (CFD)

ai chicago-data-portal data data-science llm statistical-analysis

Last synced: 21 Jan 2026

https://github.com/bishtrishu/netflix_movies_dashboard

This project is a comprehensive dashboard for analyzing Netflix movies and shows. Using a combination of Power BI, Python, and Excel, this dashboard provides insights into various aspects of Netflix's content library.

ai artifical-intelligense dashboard data dataanalysis dataanalyst dataanalytics datacleaning datahandling datascience datavisualization excel machine-learning msexcel powerbi report

Last synced: 09 Feb 2026

https://github.com/primetdmomega/webscraper

A data web scraper that looks for jobs on Glassdoor.com

data python web-scraper

Last synced: 25 Mar 2025

https://github.com/davidkhala/ai

GenAI index

data dify huggingface

Last synced: 27 Feb 2026

https://github.com/kena0ki/dddl

generates test Data from DDL.

data database db ddl generator sql table test

Last synced: 30 Apr 2026

https://github.com/softloud/spunk

Nutritional interventions for male infertility: a systematic review and meta-analysis

cochrane data evisynth living

Last synced: 18 Mar 2026

https://github.com/bilgehangecici/datatypeconverter

Converting integer and floating numbers to appropriate bit-level representation.

data datatypeconverter java machine-level variables

Last synced: 30 Mar 2025

https://github.com/bkestelman/dasy-ml

DaSy DataSynthesizer - Create synthetic data with desired statistical properties for machine learning research.

data data-science machine-learning

Last synced: 14 Jan 2026

https://github.com/fiedsch/data_util

misc. Utilities for data files like variable name lists

data helper management php

Last synced: 14 Jun 2025

https://github.com/writetome51/page-load-access

A TypeScript/Javascript class that loads a batch (array) of data from a larger set too big to be loaded all at once.

batch class data javascript load loader typescript

Last synced: 16 May 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/writetome51/public-data-container-interface

Just a TypeScript interface with 1 property: 'data'

container data interface typescript

Last synced: 15 May 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/miniql/miniql-csv

A MiniQL query resolver that loads data from CSV files.

comma-separated-values csv data query query-language

Last synced: 08 May 2026

https://github.com/project-renard/test-data

Files for testing

data

Last synced: 27 Feb 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/khalyomede/request

Function to validate request data for V.

data function request validate vlang

Last synced: 12 Feb 2026

https://github.com/diegoperea20/datos-secuenciales-con-ia

Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes

ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao

Last synced: 06 Feb 2026

https://github.com/bzekeria/quran_dataset

The Holy Quran (Islam) Dataset

data islam quran religion

Last synced: 12 Feb 2026

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/dug22/jjournal

A Jupyter like notebook software for Java

data data-analysis data-science java jshell jshell-repl notebook swing swing-application

Last synced: 11 Apr 2026

https://github.com/jeswr/blog

My personal blog

ai blog data semantics solid web

Last synced: 13 Feb 2026

https://github.com/vidushibhadana/covid19-data-exploration-using-sql

Deployed diverse SQL techniques to analyze COVID-19 data for an improved understanding of pandemic's regression.

data database database-management sql

Last synced: 19 Aug 2025

https://github.com/j0a0m4/olympics

Final Project for Data Engineering Accelerated LATAM

data olympics spark

Last synced: 13 Feb 2026

https://github.com/krishkumar/scrobbles

all the music 🎸

data music scrobble

Last synced: 13 Feb 2026