An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/dantetrb/diabetes-readmission-dbt

Predictive analytics on diabetic patient readmissions using dbt, DuckDB and Python – with explainability and clustering.

clustering data dataengineering dbt diabetes duckdb hdbscan healthcare jupyter lime readmission-prediction sql

Last synced: 01 May 2026

https://github.com/dnut/associations

Python 3 library to identify high-dimensional statistical relationships in any data set.

analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules

Last synced: 01 May 2026

https://github.com/skygenesisenterprise/aether-meet

Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications data docker javascript meeting nextjs notes typescript voip

Last synced: 01 May 2026

https://github.com/ashita-ai/ashita-ai.github.io

Ashita AI - The island of misfit data tools

ai data

Last synced: 19 Feb 2026

https://github.com/hlan22/2025-03-18-data-validation

(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:

data validation

Last synced: 23 Jun 2026

https://github.com/lurenss/healthypandas

A library that takes row output from the export of the Iphone Health app and produce pandas dataframes.

data health ios pandas

Last synced: 02 May 2026

https://github.com/hafs96/prediction_consommation-de-carburant

Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/ahmad-ali-rafique/heart-disease-detection-model

A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.

artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model

Last synced: 11 Aug 2025

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/joelgombin/intro_r_iau

Introduction à R #WeData

data data-science dataviz gis r

Last synced: 04 May 2026

https://github.com/soham7998/data-analysis-projects

My Data Analysis Projects which are completed by me and gain a hands on Experience from each project. the project showcase different Concepts , Visualization and many things.

data data-analysis data-science machine-learning nlp python soham visualization

Last synced: 04 May 2026

https://github.com/dimitryzub/russo-ukraine-war-prediction-losses

Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱‍👤

data dataanalysis dataanalytics matplotlib pandas prophet python

Last synced: 04 May 2026

https://github.com/bhar2254/sobershift

Simply attendance tracking application

data form ifc jambi java qt tracking utility

Last synced: 05 May 2026

https://github.com/edjoukou/pizza-sales-report

A data analysis project using SQL with MySQL database

analysis data mysql powerbi visualization

Last synced: 05 May 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/fabsdevx/files-to-database-loader-handout

Data Engineering project for learning purposes. Credits to itversity

csv data data-engineering database json pandas python

Last synced: 09 Apr 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/ksm26/ml-ai-data-science-jobs-in-canada

Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.

ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics

Last synced: 06 May 2026

https://github.com/parthds02/analyzing-student-success-with-data

Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.

data datascience jupyter-notebook kaggle python pythonlibraries

Last synced: 06 May 2026

https://github.com/tadiusfrank2001/pythonprojects

Compilation of Some Fun Introduction to Python Lab Coding Projects introducing the foundamentals of data science, databases, and pythonlibraries

data data-science databases gamedesign python pythonlibrarires sorting-algorithms sqlite string-manipulation

Last synced: 06 May 2026

https://github.com/ralzz/dibimbing_datascience

This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.

data data-science eda google-colab kaggle pandas python

Last synced: 06 May 2026

https://github.com/lab5e/loadabledata

Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.

async data loadable state typescript ui

Last synced: 07 May 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/safwan2003/randomforest_heart_disease_prediction

A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.

binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit

Last synced: 07 May 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/zsvoboda/olympics

Self service analytics of 120 years of Olympics data

analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports

Last synced: 08 May 2026

https://github.com/randomfractals/unfolded-map-snippets

Html, CSS, JavaScript, and Python 🐍 vscode snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/lckylke/vizweb

Web application for data visualization:)

data expressjs nextjs web

Last synced: 08 May 2026

https://github.com/vanshuchaudhary/flightpriceanalysis-

The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.

business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis

Last synced: 08 May 2026

https://github.com/miniql/miniql-csv

A MiniQL query resolver that loads data from CSV files.

comma-separated-values csv data query query-language

Last synced: 08 May 2026

https://github.com/chompfoods/sdk-typescript-angular

Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 09 May 2026

https://github.com/dsietz/rust-daas

An example of implementing the DaaS pattern using Rust

archconf daas data kafka rust rust-lang

Last synced: 05 Sep 2025

https://github.com/naitiknayak196/tech-layoffs-cleaning-sql-vs-python

This project cleans and analyzes a tech layoffs dataset using MySQL and Python (Pandas) to compare their efficiency in data processing. It provides business insights into workforce trends, industry stability, and economic impacts to support data-driven decision-making.

data datacleaning dataset jyputer-notebook layoffdata layoffs mysql python sql

Last synced: 09 May 2026

https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson

UK "Counties" Extracted from Office for National Statistics data

data geojson maps uk

Last synced: 29 Mar 2025

https://github.com/burythehammer/foosbot-results

Foosball results for the OpenCredo foosbot

data foosball machine-learning python

Last synced: 13 Apr 2026

https://github.com/theanujsinha01/mcdonalds-customer-analysis

This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.

case-study data data-visualization dataanalysis

Last synced: 05 Sep 2025

https://github.com/dahsie/machine_learning_from_scratch

This project aims to implement some machine learning basic techniques(e.g. MinMaxScaler, StandardScaler, TD-IDF, PCA, Logistic Regression, LDA, KNN, Naive Bayes Classifier) using only pyton, numpy and pandas. This will enable me to have hone my data scientist skills

classification clustering data data-processing datascience machienlearning nlp nltk numpy pandas python regression

Last synced: 04 May 2026

https://github.com/mohamedbilal1800/olympic_history_data_analysis

This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.

analysis data eda matplotlib-pyplot pandas python seaborn visulaization

Last synced: 09 May 2026

https://github.com/so-cool/junction

My solution to the University of Bristol "Bristol Journey Time" Data Challenge https://So-Cool.github.io/junction

competition data modelling timeseries

Last synced: 02 Apr 2025

https://github.com/etmendz/mendz.data.oracle

Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.

ado-net context data database datasettings mendz oracle

Last synced: 13 Apr 2026

https://github.com/pietrapaz/bootcamp_dio_ciencia_de_dados

Bootcamp Potência Tech powered by iFood | Ciência de Dados - Dio ⚠️

cienciadedados dados data datascience python

Last synced: 09 Apr 2025

https://github.com/davorg/cookingvinyl

Web site with info about Cooking Vinyl records

cooking-vinyl data hacktoberfest music perl

Last synced: 02 Apr 2025

https://github.com/ginga1402/data_visualization_on_honey_production_dataset

Data Visualization using Matplotlib & Seaborn Libraries

college-project data data-visualization

Last synced: 25 Aug 2025

https://github.com/shadmanshaikh/data-analysis-and-ml-work

All of my work in Data Analysis and Machine learning

analytics artificial-intelligence data machine-learning

Last synced: 05 Jul 2025

https://github.com/rorylshanks/devdb-client

This is the repository for the official command line client for DevDB (https://devdb.cloud)

cloud data database-management development

Last synced: 29 May 2026

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/rohitblaze10/netflix_analysis_using_tableau

The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.

data data-analysis data-science data-visualization netflix tableau

Last synced: 04 Feb 2026

https://github.com/mustafaozvardar/selenium-eksisozluk

This project is a simple web scraper built with Python using Selenium. It extracts and prints the content of popular entries from a specific EksiSozluk page.

data python selenium selenium-python

Last synced: 29 Apr 2026

https://github.com/pdoup/enegry

Time-Series dataset combining multiple sources to explain the broader Greek energy market

data dataset day-ahead-auction energy-markets exploratory-data-analysis forecasting futures-market greek-energy-market renewable-energy time-series-data weather-data

Last synced: 07 May 2025

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/grace-mengke-hu/redditpushshiftapi

This package is for collecting Reddit dataset and organize the data in Mongo Database

collection data reddit

Last synced: 13 Jun 2025

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/yash-chauhan-dev/sf_analytics

Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.

aiml analytics data dbt langchain llm python snowflake streamlit

Last synced: 07 May 2026

https://github.com/living-with-machines/zoonyper

Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks

crowdsourcing data data-processing data-science python zooniverse

Last synced: 04 Jul 2025

https://github.com/berviantoleo/bervdata

Temporary data definition as db

data

Last synced: 01 Apr 2025

https://github.com/tdjsnelling/hermes

Hermes is a real-time data framework for React + MongoDB

data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket

Last synced: 12 Apr 2026

https://github.com/prishabhanot/facial_recognition_pca

A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.

data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier

Last synced: 01 Mar 2025

https://github.com/denisecase/dc-mailer

Send an email using Python

alerts data email python streaming

Last synced: 11 Apr 2025

https://github.com/keminghe/osu

Unofficial and publicly-available NPM data-package about The Ohio State University.

college data majors ohio-state organizations public students university unofficial

Last synced: 06 Jan 2026

https://github.com/zoetrope69/website

:tada: my website

data javascript personal

Last synced: 12 Jun 2025

https://github.com/posixpascal/apple_appstore_search

📊 get public App Store data of your app in a ruby hash — that's it.

appstore data gem ios ruby

Last synced: 16 Mar 2025

https://github.com/luminati-io/Google-Maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 09 Apr 2025

https://github.com/bkataru/spotigo

AI-powered local music intelligence platform with a task runner server core to retrieve and backup spotify account data to storage(s) at set periodic intervals

ai backup cron data go intelligence local-llm music ollama rag runner spotify task-runner tool-calling

Last synced: 16 Jan 2026

https://github.com/desoga10/nety-form

In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.

contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify

Last synced: 03 Jan 2026

https://github.com/filipnet/infoscreen

Arduino subscribes values by MQTT and view info on an OLED I2C display

arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation

Last synced: 12 Apr 2026

https://github.com/powersyang/visualization

data visualization templates 数据可视化模板

data templates visualization

Last synced: 24 Mar 2025

https://github.com/scx567888/scx-data

✨ SCX Data

data java scx

Last synced: 05 Apr 2025

https://github.com/nagipragalathan/linkedin_backup_datas

This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.

backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage

Last synced: 18 Jan 2026

https://github.com/srvanderplas/statistical_atlas

Framed Charts and the Statistical Atlas of 1870

census data ggplot2 graphics r statistics visualization

Last synced: 29 May 2026

https://github.com/chubek/pyramid-dashboard

A Dashboard to Show Data Made Using Plotly Dash

dash data docker ml plotly plotly-dash python

Last synced: 19 May 2026

https://github.com/shadeglare/genum

The ES Next tools to process data in a LINQ manner

data linq processing typescript

Last synced: 13 Apr 2026

https://github.com/alextanhongpin/node-github-api

:page_with_curl: sample github api queries with nodejs for scraping purposes

data github-api nodejs

Last synced: 06 May 2026

https://github.com/lightdash/quickstart-github

Instant analytics for Github

analytics business-intelligence data dbt github

Last synced: 14 Sep 2025

https://github.com/makcymal/silvera

My researches on ML and statistics, optimization methods, CS algoritms and numerical methods

algorithms data data-structures machine-learning numerical-methods statistics

Last synced: 01 Apr 2025

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026

https://github.com/dahmansphi/analysis_from_start_to_end

The Big Bang of Data Science- Analysis from the Start to The End- [Book Two]

analysis data data-analytics data-mining data-science hypothesis-testing jamovi machine-learning

Last synced: 08 Jan 2026

https://github.com/jooapa/bytebrother

Byte Brother is watching YOU

data data-analysis security

Last synced: 26 Jan 2026

https://github.com/zazza123/hamana

A python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.

analysis data python

Last synced: 14 Jan 2026

https://github.com/yuvrajsaraogi/car-price-prediction-with-machine-learning

The price of a car depends on a lot of factors like the goodwill of the brand of the car, features of the car, horsepower and the mileage it gives and many more. Car price prediction is one of the major research areas in machine learning. So, if you want to learn how to train a car price prediction model then this project is for you.

car-price-prediction-with-machine-learning data data-science deep-learning deep-neural-networks engineer github learning machine-learning mini-project natural-language-processing prediction predictive-modeling project python3 sql

Last synced: 15 Apr 2026

https://github.com/2kabhishek/pybank

Data Analysis for the silliest Bank 💰🏦

csv data data-science learning pandas python topic1 topic2

Last synced: 12 May 2026