An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning. This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026
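
As a rough illustration of why an imbalanced fraud dataset like the one described above tends to be scored with F1 rather than plain accuracy, the sketch below computes both from confusion-matrix counts. The function name and the counts are hypothetical and are not taken from the repository.

```python
def f1_and_accuracy(tp: int, fp: int, fn: int, tn: int) -> tuple[float, float]:
    """Return (F1, accuracy) from confusion-matrix counts."""
    # Precision: share of flagged transactions that were really fraud.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: share of real fraud cases that were flagged.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    return f1, accuracy


# Hypothetical counts: a model that labels every transaction as legitimate
# on a dataset with 10 fraud cases out of 1000 transactions.
lazy_f1, lazy_acc = f1_and_accuracy(tp=0, fp=0, fn=10, tn=990)

# Hypothetical counts: a model that catches 8 of the 10 fraud cases
# and raises 2 false alarms.
good_f1, good_acc = f1_and_accuracy(tp=8, fp=2, fn=2, tn=988)
```

With these made-up counts the first model has high accuracy but an F1 of zero, which is the kind of gap that motivates reporting F1 alongside ROC AUC.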

https://github.com/imnotamr/datasets-used

A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics

ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models

Last synced: 19 May 2026

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/shuyib/london_weather_prediction

The London Weather Project aims to predict the mean temperature in London using historical weather data, involving data cleaning, feature engineering, and modeling with techniques like imputation, transformation, scaling, and the use of MLflow for tracking model performance and hyperparameters.

data-cleaning data-lab data-science data-visualization datacamp-projects environmental-science feature-engineering forecasting jupyter-notebook machine-learning mlflow open-data python random-forest regression-analysis time-series weather-prediction

Last synced: 29 Mar 2025

https://github.com/borjamome/radiografia-madrid

Analysis of the population, economy, and society of Madrid with R.

data-analysis data-visualization madrid r

Last synced: 17 Jun 2025

https://github.com/burakahmet/city-based-weather-forecasting-and-visualization-application

A MATLAB App Designer application that visualizes real-time and 5-day weather forecasts for selected cities using the OpenWeatherMap API.

data-visualization matlab matlab-gui visualization weather-api weather-app weather-forecast

Last synced: 26 Mar 2025

https://github.com/no-country-simulation/c21-55-n-data-bi

Statistical analysis in Power BI of student dropout in university cultural degree programs in Argentina.

data-visualization

Last synced: 18 Feb 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

🚴 Analysis of how the pandemic affected bicycle use in CABA (the Autonomous City of Buenos Aires).

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/chahelgupta/interactive-data-visualization-tool-java

The JavaFX project aims to build an interactive data visualization tool offering Pie Charts, Bar Graphs, and Line Graphs. Users can input data for each chart type, customize visual aspects like colors and labels, and interact with zooming and tooltips.

data-visualization data-visualization-javafx data-visualization-project java java-application javafx javafx-application javafx-application-framework javafx-project

Last synced: 02 Jan 2026

https://github.com/dona-eric/systeme-de-recommandations

Recommendation system for similar songs. The goal is to analyze Spotify data to build a recommendation system for similar songs with Word2Vec. The data is available on Kaggle at …

data-visualization nltk nltk-python recommendation-system word2vec

Last synced: 20 Mar 2025

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/analyticalnahid/plotly-tutorial

An intro to Plotly for Data Science

data-science data-visualization ploty python3

Last synced: 28 Mar 2025

https://github.com/shellynagar27/marketing-content-performance-analysis

Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.

data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing

Last synced: 26 Jun 2025

https://github.com/grascya/sleep-health_-lifestyle-dataset

Classifier to predict the presence of a sleep disorder based on the other columns in the dataset.

data-visualization exploratory-data-analysis joblib machine-learning-algorithms pickle python statistical-analysis

Last synced: 20 May 2026

https://github.com/rmodi6/ieee-cis-fraud-detection

IEEE-CIS Fraud Detection Kaggle Competition notebooks

data-science data-visualization fraud-detection kaggle logistic-regression xgboost

Last synced: 15 May 2026

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by others' work) for an information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 06 May 2026

https://github.com/holy-angel-university/student-performance-analysis

This project analyzes student data to understand factors affecting final exam scores. Data includes study habits, extracurriculars, family background, school environment, and demographics. The goal is to identify key contributors to academic success.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 06 Apr 2025

https://github.com/hemanth094/netflix-dashboard

This project features a Power BI dashboard that visualizes Netflix data from the provided CSV file. The repository includes the main Power BI project file, the dataset, and a related image. It's a straightforward data visualization project that demonstrates how to create an interactive dashboard for analyzing Netflix content.

data-visualization powerbi

Last synced: 16 Feb 2026

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/prakhar-code/house_sales_analysis

House sales analysis of King County, Washington, USA, with clean visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/iankitnegi/ms-data-analyst-professional-certificate

Journey through the Microsoft Power BI Data Analyst Certificate with notes, projects, and exercises. 🚀

data-visualization microsoft powerbi

Last synced: 24 Jan 2026

https://github.com/mdalamin5/data-science-machine-learning-basics

This repository is a comprehensive guide to Machine Learning algorithms, Python OOP, data preprocessing, and visualization using Pandas, NumPy, Seaborn, Scikit-learn, and more. It includes hands-on Jupyter notebooks, modular Python scripts, and a structured ML pipeline for training and evaluating models. 🚀

data-visualization datapreprocessing machine-learning-algorithms object-oriented-programming

Last synced: 15 May 2026

https://github.com/jibbs1703/airline-data-analysis

This repository contains an exploratory data analysis (EDA) of flight delays and cancellations for airline flights in the United States in 2015. Based on this EDA, insights and solutions are suggested for business owners and airport managers.

business-insights business-solution data-analysis data-visualization

Last synced: 20 Mar 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 01 Nov 2025

https://github.com/kaustubh-indulkar/te-it-dsbda-assignmnets

This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.

big-data big-data-analytics data-analytics data-science data-visualization sppu-2019-pattern sppu-it-dept

Last synced: 29 Mar 2025

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means clustering and association rules mined with the Apriori algorithm. Done in collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/dhou22/pulmoscan-project

A collaborative project with PulmoScan company focused on developing an advanced deep learning system for automated detection and classification of pulmonary nodules in chest CT scans, aiming to enhance early lung cancer diagnosis.

computer-vision data-visualization deep-learning lung-cancer-detection python

Last synced: 16 Apr 2026

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/ranxi2001/predicting-mental-health-risk

Data analysis case study: mental health prediction (data source: Kaggle)

data-analysis data-visualization eda

Last synced: 27 Jun 2025

https://github.com/samruddhi3012/rfm-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 27 Jun 2025

https://github.com/hannahgsimon/halmodeling2024

Developed code using the Hybrid Automata Library (HAL) to create a spatial agent-based model of radio-immune response to spatially fractionated radiotherapy. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/aniruddha-biswas/wavecon-telecom-analysis-report

Wavecon Telecom Analysis Report - An Internship Project of Codebasics

data-visualization dataanalysis powerbi powerpoint-presentations storytelling

Last synced: 11 Jan 2026

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/gappeah/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 25 Feb 2025

https://github.com/arosas17/mapping_earthquakes

Created a map to demonstrate the correlation between tectonic plates and earthquakes. Circles on the map indicate earthquakes, with color and size varying by the earthquake's magnitude.

data-visualization javascript map

Last synced: 20 May 2026

https://github.com/johnwalley/how-do-you-stack-up

Data visualisation and storytelling

data-visualization

Last synced: 11 Jan 2026

https://github.com/vzamboulingame/data-portfolio

This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.

data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql

Last synced: 20 May 2026

https://github.com/traccyyyyy/employeehrwebapp

Modern web application built with Lit, featuring Web Components, real-time data visualization, responsive UI, and RESTful API integration.

api-rest data-visualization developer-tools frontend interactive-dashboard javascript lit real-time state-management ui-ux webapp webcomponents

Last synced: 20 May 2026

https://github.com/zhouzhuofei/juliadl

Learning Julia: a set of notebooks on machine learning, data science, and visualization.

data-science data-visualization julia mxnet

Last synced: 21 Apr 2026

https://github.com/leandrocollares/long-range-brilliance

A responsive scatterplot showing minutes played and 3-point field goals made by the best 3-point shooters in NBA history

d3 data-visualization svelte

Last synced: 15 May 2026

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contains the data science agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 01 Jan 2026

https://github.com/codeonthespectrum/web-scrap

This project scrapes Wikipedia to obtain data on the most populous municipalities in the state of Rio de Janeiro.

data-analysis data-visualization webscraping

Last synced: 16 Feb 2026

https://github.com/heshamoomar/power-bi

Visualizing real survey data about people's jobs and fields of work using Power BI.

data-visualization microsoft-power-bi

Last synced: 04 Feb 2026

https://github.com/tanishpoddar/logitrack

LogiTrack is a Python & Streamlit-powered inventory management system for real-time warehouse optimization. It offers multi-warehouse planning, interactive maps, and supply chain analytics, supporting global coordinates, CSV/SQL data, and customizable parameters.

data-visualization database inventory-management logistics optimization python streamlit supply-chain supply-chain-analytics warehouse-optimization

Last synced: 02 Nov 2025

https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal

This repository contains a detailed analysis of implementing Lasso regularization in linear regression models to predict vehicle prices. Starting from a clean dataset, it applies various transformations and modeling steps to improve prediction accuracy.

data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn

Last synced: 16 May 2026

https://github.com/matheusbcmelo/primeirorelatoriopowerbi

First report developed in Power BI for the DIO course Python Data Analytics

business-intelligence data-visualization powerbi report

Last synced: 25 Jan 2026

https://github.com/mvharsh/blinkit-sales-dashboard

An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.

blinkitdashboard data-analysis data-visualization powerbi

Last synced: 25 Jan 2026

https://github.com/anonymo2239/big-data-churn-analyzer

Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.

big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark

Last synced: 21 May 2026

https://github.com/rsc-labs/see-open-data

Shows data from www.dane.gov.pl in a user-friendly format. Generates Flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/angchekar28/lung-cancer-prediction

This project builds and compares multiple machine learning models to predict lung cancer based on patient attributes. It evaluates classification models like Logistic Regression, Decision Tree, Random Forest, and SVM for early diagnosis.

data-science data-visualization jupyter-notebook lung-cancer-detection machine-learning model-comparison python

Last synced: 14 May 2026

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Using RStudio and datasets from https://www.gapminder.org/data/, I examine discrimination against women on individual continents and draw appropriate conclusions.

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/deller23/hotel_booking_data_cleaning

Efficiently transforming raw hotel booking data into actionable insights! This project leverages Python and Pandas for advanced data cleaning—handling missing values, detecting outliers, and optimizing features—ensuring a high-quality dataset ready for analysis and modeling.

data-analysis data-cleaning data-preprocessing data-visualization data-wrangling pandas python

Last synced: 31 Mar 2025

https://github.com/ascender1729/sentitweet

SentiTweet: Advanced sentiment analysis tool using AWS Comprehend and TextBlob. Analyze text sentiment via CLI or web interface with visualizations.

aws-comprehend cli-tool data-visualization machine-learning natural-language-processing python sentiment-analysis text-analysis textblob web-application

Last synced: 31 Mar 2025

https://github.com/abdoomohamedd/data-science-projects

A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.

data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms

Last synced: 14 May 2025

https://github.com/sarvamm/zeno-chat

Chat with your data in natural language and get insights and plots without writing any code

chatbot data-science data-visualization large-language-models streamlit

Last synced: 19 May 2026

https://github.com/vaxdata22/zillow-rapid-api-end-to-end-etl-data-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. The data pipeline is orchestrated with Apache Airflow on AWS EC2 together with AWS Lambda. It demonstrates how to build an ETL data pipeline that transforms data using a Lambda function and loads it into a Redshift cluster table. The data is then visualized using Amazon QuickSight.

amazon-quicksight amazon-redshift apache-airflow aws-ec2 aws-lambda aws-s3 business-intelligence dags data-visualization etl-pipeline orchestration python3 rapid-api zillow-house-listings

Last synced: 19 May 2026

https://github.com/botsakhil/eda-on-data-science-job-salaries

Repo for Exploratory Data Analysis using dataset of Data Science Job Salaries

data-science data-visualization exploratory-data-analysis python python-visualization

Last synced: 26 Feb 2025

https://github.com/adam0white/codepulse

Analyze development velocity of public GitHub repositories by calculating lines of code changed per minute between commits. Features interactive charts, summary statistics, and a clean, responsive interface built with React and Cloudflare Workers.

bun cloudflare-workers code-metrics commit-analysis data-visualization development-velocity github-analysis hono react recharts serverless shadcn-ui tailwind-css typescript vite

Last synced: 10 Apr 2026
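
The "lines of code changed per minute between commits" metric described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's implementation (which is described as React and Cloudflare Workers): the function name, the input shape, and the sample commits are all hypothetical.

```python
from datetime import datetime


def lines_per_minute(commits: list[tuple[str, int]]) -> list[float]:
    """Compute lines changed per minute for each consecutive commit pair.

    `commits` is an assumed input shape: (ISO 8601 timestamp, lines changed
    in that commit), ordered oldest first.
    """
    rates = []
    for (prev_ts, _), (ts, changed) in zip(commits, commits[1:]):
        elapsed = datetime.fromisoformat(ts) - datetime.fromisoformat(prev_ts)
        minutes = elapsed.total_seconds() / 60
        # Skip pairs with no elapsed time to avoid dividing by zero.
        if minutes > 0:
            rates.append(changed / minutes)
    return rates


# Hypothetical commit history: 60 lines changed after 30 minutes,
# then 30 lines changed after a further 60 minutes.
sample = [
    ("2024-01-01T10:00:00", 0),
    ("2024-01-01T10:30:00", 60),
    ("2024-01-01T11:30:00", 30),
]
rates = lines_per_minute(sample)
```

The first commit only supplies a starting timestamp, so a history of n commits yields at most n - 1 rates.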

https://github.com/gerhynes/d3-movie-quotes

A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 01 May 2026

https://github.com/ledsouza/estudo_matplotlib

Course project on using the matplotlib library

data-visualization matplotlib vitrinedev

Last synced: 03 Oct 2025

https://github.com/mem48/covid

Interactive MSOA Maps of Covid19 cases in England

covid data-visualization map msoa vector-tiles

Last synced: 24 Jul 2025

https://github.com/nischay002/us-honey-production-analysis

Analysis of US honey production (1995–2021) using Python & data visualization. Identifies trends in honey yield, pricing, and colony distribution across states.

data-analysis data-visualization exploratory-data-analysis honey-production matplotlib pandas python seaborn us-agriculture

Last synced: 26 Feb 2025

https://github.com/garcane/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 10 May 2026

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/kristishqau/apartmentregressionanalysis

This data science project aims to predict apartment prices through regression analysis. The dataset used contains information about apartments, and the project involves various steps such as data preprocessing, exploratory data analysis, feature engineering, and building a decision tree regression model.

apartment-prices data-preprocessing data-science data-visualization decision-tree-regression jupyter-notebook prediction python3

Last synced: 01 May 2026

https://github.com/tralahm/octave-matlab

Using MATLAB and Octave for machine learning and numerical computing

data-science data-visualization machine-learning matlab octave-functions octave-scripts tralahm tralahtek

Last synced: 13 Jun 2026

https://github.com/shefreenkaur/comp_430_project

A comprehensive, open-source business intelligence visualization tool designed for algorithmic trading systems. This application transforms complex trading data into intuitive visualizations, enabling traders and analysts to make data-driven decisions.

algorithmic-trading api-development business-intelligence data-analytics data-visualization etl-pipeline fastapi finance financial-analysis interactive-dashboard plotly streamlit

Last synced: 13 Apr 2026

https://github.com/ragedunicorn/mantisx-notebook

A repository for Jupyter notebooks analysing mantisx data

data-analysis data-visualization mantis mantisx shooting training

Last synced: 24 Jul 2025

https://github.com/epfromer/x2-vue

Vue.js front end for email searching. Interfaces to x2-server.

auth0 data-visualization graphql vue vuex

Last synced: 08 May 2026

https://github.com/gmbeddard/em255-intro_data_science-finalproject

A data science project analyzing symptoms, socioeconomic impacts, and diagnosis trends of endometriosis using NHANES datasets. Features machine learning models and visualizations to enhance healthcare insights.

data-science data-visualization pandas-python womens-health

Last synced: 23 Jul 2025