An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/devanshsahu47/hr-dashboard-mysql-powerbi

A comprehensive HR dashboard that visualizes key workforce metrics such as employee demographics, attrition rates, and performance trends. Built using Power BI/Excel, it enables data-driven HR decision-making with interactive charts and KPIs.

data-analytics data-visualization excel power-bi

Last synced: 04 Feb 2026

https://github.com/smpotts/sp500_index_analysis

Uses the Plotly Dash framework to visualize publicly available data for companies listed on the S&P 500 index

dash-plotly data-visualization financial-analysis pandas-dataframe python

Last synced: 01 Apr 2025

https://github.com/shrutiijoshi/e-commerce

The dataset contains various attributes related to orders, customers, and products, providing a comprehensive view of the sales process.

analysis data-visualization tableau-public visualization

Last synced: 07 Jan 2026

https://github.com/jansim/nicknames

Specify human-readable names for the columns in your data once, then reuse them across your project to rename plot axes, dataframe columns, tables, and anything else.

data-cleaning data-visualization r r-package

Last synced: 04 Sep 2025

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python: hands-on course materials. Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples.

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/ronitjariwala/prodigy_ds_04

Prodigy InfoTech Data Science Internship Task-4

data-analysis data-science data-visualization python

Last synced: 02 May 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark that predicts the energy consumption of the steel industry, achieving an R² score of up to approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/sisolieri/ds-market-data-science-final-project

Final project for my Master's in Data Science. It includes Business Intelligence with Power BI, KMeans clustering of products and stores, and multivariate sales forecasting using machine learning models for DS Market, a retail chain in the USA.

business-intelligence clustering data-science data-visualization kmeans machine-learning powerbi python retail-analytics time-series-forecasting xgboost

Last synced: 19 May 2026

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026

https://github.com/bhaskarbharati/ibm-datascience-hands-on-lab

A basic hands-on exercise using Jupyter Notebook, completed as part of the IBM course Tools for Data Science.

data-analysis data-science data-visualization datawrangling eda machine-learning

Last synced: 23 Apr 2025

https://github.com/tclzcja/spark-comparison-visualization

A data visualization project I made for Blue Telescope/HP.

client-project data-visualization

Last synced: 15 Jun 2025

https://github.com/teja-1403/game-of-thrones-analysis

Demonstrates exploratory data analysis on the Game of Thrones (GOT) dataset using plots, graphs, and information extracted from text.

analysis data-visualization datascience machine-learning python

Last synced: 12 Apr 2026

https://github.com/amishidesai04/emergency-calls-data-analysis-project

Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.

analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn

Last synced: 02 May 2026

https://github.com/soumyajiitdas/My-GenAICapstoneProject

A Generative AI-powered journaling assistant that analyzes daily entries to extract emotions, stress levels, and mood trends — built using Google Gemini API for mental wellness insights.

ai-assistant data-visualization generative-ai machine-learning mental-health prompt-engineering python

Last synced: 04 Jul 2025

https://github.com/jorgeterence/sic201

🐍 Coding exercises from the SIC201 Python bootcamp

algorithms bootcamp-project data-visualization exercises jupyter-notebook python

Last synced: 19 May 2026

https://github.com/datasqlsantosh/project-portfolio-e-commerce-data-analysis

A personal portfolio project that performs exploratory data analysis on the E-commerce Data dataset available on Kaggle. The main aim is to uncover trends and patterns in the store's sales and profits from 2018 to 2019.

data-cleaning data-visualization database dataset exc power-bi sql

Last synced: 11 Sep 2025

https://github.com/namansnghl/medical-expense-prediction-linear-reg

Medical Insurance data EDA and premium prediction

analysis data-visualization regression-models

Last synced: 11 Jun 2026

https://github.com/zahramh99/dynamic-pricing-strategy

Dynamic Pricing is an application of data science that involves adjusting the prices of a product or service based on various factors in real time. It is used by companies to optimize revenue by setting flexible prices that respond to market demand, demographics, customer behaviour and competitor prices.

business-intelligence data-science data-visualization demand-prediction dynamic-pricing machine-learning predictive-modeling price-prediction price-prediction-model pricing-strategy revenue-optimization ride-sharing

Last synced: 27 Jun 2025

https://github.com/asuquoaa/ann_arbor_weather_analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

data-cleaning-and-preprocessing data-visualization

Last synced: 03 Apr 2025

https://github.com/angchekar28/valorant-gameplay-analysis

This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 12 Apr 2026

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the final project in the Udacity nanodegree program. It covers data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/iamsainikhil/data-visualization

Visualization of Web data using Python

data-analysis data-visualization python webscraping

Last synced: 13 Jun 2026

https://github.com/mimi-netizen/python-and-machine-learning-in-financial-analysis

This comprehensive repository covers financial data analysis using Python and machine learning techniques, including time series modeling, portfolio optimization, risk assessment, credit risk prediction, and deep learning applications in finance.

data-analysis data-science data-visualization finance financial-analysis financial-data financial-modeling

Last synced: 19 May 2026

https://github.com/mattsebastianh/Making-a-Visual-Argument

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/amlanmohanty1/fannie-mae-borrower-behavior-and-characteristics-2007vs2019

Analysis using R and tidyverse to compare borrower behavior and characteristics between 2007 and 2019, focusing on key financial metrics such as credit scores, interest rates, debt-to-income ratios, and loan-to-value ratios.

data-visualization fannie-mae r tidyverse

Last synced: 13 Sep 2025

https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python

We developed a song recommendation system for users based on data from our Spotify song dataset. The dataset and other applications are given in the description. Have a nice day.

data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system

Last synced: 02 May 2026

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding very large estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/sanjiban08/coffee-sales-dashboard

Explore your coffee sales like never before with our Interactive Excel Dashboard—unlock insights, track trends, and enhance decision-making for a robust and caffeinated business strategy. ☕📈

data-cleaning data-visualization excel pivot-tables

Last synced: 26 Jan 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, in which we make predictions about people's behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/sayamalt/employee-attrition-prediction

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 08 Oct 2025

https://github.com/dissorial/prx21_erikz

Analysis of self-tracked data: interactive visualizations & predictive algorithms

analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization

Last synced: 02 May 2026

https://github.com/ramonanf/tc1002s_semanatec

Computational Tools: The Art of Analytics

data-analysis data-visualization jupiter-notebook pandas-python

Last synced: 15 Jun 2025

https://github.com/shivam5992/bokeh-vis

Visualising the acquisitions made by Google using python - Bokeh

bokeh bokeh-server data-visualization eda exploratory-data-analysis python

Last synced: 26 Jun 2025

https://github.com/faizkhairi/repo-insights

Visualize commit patterns, language breakdown, and contributor stats for any GitHub repository

analytics data-visualization github github-api nextjs oauth recharts swr tailwindcss typescript visualization

Last synced: 02 May 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards. Each folder contains a .pbix file and a PDF file explaining the dashboard.

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/crazy-dot/hiring-process-analytics

Analyse the company's hiring process data and draw meaningful insights from it

data-analytics data-visualization hiring-process ms-excel-data-analytics statistical-analysis trainity

Last synced: 07 Jan 2026

https://github.com/zen204/renewable-energy-usage-v-electricity-access

Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.

d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development

Last synced: 08 Feb 2026

https://github.com/davityak03/stock-value-prediction-and-forecasting

This project predicts stock market trends using an LSTM neural network, focusing on Apple Inc.'s historical data for accurate future price forecasting. It includes data retrieval, preprocessing, model training, and evaluation.

data-visualization datareader lstm pandas python tensorflow tiingo

Last synced: 03 May 2026

https://github.com/sco1/xbmini-py

Python Toolkit for the GCDC HAM

data-analysis data-visualization python python3

Last synced: 07 May 2025

https://github.com/shivabajelan/squamous_cell_carcinoma_treatment_analysis

The study involved treating 249 mice with SCC tumors using a range of drug regimens, including Pymaceuticals' drug of interest, Capomulin. Over 45 days, tumor development was observed and measured to compare the performance of Capomulin against other treatments. My task was to generate tables and figures for the technical report of the study.

data-visualization matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/apelullo/cobalt_health_wellness_platform_ops

Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.

academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline

Last synced: 23 Mar 2025

https://github.com/chigurakula-bs/forecasting-time-series-on-covid-19

COVID-19 is one of the most pressing problems today, and many countries struggle to handle it. Data mining techniques provide a good tool for improving on manual analysis when identifying COVID-19 cases per day, deaths per day, and the number of patients cured per day.

arima-model data-mining-algorithms data-visualization jupyter-notebook sarima-model

Last synced: 28 Apr 2026

https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage

Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.

analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations

Last synced: 09 May 2026

https://github.com/curatorcodicis/reddit-sentiment-analyzer

A Python-based tool that analyzes sentiment trends in Reddit discussions. Fetch posts, analyze sentiment using NLP, and visualize trends in an interactive Streamlit dashboard.

data-visualization docker mongodb nlp praw python reddit sentiment-analysis streamlit

Last synced: 13 Apr 2026

https://github.com/isinghabhishek/data_analysis_with_python

Introduction to Data Analysis covering the basics of Python, Numpy, Pandas, Data Visualization, and Exploratory Data Analysis.

data-visualization exploratory-data-analysis numpy pandas python

Last synced: 03 May 2026

https://github.com/derrmru/whats-in-the-news

Data Visualisation of News Content

data-visualization nlp react scraped-data

Last synced: 17 May 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/loosenthedark/ci_data-visualisation-dashboard-mini-project

Code Institute IFD Module demo project using D3.js, Crossfilter, dc.js & queue.js to leverage sample data relating to salary levels & participation in academia parsed by gender. Bootstrap-based theme.

bootstrap4 code-institute crossfilter css3 d3js data-visualisation data-visualization dcjs frontend html5 javascript queue svg

Last synced: 11 May 2026

https://github.com/timjjting/escaping-flatland

A demo of optimization techniques for plotting large datasets in a 3D space using Three.js + Svelte integration

big-data data-visualization glsl-shaders lod octree svelte sveltekit three-js

Last synced: 11 May 2026

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/mattsebastianh/Making-a-visual-argument--compare-grammy-win-records-Project

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data

Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters

data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data

Last synced: 05 Jul 2025

https://github.com/virajbhutada/titanic-survival-prediction

ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.

ada-boost-classifier data-exploration data-science data-visualization decision-tree-classifier hyperparameter-tuning knn-classification logistic-regression machine-learning model-interpretation random-forest-classifier roc-curve titanic-classification

Last synced: 14 Jun 2026

https://github.com/mkaspulanwar/p6_bigdata_realtime_largescale_visualization

Week 6 Big Data practicum: real-time analytics and large-scale data visualization using PySpark Structured Streaming, a Parquet data lake, and Streamlit for monitoring smart-city mobility and traffic.

big-data data-visualization pyspark spark-streaming streamlit traffic-analytics

Last synced: 13 Apr 2026

https://github.com/corey-richardson/microbit-data-logger

In preparation for Work Experience Students coming in, I am using this project to familiarise myself with the BBC micro:bits which we will provide them with. I am also using it as a chance to expand on my data visualisation with Python experience.

data-visualization matplotlib microbit pandas pyplot signal-processing

Last synced: 03 May 2026

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 02 Apr 2025

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/shellynagar27/business-insights-360-project

A comprehensive dashboard that provides a better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding inventory planning and performance tracking. It can also be used to answer any number of "why" questions as situations arise.

dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager

Last synced: 27 Jan 2026

https://github.com/parthivnaresh/facilyst

Facilyst is a library that makes using data science and machine learning tools easier.

data-science data-visualization deep-learning machine-learning mock-data neural-network python

Last synced: 18 Mar 2025

https://github.com/rohitinu6/tesla-price-prediction

A machine learning project that predicts future stock price movements using Logistic Regression, SVC, and XGBoost with engineered financial features.

data-analysis data-visualization feature-engineering financial-analysis logistic-regression machine-learning matplotlib python scikit-learn seaborn stock-market stock-price-prediction support-vector-machine time-series xgboost

Last synced: 03 May 2026

https://github.com/muichi-mon/fxplot

A simple JavaFX-based plotting library for quick and easy data-visualization.

data-visualization javafx plot series-data

Last synced: 16 May 2026

https://github.com/ahmedmmahrous/movie-recommendation-and-analysis

Performs analysis and basic recommendations based on similar genres and movies that users prefer.

data-visualization feature-engineering nu pan py recommender-system seaborn

Last synced: 03 Feb 2026