An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/shivam5992/bokeh-vis

Visualising the acquisitions made by Google using python - Bokeh

bokeh bokeh-server data-visualization eda exploratory-data-analysis python

Last synced: 26 Jun 2025

https://github.com/touppercase78/salary-prediction-collection

Salary predictions with ML models and analyses on datasets from several other GitHub repos

data-analysis data-visualization datasets machine-learning python3 regression-models

Last synced: 02 May 2026

https://github.com/ramonanf/tc1002s_semanatec

Computational tools: The art of analytics

data-analysis data-visualization jupiter-notebook pandas-python

Last synced: 15 Jun 2025

https://github.com/nehul1149/olympic-data-analysis

This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.

analysis data-analysis data-science data-visualization matplotlib python streamlit

Last synced: 18 May 2026

https://github.com/amlanmohanty1/fannie-mae-borrower-behavior-and-characteristics-2007vs2019

Analysis using R and tidyverse to compare borrower behavior and characteristics between the years 2007 and 2019, focusing on key financial metrics such as credit scores, interest rates, debt to income ratios, and loan to value ratios.

data-visualization fannie-mae r tidyverse

Last synced: 13 Sep 2025

https://github.com/mimi-netizen/python-and-machine-learning-in-financial-analysis

This comprehensive repository covers financial data analysis using Python and machine learning techniques, including time series modeling, portfolio optimization, risk assessment, credit risk prediction, and deep learning applications in finance.

data-analysis data-science data-visualization finance financial-analysis financial-data financial-modeling

Last synced: 19 May 2026

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice plotting graphs and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/emircanakyuzz/logistic_regression_ile_tahmin-estimating_with_logistic_regression

We try to classify the expenditures of various companies using logistic regression, one of the machine learning algorithms. With the trained model, we predict (classify) which category the companies' expenditures in future years belong to, and we evaluate the model's performance.

artificial-intelligence data-science data-visualization jupyter-notebook logistic-regression machine-learning prediction

Last synced: 29 Mar 2025

https://github.com/sarvamm/zeno-chat

Chat with your data in natural language and get insights and plots without writing any code

chatbot data-science data-visualization large-language-models streamlit

Last synced: 19 May 2026

https://github.com/Lightning-Chart/lcjs-example-0053-dataGaps

Example showcasing how data gaps can be handled in XY series. Particularly highlights line and area series in a trading use case.

data-visualization demo example lightningchart-js template trading

Last synced: 22 Jul 2025

https://github.com/alifeee/occupation-data

Plotting Industry and occupation data from the ONS 2021 Census

census data-visualization employment occupation office-for-national-statistics ons pie-chart

Last synced: 26 Jun 2025

https://github.com/technologiestiftung/ihk-gewerbedaten-time-travel

Data visualization that allows time travelling through the Berlin business landscape of the past decades, highlighting business registrations for each year

businesses data-visualization map maplibre-gl-js open-data stimulus-js

Last synced: 22 Jul 2025

https://github.com/jabulente/tanzania-geographical-zones

This project provides a geospatial visualization of Tanzania's geographical zones and regions. It uses geospatial data to map each zone, display regions, and annotate them for easy identification. The visualizations include simulated data to demonstrate thematic mapping techniques.

ai data-analysis data-science data-visualization geopandas geospatial location matplotlib ml python tanzania tanzania-geographic tanzania-locations

Last synced: 19 May 2026

https://github.com/samuelson777/sensor-data-dashboard-simulator

Interactive Sensor Data Dashboard Simulator showcasing temperature, humidity, light, and motion sensors with real-time visualization. Fully client-side, responsive, and designed to demonstrate sensor handling and frontend skills without hardware.

client-side css dashboard data-visualization frontend html iot javascript realtime responsive sensor simulation visualization webapp

Last synced: 30 Apr 2026

https://github.com/christopher-dabrowski/posi-c1

Fundamentals of Artificial Intelligence - Laboratories C1

artificial-intelligence data-visualization laboratory-exercises

Last synced: 17 Mar 2025

https://github.com/iamsainikhil/data-visualization

Visualization of Web data using Python

data-analysis data-visualization python webscraping

Last synced: 13 Jun 2026

https://github.com/lovasoa/presidentielle

Charts corresponding to the latest IFOP polls for the 2017 French presidential election.

chart chartjs data-visualization elections francais france presidential

Last synced: 03 Apr 2025

https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics

Cleaned the data, performed EDA, and stored it in MySQL. Queried and analyzed the data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.

data-analysis data-visualization eda powerbi python sql

Last synced: 21 May 2026

https://github.com/tynandebold/day-length-line-chart

A day-length line chart, charting the length of daylight from different locations around the world.

d3 d3-visualization d3js data-visualization data-viz

Last synced: 19 May 2026

https://github.com/das-amlan/learning-plotly

This repository contains all the code from my Plotly learning path

data-visualization pandas plotly

Last synced: 07 May 2026

https://github.com/srvcl/lung-cancer-survival-analysis

Data Cleaning of a dataset and Survival Analysis in R Language

data-analysis data-science data-visualization r survival-analysis

Last synced: 11 May 2026

https://github.com/debjyotisaha/data-science-projects-phase-2

Developed and implemented data science projects leveraging Python, machine learning algorithms, and statistical techniques. Focused on predictive modelling, data preprocessing, and insights generation to solve real-world problems.

analysis data-visualization distribution python

Last synced: 30 Apr 2026

https://github.com/yrohitha/customer-segmentation

Clean real online transaction data and apply the RFM technique to rank and group clusters, in order to identify the best customers and perform targeted marketing campaigns

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 13 Mar 2025

https://github.com/mindlessmuse666/iris-knn

The project demonstrates applying the k-nearest neighbors (KNN) algorithm to classify the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of the results using the Pandas, Scikit-learn, Matplotlib, Seaborn, and Plotly libraries.

algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn

Last synced: 17 Aug 2025

https://github.com/nika2811/new-york-city-taxi-fare-prediction

In this project, we use a New York dataset to predict the fare of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips and contains 8 features along with the GPS coordinates of pickup and dropoff.

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/franksunye/streamlitccdemo

A fully-featured Streamlit application demo, showing how to quickly deploy interactive web apps on Streamlit Community Cloud. Supports English, Chinese, and Spanish (i18n), including user interaction, file processing, data display, and database features.

data-visualization demo i18n streamlit

Last synced: 19 May 2026

https://github.com/namansnghl/medical-expense-prediction-linear-reg

Medical Insurance data EDA and premium prediction

analysis data-visualization regression-models

Last synced: 11 Jun 2026

https://github.com/jabulente/histogram-visualization-with-matplotlib

This repository showcases how to create visually appealing and customized histograms using Python’s Matplotlib and Seaborn libraries. It includes examples of enhancing default plots with colors, fonts, transparency, and layout adjustments to better communicate data distribution insights.

ai big-data-analytics data-science data-storytelling data-visualization histogram matplotlib

Last synced: 22 Jul 2025

https://github.com/jabulente/geospatial-data-visualizations-tanzania-s-administrative-geographic-and-socioeconomic-landscape

This repository showcases geospatial data visualizations focused on Tanzania's administrative boundaries, geographic features, and selected socioeconomic indicators. Using GeoPandas, Matplotlib, and other geospatial libraries, the project provides static and customizable maps of regions, districts, and population distributions.

ai data-science data-visualization geopandas geospatial-analysis geospatial-visualization machine-learning oops python tanzania tanzania-locations

Last synced: 29 Jul 2025

https://github.com/datasqlsantosh/project-portfolio-e-commerce-data-analysis

In this personal Project-Portfolio-E-commerce-Data Analysis project, an exploratory data analysis was performed on the E-commerce Data available on Kaggle. The main aim of the project is to uncover trends and patterns in the store's sales and profits from 2018 to 2019.

data-cleaning data-visualization database dataset exc power-bi sql

Last synced: 11 Sep 2025

https://github.com/sgb31/covid-19-data-analysis

"In this project, I analyzed COVID-19 data to explore trends, case growth, and key patterns. I worked on cleaning the data, performing exploratory analysis, and visualizing infection rates, recoveries, and fatalities. The goal was to gain insights into how the pandemic evolved and its overall impact.

data-analysis data-visualization matplotlib pandas python seaborn

Last synced: 13 May 2026

https://github.com/ax-va/interactive-data-visualization-dale-2023

These examples of interactive data visualization in the browser using Flask and D3.js are compiled, with some modifications, from the book "Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data" by Kyran Dale, published by O'Reilly Media in 2023.

ax-va d3 d3-visualization d3js data-science data-visualization dataviz javascript python

Last synced: 13 Mar 2025

https://github.com/alexandrehiroyuki/bcc_endofcourseproject

End of course project for "Bases Computacionais da Ciência" class at UFABC.

data-science data-visualization jupyter-notebook python

Last synced: 17 May 2026

https://github.com/jorgeterence/sic201

🐍 Coding exercises from the SIC201 Python bootcamp

algorithms bootcamp-project data-visualization exercises jupyter-notebook python

Last synced: 19 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/mindlessmuse666/features-scaling

A project on feature scaling of the Iris dataset using Python, Pandas, Scikit-learn, Seaborn, and Plotly. It includes data visualization, the application of various scaling methods, and an evaluation of the performance of a logistic regression model.

data-scaling data-visualization feature-engineering iris-dataset machine-learning pandas plotly python scikit-learn seaborn student-project

Last synced: 16 Jun 2025

https://github.com/jatin-mehra119/insurance_dataset

The objective of this project is to predict insurance charges based on various factors.

data-visualization dataanalysis prediction-model python regression-models

Last synced: 15 May 2026

https://github.com/janashanaa/flightanalysis

This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 15 May 2026

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset: I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/satyacoder29/comparison-of-region-based-sales-tableau

The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.

data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions

Last synced: 02 Feb 2026

https://github.com/darylalim/streamlit-highcharts

Streamlit app for building data visualizations with the Highcharts for Python Toolkit.

data-visualization highcharts

Last synced: 01 Jul 2026

https://github.com/jaguzmana/colombia-covid-analysis

A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.

data-visualization mssql-database tableau

Last synced: 08 Mar 2026

https://github.com/arction/lcjs-example-0804-meshcircle

A demo application showcasing LightningChart JS IntensityMesh series.

chart data-visualization heatmap intensity lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/arction/lcjs-example-0908-3drealtimepoints

A demo application showcasing LightningChart JS to display 3D Scatter chart in real-time.

3d-visualization chart data-visualization lcjs lightningchart-js scatter-plot

Last synced: 12 Mar 2025

https://github.com/abdul-aa/kickstarters

Predictive Modeling and Clustering Insights for Kickstarter Success

boosting-ensemble clustering clustering-analysis data-visualization gradient-boosting kprototypes python shap

Last synced: 15 May 2026

https://github.com/sisolieri/ds-market-data-science-final-project

Final project for my Master's in Data Science. It includes Business Intelligence with Power BI, KMeans clustering of products and stores, and multivariate sales forecasting using machine learning models for DS Market, a retail chain in the USA.

business-intelligence clustering data-science data-visualization kmeans machine-learning powerbi python retail-analytics time-series-forecasting xgboost

Last synced: 19 May 2026

https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure

Data analysis carried out in R Shiny and Python on the French electric vehicle and charging station infrastructure

data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny

Last synced: 15 May 2026

https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformer model imported from Hugging Face.

bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization

Last synced: 05 Apr 2025

https://github.com/satyacoder29/crm-analytics

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 03 Mar 2025

https://github.com/danzed1/health-ai-assistant

🩺 Deliver personalized health advice using AI, providing instant, accurate responses to wellness questions in a user-friendly application.

ai ai-assistant andriod app-backend apple-health cv data-visualization datascience doctor dspy-ai healthcare healthcare-application ios llm patient promptql react-native whatsapp

Last synced: 15 May 2026

https://github.com/melih0132/projects

This repository showcases projects from my computer science journey, covering technologies like web development and interactive applications.

csharp data-visualization database game-development html-css javascript python software-development unity web-development

Last synced: 27 Mar 2025

https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project

This project was completed to simulate real-world tasks that data professionals encounter every day on the job.

dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery

Last synced: 08 Mar 2026

https://github.com/yujonglee/seoul-ultrafinedust-visualization

Seoul Ultrafine-dust Visualization using Open data.

data-visualization fine-dust perl5 processing

Last synced: 15 May 2026

https://github.com/lucasdota/bar_chart

Bar chart with JSON data fetching

d3js data-visualization fetch-api javascript json json-api

Last synced: 17 May 2026

https://github.com/oshinrathor/data-science-systems-and-analytics-projects

Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.

dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics

Last synced: 02 Mar 2025

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/barrettotte/anilist-ml

Training a binary classifier model to predict if I would recommend an anime using my Anilist user data.

anilist binary-classification data-visualization machine-learning scikit-learn

Last synced: 15 May 2026

https://github.com/cyberoctane29/deutsche-bank-customer-churn-prediction-end-to-end-analysis-and-modeling

In this project, I aim to predict customer churn for Deutsche Bank using supervised machine learning. It involves data exploration, feature engineering, and building Naive Bayes, Decision Tree, Random Forest, and XGBoost models. Models are tuned, evaluated, and compared to identify the best approach for churn prediction.

bank-customer-churn churn-analysis churn-prediction customer-churn-analytics data-analysis data-analytics data-visualization decision-tree eda gaussian-naive-bayes machine-learning random-forest supervised-learning xgboost

Last synced: 11 Oct 2025

https://github.com/cyberoctane29/salifort-motors-predicting-employee-turnover-and-improving-retention-analysis-and-modeling

In this project, I work as a data analytics professional at Salifort Motors, a fictional leader in alternative energy vehicles. I analyze employee survey data to identify turnover drivers and build predictive models, including multiple logistic regression, decision trees, and random forests, to forecast attrition and support retention strategies.

data-analytics data-visualization decision-trees eda employee-attrition ethical-artificial-intelligence feature-engineering logistic-regression machine-learning random-forest regression-analysis statistical-analysis supervised-learning turnover-analysis

Last synced: 09 Jul 2025

https://github.com/andrewzgheib/football-database-analysis

Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization

data-analysis data-visualization database pandas pgsql postgr powerbi sql

Last synced: 04 Apr 2025

https://github.com/cagandemirmr/music_power_bi

I recreated my SQL project in Power BI by adjusting relationships between tables.

data-visualization dataanalysis powerbi relational-databases sqlserver

Last synced: 13 Mar 2025

https://github.com/cyberoctane29/invistico-airlines-customer-satisfaction-prediction-end-to-end-analysis-and-modeling

This project presents an end-to-end workflow for predicting airline customer satisfaction using survey data. It involves building and evaluating classification models (Logistic Regression, Decision Tree, Random Forest, XGBoost), covering data cleaning, exploratory analysis, model training, tuning, evaluation, and feature importance analysis.

customer-satisfaction data-analytics data-visualization decision-tree eda logistic-regression machine-learning random-forest regression-analysis satisfaction-analysis statistical-analysis supervised-learning xgboost

Last synced: 29 Aug 2025

https://github.com/stephenombuya/data-visualization-with-r

This repository contains R programs that generate various types of charts and plots, commonly used for data visualization. The visualizations include:

bar-charts data-visualization plots-in-r r scatter-plot

Last synced: 29 Oct 2025

https://github.com/SciddhantoSinha/Chat-Analyzer-using-WhatsApp-Data-Visualization

A web app to analyze WhatsApp chats and gain insights into conversation statistics, activity trends, emojis, and more!

chat-analyzer data-visualization python streamlit whatsapp

Last synced: 27 Jun 2026

https://github.com/fadhiildzaki/bank_deposit_prediction

This project showcases an end-to-end data science solution to predict whether a client will subscribe to a term deposit based on historical marketing data from a Portuguese banking institution. By leveraging machine learning algorithms, the project aims to improve the efficiency and effectiveness of marketing campaigns.

classification-algorithm data-analyst data-science data-visualization machine-learning python statistical-analysis supervised-learning

Last synced: 15 May 2026

https://github.com/kimaruthagna/geodjango

The project introduces GeoDjango and the storage of spatial data in a database. Postgres was used in this project.

data-visualization donut-chart extension-postgis geodjango geomap graphos layers postgis postgresql-database python-json spatial-data

Last synced: 29 Oct 2025

https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.

acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends

Last synced: 27 Apr 2026

https://github.com/autumnchris/doping-allegations-scatterplot

A D3.js scatterplot built in React.js that presents the top 35 fastest completions of the Alpe d'Huez and whether or not the associated cyclist has been accused of doping during that period.

babel css3 d3 d3-js d3js data-visualization data-visualization-certification doping-allegations-scatterplot freecodecamp javascript react reactjs sass scatterplot scatterplot-graph-challenge scss webpack

Last synced: 08 Apr 2026

https://github.com/nagar2nd/zomato-bangalore-analysis-tableau

Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvement.

data-analysis data-visualization tableau

Last synced: 05 Mar 2026

https://github.com/wa-lead/ml485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

data-engineering data-visualization machine-learning python

Last synced: 19 May 2026

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning: This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026

https://github.com/imnotamr/datasets-used

A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics

ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models

Last synced: 19 May 2026

https://github.com/arosas17/bikesharing

Uses Tableau Public to create graphs that visualize clear patterns in a bike-sharing program, with findings that could possibly be applied to a new bike-sharing program.

data-visualization tableau-public

Last synced: 25 Apr 2026