An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sundarsharma332/codeliner

Code snippet highlighter with custom line selection

code data-visualization program snapshot visualizer

Last synced: 10 Mar 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/ledsouza/covid19

Data analysis project on Covid19 cases

data-science data-visualization matplotlib pandas seaborn vitrinedev

Last synced: 05 May 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/shellynagar27/business-insights-360-project

A comprehensive dashboard that provides a better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding inventory planning and performance tracking. It can also be used to answer any number of "why" questions as situations arise.

dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager

Last synced: 27 Jan 2026

https://github.com/lkasym/smart-dynamic-pricing

An AI-powered dynamic pricing system using Dueling DQN and customer behavior simulation, with a full-stack React + Flask dashboard for real-time insights and performance benchmarking.

ai-project data-visualization deep-learning dqn-tensorflow ecommerce full-stack-ai machine-learning reinforcement-learning tensorflow

Last synced: 05 May 2026

https://github.com/jiyanshgarg/delhivery-logistics-data-analysis

This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.

business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment

Last synced: 12 Jun 2026

https://github.com/sayamalt/employee-attrition-prediction

Built a machine learning model that can accurately predict whether an employee of a given company will leave in the near future, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 08 Oct 2025

https://github.com/sayamalt/taxi-trip-fare-prediction

Built a machine learning model that can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 09 Nov 2025

https://github.com/gautam25raj/data-sync

A powerful platform designed to revolutionize the way teams collaborate and visualize data.

chat collaboration data-visualization express material-tailwind mongodb mongoose nextjs nodejs reactjs redux redux-toolkit tableau tableau-dashboard tailwindcss

Last synced: 11 Apr 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-run EDA toolkit with 30 commands covering text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/cartervr/taxdatabase-sql-tableau

End-to-end process for building an SQL Azure database, performing data analysis with SQL and Python, and visualizing data with Tableau.

azure data-science data-visualization database-architecture database-deployment database-management databse-design datanalysis erdiagram sql tableau

Last synced: 13 Mar 2026

https://github.com/cronware/predictive-maintenance

The Predictive Maintenance System is a C# WinForms application designed to monitor and analyze sensor data from industrial equipment in real time. It integrates machine learning (ML.NET) and MongoDB to detect anomalies, predict failures, and optimize maintenance schedules before equipment breakdown occurs.

csharp data-visualization dotnet machine-learning mlnet mongodb predictive-maintenance winforms

Last synced: 13 Apr 2026

https://github.com/pat8901/diskanalyzer-cli

Processes a PDF file containing storage utilization data to automatically create graph visualizations revealing the true demographics hidden in large data.

data-visualization graphs-generation matplotlib

Last synced: 27 Dec 2025

https://github.com/terilios/automated_data_scientist

Automated Data Scientist: An intelligent, adaptive data analysis tool that leverages AI-driven automation to dynamically plan, execute, and refine data science workflows. Automatically handles data preparation, analysis planning, code generation, and result interpretation using advanced language models.

adaptive-analytics ai-driven-analytics ai-powered-data-tools api-integration automated-data-science automation data-insights data-preparation data-science-workflow data-visualization dynamic-analysis-planning exploratory-data-analysis intelligent-data-processing language-models machine-learning ml-ops openai-gpt python scalable-data-analysis

Last synced: 23 Jun 2025

https://github.com/franloza/contratosdemadrid

This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.

data-visualization duckdb evidence open-data

Last synced: 23 Jun 2026

https://github.com/mattsebastianh/Make-a-Line-Chart

Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization matplotlib pandas-dataframe python

Last synced: 18 Jun 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike: Build a prediction model for Salary_hike. Build a simple linear regression model by performing EDA, applying the necessary transformations, and selecting the best model using R or Python. Covers EDA and data visualization, correlation analysis, model building, model testing, and model predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent project for a Kaggle competition: I worked on the obesity classification dataset as part of the Kaggle competition of the same name, scoring above 0.9 on accuracy.

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/djeada/data-visualization

This repository is dedicated to the exploration of various data visualization frameworks through bite-sized code snippets, as well as providing insights on effective data visualization techniques and principles.

altair data-visualization matplotlib plotly

Last synced: 08 Jan 2026

https://github.com/5hraddha/eda-instacart-customers-shopping-habits

In this Exploratory Data Analysis (EDA) project we'll clean up the data and prepare a report that gives insight into the shopping habits of Instacart customers.

data-visualization exploratory-data-analysis instacart matlpotlib numpy pandas

Last synced: 13 Apr 2026

https://github.com/haonamnguyen/costumer-shopping-trends-analysis

This project analyzes a synthetic dataset of customer shopping behavior to identify key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.

data-analysis data-visualization sql tableau

Last synced: 05 Jan 2026

https://github.com/bertiewooster/ipywidgets

Interactive data visualizations in a Jupyter Notebook per tutorial https://python.plainenglish.io/interactive-visualizations-with-pandas-seaborn-and-ipywidgets-173e5d7d6a5e

data-analysis data-science data-visualization ipython-notebook ipywidgets juypter-notebook python

Last synced: 06 Mar 2026

https://github.com/zulhaditya/netflix-analysis

Netflix data analysis using multiple Python libraries.

data-visualization python

Last synced: 19 May 2026

https://github.com/jatin-s16/hr_mysql_powerbi

This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.

data-analysis data-science data-visualization mysql powerbi

Last synced: 29 May 2026

https://github.com/manukot/sturdy-engine-python-

I've learnt not only various theoretical concepts but also worked on practical projects in my Master's coursework.

data-analysis data-visualization python3

Last synced: 13 May 2026

https://github.com/kinolag/traffic

A geospatial visualisation app showing road traffic information for all areas of Inner London. Built in TypeScript, combining React with D3.

d3 data-visualization geojson geospatial-visualization mapping react responsive-design svg topojson typescript

Last synced: 13 May 2026

https://github.com/chauxvive/fccbarchart

An interactive D3.js bar chart visualizing US GDP growth over time. Built as part of FreeCodeCamp’s Data Visualization certification.

d3 d3js data-visualization dataviz

Last synced: 13 May 2026

https://github.com/ivandobrovolsky/crimeaisukraine

How do maps, data libraries, streaming platforms, travel services, and internet infrastructure classify Crimea? We audited 200 digital platforms across 12 categories.

bigdata data-science data-visualization database machine-learning python

Last synced: 13 May 2026

https://github.com/alexgenovese/react-charts-covid-19-data

Examples of charting COVID-19 data with different chart libraries: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 13 May 2026

https://github.com/estnafinema0/housing-price-analysis

Predicting housing prices with regression models and visual analytics. Includes preprocessing, custom pipelines, and visualized performance metrics.

custom-models data-preprocessing data-visualization eda exploratory-data-analysis housing-prices jupyter-notebook machine-learning pipelines regression-models seaborn sklearn

Last synced: 13 May 2026

https://github.com/silkiemoth/eds-240-class-examples

Repository for in-class work assignments and notes in EDS-240 Data Visualization and Communication at UCSB.

classwork data-visualization r ucsb-meds

Last synced: 13 May 2026

https://github.com/magnus0969/heart-diease-eda

Exploratory Data Analysis (EDA) on heart disease data to uncover key risk factors and patterns. This project utilizes Python, Pandas, Seaborn, and Matplotlib to visualize trends, correlations, and insights that contribute to heart disease prediction and prevention.

data-insights data-science-projects data-visualization heart-disease-analysis python

Last synced: 14 May 2026

https://github.com/deliprofesor/joblocationmapper

JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.

clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python

Last synced: 14 May 2026

https://github.com/mathyouf/kaggle-notebook-code

Code and images that I used in Kaggle notebooks, mostly for style and code clarity.

data-visualization kaggle

Last synced: 14 May 2026

https://github.com/satvikpraveen/matplotlibmasterpro

📷 MatplotlibMasterPro is a complete, portfolio-ready project to master data visualization using matplotlib. Includes 16 notebooks, real datasets, exportable plots, custom themes, Streamlit dashboard, and Docker support. Ideal for learners and data professionals.

charts custom-plots dashboarding data-analysis data-science data-visualization educational-project interactive-visualizations jupyter-notebook matplotlib notebooks open-source plotting portfolio-project python python-utilities reproducible-research subplots time-series-analysis visualization-tools

Last synced: 14 May 2026

https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies statistical techniques such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 14 May 2026

https://github.com/yashsingh43/cdc-sleep-duration-health-analysis

Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.

beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python

Last synced: 11 Jun 2026

https://github.com/ewels/contributor-graphs

Contributor timelines for any git or GitHub repo: a publication-ready SVG and an interactive HTML page

cli contributors data-visualization git github open-source rust svg timeline visualization

Last synced: 11 Jun 2026

https://github.com/mohamedmetwalli5/breastcancerdiagnosis

Breast cancer diagnosis using machine learning via the XGBoost algorithm, after visualizing and exploring the dataset.

cancer data-visualization machine-learning

Last synced: 11 Jun 2026

https://github.com/mogalina/graph-rank-dynamics

Computational engine for analyzing how importance propagates through directed graphs.

data-visualization education graph-theory page-rank statistics

Last synced: 12 Jun 2026

https://github.com/madebysan/timeline

A static film timeline for seeing when movies are set, from ancient history to imaginary futures.

cinema data-visualization film html-css movies static-site timeline tmdb vanilla-javascript

Last synced: 12 Jun 2026

https://github.com/danielvartan/estela

🧒🏽🍎 Analysis and Visualization of Food Consumption Data for Brazilian Children Aged 2 to 4, as Monitored by SISVAN in 2019

brazil child-health child-nutrition data-science data-visualization malnutrition nutritional-epidemiology sisvan sus

Last synced: 12 Jun 2026

https://github.com/adamspannbauer/twitch_packed_bar

Example using a packed barchart to visualize emote usage in a twitch.tv chat

chat data-visualization data-viz packed-barchart twitch

Last synced: 12 Jun 2026

https://github.com/jorgeatgu/d3-bundle

Importing only the necessary d3 modules

d3js d3v4 data-visualization

Last synced: 13 Jun 2026

https://github.com/shashwat9kumar/trends_in_a_country_on_twitter

Finding trending topics in each country on Twitter and visualizing them in a WordCloud

data data-visualization trends tweepy twitter-api wordcloud

Last synced: 13 Jun 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/luizassimoes/q5ga-latency-and-throughput

Quick 5G Analyser: PyQt5 software developed to help with simple graphical analysis and chart generation for ping and iperf3 tests.

data-analysis data-visualization pyqt5 python

Last synced: 13 Jun 2026

https://github.com/jatinnxn/diabetes-prediction

This repository showcases a machine learning model built to predict diabetes using a diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/jameshulse/are-we-winning

An interactive map representing which countries are winning or losing their battle against Covid-19

coronavirus coronavirus-tracking covid-19 data-visualization vue vuejs

Last synced: 13 Jun 2026

https://github.com/april-jk/stoke-your-code

Trading-terminal style Git history viewer that turns repository activity into candlestick charts and volume bars.

analytics candlestick-chart data-visualization developer-tools git-history git-visualization github lightweight-charts react typescript

Last synced: 15 Jun 2026

https://github.com/prathmesh2507/global-stock-intelligence-dashboard

Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.

dashboard data-analysis data-visualization python stock-analysis streamlit

Last synced: 15 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/claudiahw/excel-sales-dashboard

Data-driven Excel dashboard visualizing sales trends, top products, and profit breakdowns with dynamic filtering options.

dashboard data-visualization excel excel-dashboard pivot-tables

Last synced: 15 Jun 2026

https://github.com/bpazy/my_running_page

Make your own running home page

codoon data-visualization garmin gpx keep nike strava

Last synced: 17 Jun 2026

https://github.com/hanifheinrich/population-data-visualization

Implementation of data visualization for the population data of Nagari Tanjung Balik, Solok Regency, West Sumatra, using Streamlit

data-visualization python streamlit-dashboard

Last synced: 16 Jun 2026

https://github.com/joonarafael/ids-exercises

Repository to store the exercise submissions for the Introduction to Data Science course (University of Helsinki).

course-work data-science data-visualization jupyter-notebook university-assignment

Last synced: 16 Jun 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to scrape data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/tanmayborse/institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

data-cleaning-and-preprocessing data-science data-visualization decision-making fuzzy-logic

Last synced: 17 Jun 2026

https://github.com/mattsebastianh/Making-a-Visual-Argument

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/mattsebastianh/Make-the-Other-Charts.-Silly-s-Ice-Cream-Shop-Project

Data Visualization with Matplotlib | Matplotlib Fundamentals | Silly's Ice Cream Shop Project

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-

This study is based on confirmed cases and deaths collected from Pakistan. We apply AI-based forecasting models such as time series ARIMA, LSTM, Prophet, and VAR. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight the superior performance of the time series models compared to the LSTM.

arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression

Last synced: 19 Jun 2026

https://github.com/mahapeth/invest-track

Implementation of a tool for monitoring user activity in the "Инвест" (Invest) information system, for a final qualifying thesis in program 01.03.02 Applied Mathematics and Informatics

analitycs app data-analysis data-visualization jupyter-notebook python sites

Last synced: 20 Jun 2026

https://github.com/an4pdm/relatorio-de-vendas

This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the Kaggle website.

data-analysis data-visualization database etl powerbi

Last synced: 20 Jun 2026

https://github.com/sakan811/stress-pattern-occurrence-in-english-words

This project is intended to provide English learners with data that allows them to make a data-driven guess when they encounter words whose stress placement they are unsure of.

data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals

Last synced: 20 Jun 2026

https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis

A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.

data-analysis data-visualization exploratory-data-analysis powerbi python3

Last synced: 20 Jun 2026

https://github.com/erishen/langgraph-csv-analyst

CSV → Multi-Agent Analysis Pipeline → Visual HTML Report. Built with LangGraph StateGraph: data profiling, trend analysis, anomaly detection, and investment portfolio analysis.

asset-lens csv-analysis data-visualization investment-analysis langchain langgraph multi-agent plotly python

Last synced: 23 Jun 2026

https://github.com/markalbrand56/ds-laboratorio-9

Data Exploration in different datasets

data-exploration data-visualization matplotlib pandas

Last synced: 23 Jun 2026

https://github.com/ladaegorova18/data_analysis

Learning the basics of data analysis in Python

analytics data-analysis data-visualization steam-games

Last synced: 24 Jun 2026

https://github.com/minervarose/exoplanet-discovery-observatory

Interactive Tableau exploration of exoplanet discoveries, planetary systems, and discovery trends using NASA Exoplanet Archive data.

analytics astronomy business-intelligence dashboard data-visualization exoplanets nasa space tableau tableau-public

Last synced: 24 Jun 2026

https://github.com/sami-bre/visualizing-sort-algorithms

A Jupyter lab showcasing the time-complexity behavior of the mergesort, quicksort, and insertion sort algorithms

data-visualization jupyter-notebook sorting-algorithms time-complexity-analysis

Last synced: 25 Jun 2026

https://github.com/saagpatel/signal-noise

An interactive essay teaching Bayesian reasoning through direct manipulation of live visualizations

bayesian d3 data-visualization education interactive-essay nextjs statistics typescript

Last synced: 28 Jun 2026