An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/nika2811/new-york-city-taxi-fare-prediction

About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/franksunye/streamlitccdemo

A fully-featured Streamlit application demo, showing how to quickly deploy interactive web apps on Streamlit Community Cloud. Supports English, Chinese, and Spanish (i18n), including user interaction, file processing, data display, and database features.

data-visualization demo i18n streamlit

Last synced: 19 May 2026

https://github.com/namansnghl/medical-expense-prediction-linear-reg

Medical Insurance data EDA and premium prediction

analysis data-visualization regression-models

Last synced: 11 Jun 2026

https://github.com/jabulente/histogram-visualization-with-matplotlib

This repository showcases how to create visually appealing and customized histograms using Python’s Matplotlib and Seaborn libraries. It includes examples of enhancing default plots with colors, fonts, transparency, and layout adjustments to better communicate data distribution insights.

ai big-data-analytics data-science data-storytelling data-visualization histogram matplotlib

Last synced: 22 Jul 2025

https://github.com/jabulente/geospatial-data-visualizations-tanzania-s-administrative-geographic-and-socioeconomic-landscape

This repository showcases geospatial data visualizations focused on Tanzania's administrative boundaries, geographic features, and selected socioeconomic indicators. Using GeoPandas, Matplotlib, and other geospatial libraries, the project provides static and customizable maps of regions, districts, and population distributions.

ai data-science data-visualization geopandas geospatial-analysis geospatial-visualization machine-learning oops python tanzania tanzania-locations

Last synced: 29 Jul 2025

https://github.com/datasqlsantosh/project-portfolio-e-commerce-data-analysis

In this personal Project-Portfolio-E-commerce-Data Analysis project, an exploratory data analysis was performed on the E-commerce Data available on Kaggle. The main aim of the project is to uncover insights into the store's sales and profits trends and patterns from 2018 to 2019.

data-cleaning data-visualization database dataset exc power-bi sql

Last synced: 11 Sep 2025

https://github.com/jorgeterence/sic201

🐍 Coding exercises from the SIC201 Python bootcamp

algorithms bootcamp-project data-visualization exercises jupyter-notebook python

Last synced: 19 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/sisolieri/ds-market-data-science-final-project

Final project for my Master's in Data Science. It includes Business Intelligence with Power BI, KMeans clustering of products and stores, and multivariate sales forecasting using machine learning models for DS Market, a retail chain in the USA.

business-intelligence clustering data-science data-visualization kmeans machine-learning powerbi python retail-analytics time-series-forecasting xgboost

Last synced: 19 May 2026

https://github.com/lucasdota/bar_chart

Bar chart with JSON data fetching

d3js data-visualization fetch-api javascript json json-api

Last synced: 17 May 2026

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/andrewzgheib/football-database-analysis

Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization

data-analysis data-visualization database pandas pgsql postgr powerbi sql

Last synced: 04 Apr 2025

https://github.com/cagandemirmr/music_power_bi

I recreated my sql project in Power Bi by adjusting relationships between tables.

data-visualization dataanalysis powerbi relational-databases sqlserver

Last synced: 13 Mar 2025

https://github.com/kimaruthagna/geodjango

the project introduces the aspect of geodjango and storing of spatial data in a database.Postgres was used in this project

data-visualization donut-chart extension-postgis geodjango geomap graphos layers postgis postgresql-database python-json spatial-data

Last synced: 29 Oct 2025

https://github.com/autumnchris/doping-allegations-scatterplot

A D3.js scatterplot built in React.js that presents the top 35 fastest completions of the Alpe d'Huez and whether or not the associated cyclist has been accused of doping during that period.

babel css3 d3 d3-js d3js data-visualization data-visualization-certification doping-allegations-scatterplot freecodecamp javascript react reactjs sass scatterplot scatterplot-graph-challenge scss webpack

Last synced: 08 Apr 2026

https://github.com/nagar2nd/zomato-bangalore-analysis-tableau

Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvements.

data-analysis data-visualization tableau

Last synced: 05 Mar 2026

https://github.com/wa-lead/ml485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

data-engineering data-visualization machine-learning python

Last synced: 19 May 2026

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026

https://github.com/imnotamr/datasets-used

A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics

ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models

Last synced: 19 May 2026

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/danasilver/twacker

Track your Twitter friends (following) and followers.

data-visualization heroku-app twitter

Last synced: 12 Jul 2025

https://github.com/shuyib/london_weather_prediction

The London Weather Project aims to predict the mean temperature in London using historical weather data, involving data cleaning, feature engineering, and modeling with techniques like imputation, transformation, scaling, and the use of Mlflow for tracking model performance and hyperparameters.

data-cleaning data-lab data-science data-visualization datacamp-projects environmental-science feature-engineering forecasting jupyter-notebook machine-learning mlflow open-data python random-forest regression-analysis time-series weather-prediction

Last synced: 29 Mar 2025

https://github.com/borjamome/radiografia-madrid

Análisis de Población, Economía y Sociedad de Madrid con R.

data-analysis data-visualization madrid r

Last synced: 17 Jun 2025

https://github.com/sukhitashvili/pca_tutorial

PCA algorithm from scrach, using only matrix-vector multiplications

data-analysis data-science data-visualization machine-learning-algorithms pca

Last synced: 29 Mar 2025

https://github.com/stat-by-tish/house-insurance-data-analysis

Fraud detection in house insurance using MATLAB – EDA, classification (trees, KNN, SVM, RF), and clustering. Built for a student project.

classification classification-trees clustering data-visualization exploratory-data-analysis house-data insurance-claims kmeans-clustering knn-classification matlab

Last synced: 26 Jun 2025

https://github.com/no-country-simulation/c21-55-n-data-bi

Trabajo de análisis estadístico en Power Bi, sobre la deserción de alumnos en carreras culturales universitarias de argentina.

data-visualization

Last synced: 18 Feb 2026

https://github.com/samukiszhsd/alteryx-analytics

Você está trabalhando com dados de transações bancárias do Itaú e precisa fazer algumas análises para ajudar o time de auditoria a detectar padrões incomuns e possíveis transações suspeitas.

alteryx data-analysis data-structures data-visualization etl workflow

Last synced: 18 Feb 2026

https://github.com/prady2309/stock-analysis

Analysis on the stock prices of Apple, Google, Microsoft and Amazon

data-analysis data-science data-visualization python stock-market

Last synced: 19 May 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/saroshfarhan/irish_hospital_data_anaysis

Irish hospital's patient discharge data for four counties analysis

data-analysis data-science data-visualization healthcare irish-data r-programming-language

Last synced: 18 Feb 2026

https://github.com/chahelgupta/interactive-data-visualization-tool-java

The JavaFX project aims to build an interactive data visualization tool offering Pie Charts, Bar Graphs, and Line Graphs. Users can input data for each chart type, customize visual aspects like colors and labels, and interact with zooming and tooltips.

data-visualization data-visualization-javafx data-visualization-project java java-application javafx javafx-application javafx-application-framework javafx-project

Last synced: 02 Jan 2026

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/analyticalnahid/plotly-tutorial

A intro of Plolty for Data Science

data-science data-visualization ploty python3

Last synced: 28 Mar 2025

https://github.com/saketr3/voting-policy-impact-visualizer

Data visualization web app where users can compare voter turnout of different demographics with states’ voting policy fairness scores

data-visualization voting

Last synced: 14 Mar 2025

https://github.com/shellynagar27/marketing-content-performance-analysis

Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.

data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing

Last synced: 26 Jun 2025

https://github.com/alan-oliveir/previsao_cartao_fidelidade

Projeto de ciência de dados para previsão de plano de fidelidade para clientes de uma companhia aérea.

data-science data-visualization database gradio python sql

Last synced: 04 May 2026

https://github.com/grascya/sleep-health_-lifestyle-dataset

Classifier to predict the presence of a sleep disorder based on the other columns in the dataset.

data-visualization exploratory-data-analysis joblib machine-learning-algorithms pickle python statistical-analysis

Last synced: 20 May 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/the-ethan-hunt/dekh-data

Playground for data visualization notebooks

data-visualization jupyter-notebook python

Last synced: 28 Mar 2025

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/saisathvik07/e-commerce-sales-analysis-using-sql-and-powerbi

This repository provides an extensive examination of Amazon Sales Data utilizing SQL

analytics data-science data-visualization mysql-database powerbi

Last synced: 22 Mar 2025

https://github.com/harmonicode/filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 22 Mar 2025

https://github.com/holy-angel-university/student-performance-analysis

This project analyzes student data to understand factors affecting final exam scores. Data includes study habits, extracurriculars, family background, school environment, and demographics. The goal is to identify key contributors to academic success.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 06 Apr 2025

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/vaxdata22/customer-churn-data-analytics-etl-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.

amazon-athena amazon-redshift apache-airflow aws-ec2 aws-glue aws-s3 business-intelligence customer-churn-analytics dags data-visualization etl-pipeline orchestration power-bi python3

Last synced: 18 Jun 2025

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/prakhar-code/house_sales_analysis

House Sales Analysis Of King County, Washington, USA and Clean Visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/iankitnegi/ms-data-analyst-professional-certificate

Journey through the Microsoft Power BI Data Analyst Certificate with notes, projects, and exercises. 🚀

data-visualization microsoft powerbi

Last synced: 24 Jan 2026

https://github.com/archanakokate/bank_term_deposit_prediction

Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.

data-analysis data-visualization exploratory-data-analysis machine-learning

Last synced: 14 Sep 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/karanch10/fraudshield

FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.

data-analysis data-science data-visualization machine-learning python3

Last synced: 20 May 2026

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 01 Nov 2025

https://github.com/kaustubh-indulkar/te-it-dsbda-assignmnets

This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.

big-data big-data-analytics data-analytics data-science data-visualization sppu-2019-pattern sppu-it-dept

Last synced: 29 Mar 2025

https://github.com/patricksferraz/aqw-madrid-data-analysis

Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.

air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data

Last synced: 30 Jan 2026

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/irishmorales/ph-poverty-statistics

An exploratory data analysis of Philippine poverty data. Data includes given 1991-2015 data, appended FIES 2018 & 2021 data, and 2024 & 2027 poverty estimates calculated using ARIMA.

data-visualization exploratory-data-analysis philippines poverty-alleviation

Last synced: 22 Mar 2025

https://github.com/ranxi2001/predicting-mental-health-risk

数据分析案例-精神健康预测(数据来源kaggle)

data-analysis data-visualization eda

Last synced: 27 Jun 2025

https://github.com/samruddhi3012/rfm-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 27 Jun 2025

https://github.com/hannahgsimon/halmodeling2024

Developed code using the Hybrid Automata Library (HAL) to create a spatial agent-based model of radio-immune response to spatially fractionated radiotherapy. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/gappeah/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 25 Feb 2025

https://github.com/arosas17/mapping_earthquakes

Created a map to demonstrate the correlation between the tectonic plates and earthquakes. Circle were made on a map to indicate earthquakes, changing colors and size based on magnitude of the earthquake.

data-visualization javascript map

Last synced: 20 May 2026

https://github.com/vlad1343/data-visualisation

Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.

anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python

Last synced: 08 Apr 2026

https://github.com/faizantkhan/python_matplotlib

Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more

data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python

Last synced: 20 May 2026

https://github.com/traccyyyyy/employeehrwebapp

Modern web application built with Lit, featuring Web Components, real-time data visualization, responsive UI, and RESTful API integration.

api-rest data-visualization developer-tools frontend interactive-dashboard javascript lit real-time state-management ui-ux webapp webcomponents

Last synced: 20 May 2026

https://github.com/fvdavid/d3-in-action

Angular 19 Data Visualization D3js

angular d3-visualization d3js data-visualization typescript

Last synced: 08 May 2026