An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/georgiosioannoucoder/2023-fall-data-science-ta

These are my code examples for the 2023-fall-data-science-ta as a Data Science Teaching Assistant at CUNY Tech Prep (CTP) Cohort 9. 📊

dashboard data-visualization decision-tree eda huggingface image-classification machine-learning ml neural-network nlp pandas random-forest regression teaching-assistant transformer

Last synced: 10 May 2026

https://github.com/drkbluescience/ibm-datascience-spacex

In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.

data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data

Last synced: 10 May 2026

https://github.com/loosenthedark/ci_data-visualisation-dashboard-mini-project

Code Institute IFD Module demo project using D3.js, Crossfilter, dc.js & queue.js to leverage sample data relating to salary levels & participation in academia parsed by gender. Bootstrap-based theme.

bootstrap4 code-institute crossfilter css3 d3js data-visualisation data-visualization dcjs frontend html5 javascript queue svg

Last synced: 11 May 2026

https://github.com/ceia-prefeitura/urban-lit-tracker-etl

UrbanLitTracker coleta artigos acadêmicos sobre mudanças urbanas via OpenAlex API, processa e armazena em MongoDB. Oferece dashboard interativo com Dash, exibindo dados como trabalhos mais relevantes, autores e palavras-chave frequentes, facilitando a análise e visualização da literatura urbana.

academic-research bibliometrics data-analysis data-pipeline data-visualization etl openalex-api urban-studies

Last synced: 11 May 2026

https://github.com/dannykyungh/data-analytics-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

advanced-excel data-cleaning data-modeling data-visualization data-warehousing google-sheets looker-studio python r sql tableau

Last synced: 12 May 2026

https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis

The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.

data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium

Last synced: 10 Jun 2026

https://github.com/arction/lcjs-example-0017-largelinechartxy

Example visualization of large line chart (several million data points)

data-visualization lightningchart-js line-chart template

Last synced: 12 Mar 2025

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/chiefinnovator/daysonpurpose

A single-page, no-dependency web app that lets a user enter their birth date, country, and gender to estimate remaining days, weeks, months, and years based on life expectancy data, with caching, fallback dataset, and accessible, responsive UI.

accessible-ui data-visualization life-expectancy react single-page-application typescript web-app

Last synced: 04 Apr 2026

https://github.com/callmemaverick/game-of-thrones-investigating-episodes

Data Science project to analyze the duration of Game of Thrones episodes

data-science data-visualization matplotlib pandas-python python

Last synced: 19 Dec 2025

https://github.com/wurstbroteater/hometemp

Measure temperature and humdity of a room, retrieve online weather data, visualize it, analyse it and send it via email.

apartment-management-system data-visualization raspberry-pi scraped-data temperature temperature-monitoring temperature-sensor

Last synced: 11 Jul 2025

https://github.com/hirudikaanupama/predicting-term-deposit-subscriptions

The purpose of this project is to help banks and financial institutions identify potential customers for term deposit subscriptions, optimize marketing strategies, and improve conversion rates using data-driven insights.

data-cleaning data-imbalance-handling data-normalization data-transformation data-visualization exploratory-data-analysis hyperparameter-tuning neural-network random-forest

Last synced: 11 Jul 2025

https://github.com/casperkristiansson/finance-tracker

A project which solved an issue of mine which was tracking my finance. This Finance Tracking application gives overviews of expenses and income to give its users an easy way to explore their data.

dashboard data-visualization finance-management firebase-auth react

Last synced: 29 Dec 2025

https://github.com/adnanrahin/nlp-with-disaster-tweets

Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.

data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets

Last synced: 21 Jun 2025

https://github.com/anuj7411/bankofbaroda-candlestick-dashboard

An interactive stock market visualization project using Python, Pandas, and Plotly to analyze Bank of Baroda price movement through a candlestick dashboard.

candlestick-chart dashboard data-visualization financial-data jupyter-notebook pandas plotly python stock-market time-series

Last synced: 17 May 2026

https://github.com/pkjjoshi/restaurants-analysis

Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.

beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn

Last synced: 21 Jun 2025

https://github.com/hirudikaanupama/email-spam-detection-logistic-regression

This model can predict whether an email is spam or not. The logistic regression machine learning algorithm is used to train this model.

accuracy-score classification classification-report confusionmatrix data-visualization logistic-regression machine-learning roc-curve

Last synced: 11 Sep 2025

https://github.com/onlinebunker/iris-flower

Exploratory Data Analysis of Iris Flower Classification Data

data-visualization eda pandas

Last synced: 28 Apr 2026

https://github.com/atharvkadammm/suicide-prediction-system

A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.

csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-

Last synced: 01 Jul 2025

https://github.com/atharvkadammm/calmlytic

An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.

anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost

Last synced: 21 Jun 2025

https://github.com/rezowanrahat/netflix_analysis

Data analysis of Netflix content using Python, Pandas, and Seaborn

data-analysis data-visualization netflix pandas python

Last synced: 07 May 2026

https://github.com/jessicaevelin/datascience

RepositĂłrio com atividades, exercĂ­cios e projetos realizados durante meus estudos em CiĂŞncia de Dados, baseados em cursos, livros, vĂ­deos e conteĂşdos da internet.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 21 Jun 2025

https://github.com/hariprasath-v/machinehack-music_genre_classification_weekend_hackathon_edition_2

predict the genre of the songs from tunable audio track features like energy, tempo, key, mode, and valence, and others.

data-visualization exploratory-data-analysis machine-learning

Last synced: 17 Apr 2026

https://github.com/benmar2406/rent-in-germany

Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.

charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte

Last synced: 26 Mar 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 21 May 2026

https://github.com/rmrt1n/chess_analysis_project

Webscraping and analysing games of Hikaru Nakamura

chess data-analytics data-visualization eda rvest tidyverse web-scraping

Last synced: 15 Jan 2026

https://github.com/alpkanoz/ibm_data_science_professional_certificate

The repository contains projects and training materials carried out throughout the IBM data science professional course.

classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn

Last synced: 07 Mar 2026

https://github.com/hirudikaanupama/student-score-prediction-linear-regression

Here the prediction and analysis of student scores using selected features is done entirely by linear regression machine learning algorithm. This project covers all methods of linear regression theory.

cross-validation data-cleaning data-visualization hyperparameter-tuning jupiter-notebook lasso-regression linear-regression machine-learning-algorithms multiple-linear-regression prediction-model python regularization ridge-regression student-score-prediction

Last synced: 26 Apr 2026

https://github.com/rvalla/covid-19-caba

Some code to analyze open data from Buenos Aires city related to COVID-19 pandemic.

covid-19 data-visualization python3

Last synced: 30 Oct 2025

https://github.com/jbalooshie/stock-analysis

A VBA script that performs basic stock analysis. Created while participating in a Data Analytics Bootcamp.

data-science data-visualization excel microsoft vba vba-excel vba-macros vba-script

Last synced: 20 Jan 2026

https://github.com/kate8382/frontend-module

Frontend module for a web application with user authentication, real-time dashboard, and data management

authentication dashboards data-visualization frontend

Last synced: 21 Jun 2025

https://github.com/saisurajmatta/cryptocurrency-market-analyzer-python-project

Cryptocurrency Market Analyzer: Python script utilizing CoinMarketCap API to fetch, analyze, and visualize real-time trends of top 15 cryptocurrencies over different time intervals.

data-analytics data-visualization matplotlib pandas python seaborn

Last synced: 05 May 2026

https://github.com/bayunova28/healthcare_analytics

This repository contains about data analytics project from healthcare industry

data-analytics data-engineering data-visualization healthcare pyspark sql

Last synced: 21 Jun 2025

https://github.com/dmytrori/himalayan_expeditions

Himalayan expedition stats, 1905–2020

alpinism data-analysis data-visualization pandas-python

Last synced: 21 Jun 2025

https://github.com/sadratehranian/pem-fuel-cell

The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.

anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code

Last synced: 26 Mar 2025

https://github.com/hfagerlund/machine-learning-iris-analysis

No longer maintained. Moved to https://github.com/hfagerlund/machine-learning-classifier-iris/.

data-visualization jupyter-notebook machine-learning python37

Last synced: 22 Jul 2025

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 17 Jul 2025

https://github.com/pjaiswalusf/heart-failure-prediction

A machine learning project predicting heart failure risk using Random Forest and XGBoost. It involves data cleaning, feature engineering, and EDA before training. The best model is saved using Joblib. Key techniques: outlier detection, feature scaling, and optimization.

data-processing data-visualization feature-engineering machine-learning model-training optimization random-forest-classifier saving-model xgboost

Last synced: 07 Mar 2026

https://github.com/theshashanksinha/deloitte-au

Analyzed telemetry and salary equality data using Tableau and Excel to identify machine downtime patterns and assess gender pay equity, translating raw data into actionable business insights.

data-analytics data-visualization microsoft-excel tableau

Last synced: 06 Mar 2026

https://github.com/rajesh9943/decoding-sales-patterns-strategic-insights-from-data

To identify the key drivers of sales and uncover patterns for strategic decision-making. This involves analyzing purchasing behavior by area, town, and commodity type, while also tracking customer choices over time. Descriptive statistics and time series analysis were used to reveal key sales trends across a four-year period.

data-analytics data-processing data-visualization data-wrangling reporting sales-analysis sales-growth

Last synced: 11 Jul 2025

https://github.com/aglowraph/gromacs-xvg-plot-script

A Python script for automating the plotting of .xvg files from GROMACS simulations, with dynamic labeling, time unit detection, and colorful visualization. This script reads, plots, and saves each .xvg file in the same directory, making data analysis more efficient.

automation computational-chemistry data-visualization gromacs matplotlib molecular-dynamics numpy python scientific-computing xvg-plotting

Last synced: 18 May 2026

https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc

Virtual Work Experience in Data Analysis at STC

analysis data data-visualization misk stc

Last synced: 20 Jun 2025

https://github.com/as16082023/manufacturing-downtime-analysis

In the Maven Analytics data challenge, analyzed manufacturing downtime for a soda production company using Excel, identifying key issues and root causes of delays. Insights were shared through tables, charts, and a concise report with actionable recommendations.

advanced-excel data-visualization excel

Last synced: 20 Jan 2026

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/rafinha0rafinha/web-analyzer-backend

(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.

azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer

Last synced: 10 Apr 2026

https://github.com/kgelli/news-sentiment-analysis-pipeline-with-microsoft-fabric

End-to-end news sentiment analysis pipeline built with Microsoft Fabric, analyzing Bing News API data with sentiment analysis, visualization in Power BI, and real-time alerts via Teams

azure bing-api data-activator data-engineering data-pipeline data-visualization fabric microsoft-fabric one-lake-synapse power-bi sentiment-analysis

Last synced: 10 May 2026

https://github.com/theshefer/covid-map

Interactive map showing covid data implemented on R language

big-data data-visualization r r-studio

Last synced: 20 Jun 2025

https://github.com/saravanansuriya/streamlit

Streamlit Tutorial for machine learning and data science.

data-visualization python-script streamlit-webapp

Last synced: 18 May 2026

https://github.com/sharinas/mapped_travel_locations

A web-based Python mapping project of specific places around the world, with interactive pop-ups and color coded markers. Project uses folium, pandas, python, and a .csv file to store data.

csv data-visualization folium mapping pandas pipenv python

Last synced: 18 May 2026

https://github.com/ianjure/simple-corr

A simple data correlation visualizer built in Streamlit.

data-visualization streamlit

Last synced: 18 May 2026

https://github.com/ahmetzamanis/clusteringcountry

Non-hierarchical k-medoids clustering on a dataset of country statistics.

clustering data-science data-visualization k-medoids machine-learning r rmarkdown unsupervised-learning

Last synced: 16 Dec 2025

https://github.com/yash22222/olympic-games-analytics-using-apache-spark

The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.

apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions

Last synced: 03 May 2026

https://github.com/leonardoberlatto/1000-startups-analytics

Data analytics on startups data using Tableau

analytics data-science data-visualization tableau

Last synced: 11 Jan 2026

https://github.com/mae776569/weratedogs-wrangling

Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations

data-analysis data-science data-visualization tweets twitter-api

Last synced: 25 Jan 2026

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/madrury/hot-sauce

Simuation of a Hot Sauce Spicyness Dataset

data-analysis data-science data-visualization dataset machine-learning

Last synced: 16 May 2026

https://github.com/whisplnspace/insightgenie

InsightGenie is an AI-powered data analyst that lets you upload files, ask questions, and get insights with visualizations

data-analysis data-science data-visualization deployment gemini-api huggingface nlp

Last synced: 19 Jun 2025

https://github.com/willmeyers/usgs-groundwater-trends

Visualized USGS groundwater level trends

data-visualization

Last synced: 30 Oct 2025

https://github.com/rafay99-epic/metricmate

Metric Mate is a modern, Python-based GUI tool for visualizing and analyzing gaming performance metrics with a sleek Tokyo Night theme.

data-visualization python python-gui-tkinter python-script

Last synced: 11 May 2025

https://github.com/benzerinsio/onlineretail-tableau

📊 Um dashboard interativo básico criado no Tableau para explorar vendas de uma loja online, com visualizações de receita por região e tendências temporais.

data-visualization eda sales-analysis tableau visualizacao-de-dados

Last synced: 09 Feb 2026

https://github.com/luka-j/csw5-eda

Materials for CS Week 5 lecture on exploratory data analysis

data-visualization r shiny tidyverse

Last synced: 26 Apr 2026

https://github.com/lucasfloresc/final_project

This is the final project of the Ironhack Bootcamp. In this project I applied all methods and tecniques learned in the Bootcamp, such as Web Scrapping and API extraction, Data cleaning and processing with Python, Python logic, the implementation of machine learning and Data Visualization. All displayed in Streamlit for more user friendly interface

data-analysis data-visualization machine-learning python streamlit webscraping

Last synced: 08 May 2026

https://github.com/arction/lcjs-example-0009-severalaxisxy

A demo application showcasing using multiple axes in LightningChart JS.

axis chart data-visualization lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/mkk-1817/cvip-ds-exploratory_data_analysis-terrorism

This repository deals with exploring global terrorism trends analyzing the Global Terrorism Database to uncover temporal patterns, identify top terrorist groups, examine attack types, and gain insights into geographical and success/failure dynamics.

coderscave data-analysis data-science data-visualization eda exploratory-data-analysis python terrorism-analysis

Last synced: 19 Jun 2025

https://github.com/guomaimang/magic-vaccine

A research of spread of COVID-19 with and without vaccine, also Group Project of COMP1433(Introduction of data analysis).

data-science data-visualization r-language

Last synced: 11 Jan 2026

https://github.com/kaczmarj/car-safety-shiny

An R Shiny app -- final project for BMI 530

cars data-visualization nhtsa shiny visualization

Last synced: 02 Feb 2026

https://github.com/guoweish/amo.gl

toolkit for native webgl

data-visualization glsl webgl

Last synced: 16 Feb 2026

https://github.com/mfakhriazhar/ecom-qtt-prediction

In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.

data-analysis data-science data-visualization e-commerce-project eda machine-learning python

Last synced: 19 May 2026

https://github.com/tanyakuznetsova/music_mental_health

Harnessing music's power for better mental health: genre recommendations and data-driven analysis of listeners' trends

data-visualization decision-tree decision-tree-classifier exploratory-data-analysis k-means-clustering pca-analysis recommendation-system recommender-system surprise-python

Last synced: 11 Jul 2025

https://github.com/yaser-123/movie_recommendation

The Movie Recommendation App provides users with personalized movie suggestions, trailers, and essential details, all through an intuitive and interactive interface.The **Movie Recommendation App** is a Streamlit-based application that suggests movies based on user preferences. The app uses data from the TMDB dataset and APIs like YouTube and OMDb

data-visualization imdb jupiter-notebook kaggle omdb-api python streamlit tmdb-api youtube-api

Last synced: 06 May 2026

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 30 May 2026

https://github.com/travisbreaks/sovereign-matrix

Ruthless project prioritization system. Multi-dimensional weighted scoring with real-time visualization. Kill what doesn't matter. React 19 + Tailwind + Framer Motion.

dashboard data-visualization decision-framework framer-motion prioritization react tailwindcss typescript

Last synced: 01 Mar 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026