An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/iankitnegi/tableautales

"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."

data data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/lgibson7/stat_651

Coursework for STAT 651 Data Visualization, Cal State East Bay Fall 2022

data-visualization leaflet shiny-apps tableau

Last synced: 05 Feb 2026

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/abinjohn8138-commits/churn-analysis

This project focuses on analyzing customer churn behavior within a telecommunication company using visual insights. The goal is to understand what factors lead to customer attrition and help the business take proactive steps to retain customers.

colab-notebook data-visualization excel insights jupyter-notebook pandas python

Last synced: 05 May 2026

https://github.com/gauravsy704/sct_ds_2

Performed data cleaning and exploratory data analysis (EDA) on the Titanic dataset from Kaggle. Investigated the relationships between variables and identified key patterns and trends in the data using Python, with a focus on survival rates, passenger demographics, and embarkation details.

data-science data-visualization jupyter-notebook pandas python seaborn

Last synced: 06 May 2026

https://github.com/hiteshsahu/visual-studio-hybrid-application

Visual Studio and Java Script full duplex communication

data-visualization html5 javascript kendo-ui visual-basic

Last synced: 09 Oct 2025

https://github.com/armahdavi/analytics_statistics_ml_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 14 Apr 2026

https://github.com/lixx21/tableau_netflix_movies_tvshows_2021

Visualize netflix movies and tv shows in 2021

data-visualization dataset netflix tableau

Last synced: 19 Jan 2026

https://github.com/mgckaled/rs_data-analytics

Repositório agregador do conteúdo da formação Data Analytics desenvolvido pelo Rocketseat.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/damisparks/machine-learning

Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention

data-science data-visualization deeplearning machine-learning machine-learning-algorithms machinelearning-python

Last synced: 09 Oct 2025

https://github.com/juanes0023/dashboard-mtp

🚗 Track user activity and revenue in real-time with the Mileage Tracker Pro Dashboard for clear insights and growth trends.

analytics business-intelligence dashboard data-visualization plotly python real-time-analytics saas streamlit supabase

Last synced: 20 Apr 2026

https://github.com/sillyash/untappd-viz

A data visualisation page using public datasets and HTML/CSS/JS with D3.js.

beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project

Last synced: 18 May 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/hirkojoba/fintrack

Full-stack financial tracking app with ML forecasting and AI insights. Built with Rails, PostgreSQL, Python/scikit-learn, and OpenAI API.

artificial-intelligence data-visualization fintech full-stack machine-learning openai postgresql python ruby-on-rails scikit-learn

Last synced: 14 Apr 2026

https://github.com/loaiwalid07/automation_data_overviwe

This is Streamlit app that gives an overview for a dataset you upload

automation data data-analysis data-exploration data-science data-transformation data-visualization

Last synced: 19 May 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/salma-mamdoh/the-android-app-market-on-google-play-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/luthfirrahmanb/tips-visualization

visualization tips data set with dash plotly

dash data-science data-visualization plotly python3

Last synced: 11 Oct 2025

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/benst099/circlesplot

Visualize proportions with circles in a plot

cran cran-r data-science data-visualization proportions r visualization

Last synced: 11 Oct 2025

https://github.com/mouradhamzaoui/End-To-End-MLOPS-Airline-Project

This project aims to predict the number of passengers, freight quantity, and mail quantity for American airlines operating between Canadian and U.S. airports using an MLOps approach. It involves automating the data pipeline, from data extraction and preparation to model training and evaluation, leveraging tools like DVC, MLflow, and Docker for vers

data-visualization docker dvc github-actions machine machine-learning-algorithms mlflow

Last synced: 14 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/titanic-eda-visualization

Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.

data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 31 May 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce

This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.

data-analysis data-visualization ecommerce microsoft-sql-server powerbi python

Last synced: 18 May 2026

https://github.com/agarwalrachit399/fifa-19-analysis

Analysis of one of the most popular games ,FIFA-19 on tableau and a detailed report for the same

dashboard data-visualization fifa19 tableau

Last synced: 19 Jan 2026

https://github.com/luzmo-official/temperature-increase

A web app displaying Global temperature rises since 1961 based on the dataset made public by FAOSTAT

climate dashboard data-visualization temperature

Last synced: 19 Jan 2026

https://github.com/moustafamohamed01/san-francisco-salaries-data-analysis

Salary Data Analysis 💰📊 An analysis of salary data to uncover trends, distributions, and anomalies in employee compensation. This project involves data cleaning, visualization, and statistical analysis to explore salary trends, job roles, and correlations. Insights are presented using Python and Jupyter Notebook. 🚀

data-cleaning data-visualization jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 May 2026

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/arunabhagit/data-driven-marketing-optimization-enhancing-engagement-conversions-and-customer-satisfaction

Analyzed ShopEasy’s marketing data using SQL, Python, and Power BI to identify low engagement and conversion issues, performed sentiment analysis, and delivered data-driven strategies for measurable performance improvement.

business-intelligence data-visualization dataanalytics marketingstrategy powerbi python sql

Last synced: 13 Oct 2025

https://github.com/meettomb/focusetrack

FocusTrack is a simple desktop app built with WPF and C# that helps you see how you spend time on your computer. It tracks the apps you use, records how long you use them, and shows the data in easy-to-read charts and lists. With options to filter by today, last week, or last month, FocusTrack helps you understand your digital habits.

app-usage-tracker csharp data-visualization desktop-app livecharts monitoring productivity time-management wpfh

Last synced: 13 Oct 2025

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025

https://github.com/ashioyajotham/data_centers_are_eating_the_world

This idea is mostly how supercomputers (AI data centers) are coming up fast so it is an attempt to map them like Semi-Analysis does

data-center data-visualization mapping neon-postgres neondb postgis-database render-deployment supercomputers typescript vercel-deployment

Last synced: 01 Jul 2026

https://github.com/anushkundu/london-housing-market-analysis

London Housing Market Analysis: An Insightful Power BI Dashboard"

data-analysis data-visualization powerbi transformation

Last synced: 27 Jan 2026

https://github.com/mahambilalandahaan/week8

K-Means Deep Dive: Clustering analysis with Elbow and Silhouette methods in Python

clustering data-visualization jupyter-notebook k-means machine-learning python scikit-learn unsupervised-learning

Last synced: 20 Apr 2026

https://github.com/saisurajmatta/healthcare-data-analytics

Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.

data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery

Last synced: 22 Jan 2026

https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis

Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.

data-analysis data-visualization dax excel power-pivot power-query

Last synced: 22 Jan 2026

https://github.com/pngo1997/chicago-airbnb-cta

Interactive Chicago CTA train stations geospatial map.

data-visualization geospatial html python visualization

Last synced: 15 Oct 2025

https://github.com/iamrajhans/star-cred

Score a GitHub repo's stargazers 0-100 by how many are real, active developers vs. new, inactive, or bot-like accounts. 100% client-side dashboard — runs in your browser, no backend.

bot-detection credibility dashboard data-visualization fake-stars github github-api github-graphql github-pages react stargazers tailwindcss typescript vite

Last synced: 21 Jun 2026

https://github.com/harshindcoder/sleeping_time_survey_analysis

This survey analysis aims to identify lifestyle factors—such as screen time, exercise, alcohol consumption, beverages, and sleep direction—that affect sleep quality and duration. By analyzing these factors, we seek to predict sleep patterns and provide actionable insights to improve sleep health and overall well-being.

data-cleaning data-collection data-visualization exploratory-data-analysis regression-models survey-analysis

Last synced: 15 Oct 2025

https://github.com/madhursinghbhadoriya/cutomer_data_analysis1

Customer_Data Analysis - Tableau

chart data-visualization tableau

Last synced: 22 Jan 2026

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/gia-lexa/real-time-data-visualizer

An opportunity to tinker with data viz, this Python app tracks live stock prices using the Yahoo Finance API and surfaces the price changes in real-time using live-updating data visualizations.

data-visualization python

Last synced: 17 Oct 2025

https://github.com/gaurav0502/survey-analysis-visualizations

This repository consists of the various data visualization techniques used to analyze the responses of a survey conducted as a part of the project for the course MGT1002-Principles of Management

analysis data-visualization matplotlib seaborn visualization

Last synced: 17 Oct 2025

https://github.com/analyst-amitbisht/pizza-sales-report-

Its a guided project to practice tools like SSMS + Power BI & also skills like data cleaning, data exploration, data analysis, data visualization, etc.

analytics data data-visualization powerbi sql-server

Last synced: 18 Oct 2025

https://github.com/andreighinea1/bdv-project

Exploratory analysis of the Global Terrorism Database (GTD) using Python and Jupyter. This project visualizes global terrorist activity trends from 1970 to 2017, highlighting patterns across time, regions, and groups using ML techniques.

data-visualization exploratory-data-analysis jupyter-notebook kaggle-dataset terrorism-analysis

Last synced: 19 Oct 2025

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/srosalino/data_wrangling_investigations

Series of 3 investigation works, regarding the subject of Data Wrangling (Acquire data from different sources; Understand how to clean and pre-process data; Transform data for analytics purposes; Perform feature engineering; Visualize data)

data-cleaning-and-preprocessing data-extraction-and-pre-processing data-visualization feature-engineering

Last synced: 19 Oct 2025

https://github.com/tejaswirupa/united-airlines-flight-gain-analysis

Analyzed 30K+ United Airlines flights to evaluate time gained or lost during flight. Used hypothesis testing to compare on-time vs. late departures, identifying routes with the highest average time gain.

data-science data-visualization hypothesis-testing r statistical-analysis

Last synced: 27 Jan 2026

https://github.com/estebanrucan/personal-book

Algunos materiales de apoyo que he ido realizando en el tiempo

data-science data-visualization dplyr ggplot2 here magrittr r vroom

Last synced: 20 Oct 2025

https://github.com/umutdinceryananer/ctis471-supervised-ml

This project, developed in Python as part of CTIS471, integrates data analysis, machine learning models, and visualization techniques.

data-visualization machine-learning supervised-learning

Last synced: 20 Oct 2025

https://github.com/emirhansilsupur/hotel-booking-analytics-dashboard

Interactive Power BI dashboard visualizing hotel booking metrics for two Portuguese properties (Algarve resort & Lisbon city).

dashboard data-visualization power-bi

Last synced: 27 Jan 2026

https://github.com/lasyakonduru/e-commerce-analysis-using-advanced-sql

This project analyzes e-commerce order fulfillment using Advanced SQL Techniques and Python-based visualization to uncover insights on sales trends, customer segmentation, shipping cost optimization, and payment preferences.

business-analytics common-table-expressions customer-segmentation data-visualization database-design indexing normalization-techniques partitioning window-functions-in-sql

Last synced: 24 Jan 2026

https://github.com/polymervis/polymer-vis

Global namespace with some utility functions for PolymerVis.

data-visualization polymer polymer-elements visualization webcomponents

Last synced: 17 May 2026

https://github.com/lingumd/ny_citibike

Bike trip analysis to convince investors that a bike-sharing program in Des Moines is a solid business proposal.

data-visualization jupyterlab tableau

Last synced: 23 Oct 2025

https://github.com/keerthanapalanikumar/prodigy-infotech

This repository contains data science projects from my Prodigy Infotech internship, including data visualization, cleaning and EDA on the Titanic dataset, a decision tree classifier for the Bank Marketing dataset, and Twitter sentiment analysis.

data-cleaning-and-eda data-visualization decision-tree-classifier sentiment-analysis

Last synced: 23 Oct 2025

https://github.com/vishal-verma-96/employee_attrition_analysis

The Attrition Analytics Dashboard uses Power BI to analyze and visualize employee attrition trends, providing actionable insights to company for making strategic decisions.

attrition-analysis business-intelligence data-storytelling data-visualization interactive-dashboard powerbi

Last synced: 15 Mar 2026