An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/camara94/data_analyse_series_temporelles

In this tutorial, we answer the following questions: 1. Read the Microsoft data using the **Pandas Data reader** package 2. Get the **maximum price** of the stock from **2017 to 2022** 3. What is the **date of the highest price** of the stock? 4. What is the **date of the lowest price** of the stock?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/rmitsch/paella

Web application for visual parameter space analysis of topic models utilizing word embeddings.

data-visualization latent-dirichlet-allocation natural-language-processing topic-modeling word2vec

Last synced: 07 Jan 2026

https://github.com/itskshitija/analyzing-the-nyc-airbnb-market

The aim of this project is to use Python to understand the factors that influence Airbnb prices in New York City and to identify patterns across all variables. Our analysis provides useful information for travelers and hosts in the city and some of the best insights for the Airbnb business.

data-science data-visualization dataanalysis dataanalysisusingpython

Last synced: 22 Jul 2025

https://github.com/sravyatogarla/movie-recommendation-system

A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.

capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql

Last synced: 13 Apr 2026

https://github.com/carcesar/salariogovernadores2023

Visualization of governors' salaries in 2023

data-science data-visualization politics

Last synced: 24 Apr 2025

https://github.com/sanjana-bongale/cta_ridership_data_visualization_using_tableau

Tableau-based analysis of Chicago Transit Authority (CTA) ridership trends (2015-2024). It includes interactive dashboards, heatmaps, and comparative visualizations to explore bus and rail boarding data, COVID-19 impact, and long-term trends.

customer-analysis dashbaord data-visualization tableau

Last synced: 16 Feb 2026

https://github.com/gabrieladados/tableau_dashboards

Dashboards developed in Tableau

dashboards data-visualization figma tableau

Last synced: 09 Apr 2025

https://github.com/sanjiban08/coffee-sales-dashboard

Explore your coffee sales like never before with our Interactive Excel Dashboard—unlock insights, track trends, and enhance decision-making for a robust and caffeinated business strategy. ☕📈

data-cleaning data-visualization excel pivot-tables

Last synced: 26 Jan 2026

https://github.com/fbarffmann/sqlalchemy-challenge

Built a Flask API with SQLAlchemy to analyze and visualize Hawaii climate data. Automated data extraction and developed database queries for temperature and precipitation insights.

api climate-data data-analysis data-visualization flask orm python sql sqlalchemy sqlite

Last synced: 13 Apr 2026

https://github.com/petarran/gun-violence-usa

Data Science project comparing USA gun violence cases to their causes.

data-science data-visualization r

Last synced: 05 Sep 2025

https://github.com/nurulashraf/polynomial-regression-manufacturing

A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.

data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn

Last synced: 16 Apr 2026

https://github.com/wilkerhop/linestream

A dynamic line visualization using HTML, JavaScript, and SVG. Each point has a vertical position based on its currentPosition, and all points are connected. New points can be added dynamically, updating the visual representation in real time. This project explores JavaScript, DOM manipulation, and SVG rendering.

data-visualization dynamic-graphics frontend html interactive-ui javascript proof-of-concept svg web-development

Last synced: 29 May 2026

https://github.com/zen204/renewable-energy-usage-v-electricity-access

Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.

d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development

Last synced: 08 Feb 2026

https://github.com/naninsv/yulu-bike-rental-analysis

Assisting Yulu, India's micro-mobility provider, in understanding factors influencing shared electric cycle demand. Employing statistical tests and analysis on a dataset to identify significant predictors and gauge their impact on cycle demand.

data-visualization eda hypothesis-testing matplotlib-pyplot numpy-library pandas-library python scipy-stats seaborn-python statistical-analysis

Last synced: 13 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This dataset contains information about 10,000 movies extracted from TMDB, released from 1960 to 2015, including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/samruddhi3012/tata-data-visualization

Hi! This repo contains the dashboard I created using Tableau for TATA Data Visualization Training!

data-analysis data-visualization tableau tata

Last synced: 07 Jan 2026

https://github.com/shubham200137/expense-tracker-dashboard

The task is to create a Power BI dashboard from expense data (October–December) stored on SharePoint/OneDrive. It should include dropdowns for file and sheet selection with auto-refresh.

dashboard data-visualization powerautomate powerbi

Last synced: 04 Feb 2026

https://github.com/oenm176/hmeq-loan-analysis

Extracting insights from the Home Equity (HMEQ) dataset. This project builds a classification model to detect bad loans, featuring complete data preprocessing, normalization, and tree visualization using Python.

classification-model credit-risk-analysis data-mining data-science data-visualization decision-tree hmeq-dataset machine-learning python scikit-learn student-project

Last synced: 13 Apr 2026

https://github.com/danaelshrbiny10/gold-prices

The Egypt Gold Prices project is a data analysis and visualization initiative that focuses on tracking and understanding the daily gold prices in Egyptian pounds per gram.

data-visualization docker docker-compose matplotlib mongodb numpy pandas powerbi python3 webscraping

Last synced: 13 Apr 2026

https://github.com/bala-1409/tableau-visualization-viz.-project

This repository contains visualization projects built with Tableau. The visualizations yield insights and strategies that help a business grow its profit margins, and in some cases they provide social value by estimating the damage and intensity of calamities.

dashboard data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public visualization

Last synced: 04 Feb 2026

https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly

Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.

data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit

Last synced: 13 Apr 2026

https://github.com/lijesh010/covid-19_global_analytics_power_bi_project

This repository is a data visualization project that offers an in-depth analysis of the Covid-19 pandemic using Microsoft Power BI. This interactive dashboard provides valuable insights into key metrics related to Covid-19 cases, deaths, recoveries, and more, helping users understand the global impact of the pandemic.

dashboard data-analysis data-visualization powerbi report

Last synced: 08 Jan 2026

https://github.com/farhashaad/farhashaad98

This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.

data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau

Last synced: 24 Apr 2026

https://github.com/musamairshad/matplotlib-learning

This repository contains material related to learning Matplotlib.

data-science data-visualization matplotlib plotting python

Last synced: 09 Oct 2025

https://github.com/mishaa931/amazon-sales-dashboard-power-bi

This project features a dynamic Power BI dashboard built on dummy Amazon sales data. It visualizes key business metrics such as revenue trends, top-selling categories, discount impact, and geographic performance. The dashboard is designed to help stakeholders make data-driven decisions through clear, interactive visuals.

data-analysis data-quality data-visualization microsoftpowerbi

Last synced: 05 Feb 2026

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights

This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.

clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality

Last synced: 14 Jun 2025

https://github.com/umutonder97/project-network-ids

Network-Based Intrusion Detection System - development and deployment of a Hybrid Intrusion Detection System (HIDS) that integrates Signature-based Network Intrusion Detection Systems (SNIDS)

artificial-neural-networks convolutional-neural-networks covid-19 covid-19-russia covid19-data data-visualization genetic-algorithm ids keras-tensorflow knn microservices-architecture network-behavioral-analysis python time-series-forecasting

Last synced: 05 Jul 2025

https://github.com/shoebjoarder/superstore

A Dash app to analyze Superstore dataset.

dashboard data-analysis data-visualization python-3

Last synced: 02 Apr 2025

https://github.com/shellynagar27/good-cabs-data-analysis-project

This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.

critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling

Last synced: 25 Jan 2026

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/ndiplacide7/r-project

Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.

data-analysis data-visualization machine-learning-algorithms r

Last synced: 25 Mar 2025

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/brazer27/iris-classification

A Python implementation of Naive Bayes algorithm for Iris flower classification. Features include cross-validation, data preprocessing, and prediction capabilities. Built from scratch without ML libraries, achieving ~95% accuracy on the classic Iris dataset.

cross-validation data-science data-visualization flower-classification iris-dataset machine-learning naive-bayes python

Last synced: 06 Sep 2025

https://github.com/danielpeter/wiggles

iOS App for visualizing ground motion of nearby seismic stations

data-visualization fdsn seismics

Last synced: 12 Mar 2026

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 30 May 2026

https://github.com/shuyib/mouse_gut_otu

Vectorization and Unsupervised Learning of Mouse Operational Taxonomic Units to determine which species of bacteria form distinct groups in a dataset.

16s-rrna anaconda analysis data-visualization dataset gut-microbiome matplotlib-figures mothur numpy-arrays pandas-dataframe pca-analysis python3 scikitlearn-machine-learning sops t-sne unsupervised-learning

Last synced: 13 Apr 2026

https://github.com/zulhaditya/netflix-analysis

Netflix data analysis using multiple Python libraries.

data-visualization python

Last synced: 19 May 2026

https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.

acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends

Last synced: 30 May 2026

https://github.com/alainamariajoe/netflix-data-visualizer

A simple data visualization project using Netflix movies and TV shows dataset to create basic insights and visual representations.

data-visualization matplotlib pandas python seaborn

Last synced: 17 May 2026

https://github.com/suresh-chelani/crop-data-visualization

This project implements data visualization tasks using TypeScript, Vite, Apache ECharts, and Mantine v7. The goal is to process agricultural data, handle missing values, and render a table and a bar chart based on the dataset.

apache-echarts data-visualization mantine-v7 typescript vite

Last synced: 01 Mar 2025

https://github.com/omar7001-b/data-miner

DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.

data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit

Last synced: 28 Apr 2025

https://github.com/diogocarrola/freecodecamp-projects

A collection of projects completed as part of the freeCodeCamp curriculum. This repository showcases my progress and skills in web development, including HTML, CSS, JavaScript and more.

apis data-visualization front-end javascript responsive-design

Last synced: 26 Mar 2025

https://github.com/thomas-basham/ps-creel

This web application fetches fishing report data from the Washington Department of Fish and Wildlife (WDFW) Creel Reports page and displays it on an interactive map.

creel creel-survey data-science data-visualization database fish fishing nextjs postgresql puget-sound-data pugetsound react sql website

Last synced: 13 Apr 2026

https://github.com/aykutsp/world-infrastructure-data-hub

Interactive world map of fuel prices, electricity, EV charging costs and CO2 emissions — daily-refreshed open data pipeline.

choropleth climate-data co2-emissions data-visualization electricity-prices energy-data ev-charging fuel-prices github-actions leaflet open-data react sustainability typescript vite world-map

Last synced: 05 Apr 2026

https://github.com/nurulashraf/telco-customer-churn-prediction-model

This repository contains a Telco Customer Churn Prediction project using machine learning. It includes data preprocessing, exploratory data analysis, feature engineering, and model development to predict customer churn. Key tools used are Python, Pandas, NumPy, Matplotlib, Seaborn, and scikit-learn.

churn-prediction classification-model customer-churn data-visualization exploratory-data-analysis machine-learning predictive-analytics python scikit-learn

Last synced: 16 Mar 2025

https://github.com/rohanag03/whatsapp-chat-analyzer

This project is a comprehensive tool for analyzing WhatsApp chat histories. Using Python and Streamlit, it visualizes chat data to provide insights into messaging patterns and media sharing habits.

chat-analysis data-visualization python streamlit

Last synced: 12 May 2026

https://github.com/satyam4229/prediction-of-different-diseases

Real-time prediction of different diseases from their symptoms. The dataset contains 132+ different symptoms on which the model is trained to give the best prediction of the disease.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 13 Apr 2026

https://github.com/ankitrai259/sales_insight_dashboard

Sales Insight: Using SQL for data cleaning and Power BI to build an interactive dashboard

dashboard data data-visualization datacleaning postgresql powerbi sql

Last synced: 17 Mar 2025

https://github.com/fbarffmann/mycitibike

Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.

citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping

Last synced: 07 Jul 2025

https://github.com/nimomach/cafe-sales

This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.

cafe data-analysis data-visualization sales

Last synced: 12 Mar 2026

https://github.com/lvsvendsen/shime-monitor-r

R script for visualizing pH and pump activity in SHIME gut microbiome experiments.

data-visualization microbiome r research-tool shime

Last synced: 13 Sep 2025

https://github.com/mohsinraza2999/new-york-taxi-fare-analysis

This project analyzes taxi fares and estimates them in advance using regression analysis. Conducted EDA and hypothesis testing to identify key variables. Developed ML models (Random Forest, XGBoost) with GridSearchCV for hyperparameter tuning to accurately predict generous tippers.

ab-testing data-un data-visualization exploratory-data-analysis fea random-forest regression-analysis sklearn xgboost

Last synced: 17 May 2026

https://github.com/jianxi-erin/bigdata-machinelearning-lab

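This project is a comprehensive big data and machine learning lab platform comprising two main tasks, each covering three key technical modules: big data processing, data analysis, and machine learning. The project is based on real competition designs and provides complete data processing simulation and modeling practice.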

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/minhtungonep/android-traffic-analysis

Android malware detection project analyzing network traffic patterns in a telecommunications context. Uses statistical hypothesis testing and data visualization to evaluate traffic features like DNS query times, TCP packets, and volume bytes for distinguishing between benign and malicious Android applications.

android-library bachelor-project bachelor-thesis cybersecurity cyface data-visualization hypothesis-testing malware-detection matplotlib network-traffic numpy pandas python scipy sdk statistical-analysis telecommunications voip-security

Last synced: 09 May 2026

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/lgibson7/stat_651

Coursework for STAT 651 Data Visualization, Cal State East Bay Fall 2022

data-visualization leaflet shiny-apps tableau

Last synced: 05 Feb 2026

https://github.com/shivani0126/resturant_rating_analysis

Restaurant Ratings Analysis is a project in which ratings from real consumers in 2012, along with additional information about each restaurant and its cuisines and each consumer and their preferences, are visualised through a Power BI dashboard.

dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi

Last synced: 27 Jan 2026

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was produced to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/HarmoniCode/Filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 09 Oct 2025

https://github.com/jdede1/data-analysis-visualization-assignment-5

INFO 526 — Data Analysis and Visualization, Assignment 5 (Dashboard Reports — Iowa Liquor Sales). Part of the Master’s in MIS/ML program at the University of Arizona. Includes positive and negative dashboards showing key KPIs: top products/vendors driving sales vs bottom products/vendors hindering sales.

dashboard data-visualization matplotlib pandas seaborn

Last synced: 16 Apr 2026

https://github.com/hiteshsahu/visual-studio-hybrid-application

Visual Studio and JavaScript full-duplex communication

data-visualization html5 javascript kendo-ui visual-basic

Last synced: 09 Oct 2025

https://github.com/do-me/excel-column-analyzer

A free online tool to analyze Excel column data. Instantly count unique values, calculate frequencies, and visualize results in charts.

chartjs data-science data-visualization tailwind

Last synced: 09 Oct 2025

https://github.com/nidhi-kekatpure/churn-prediction-for-subscription-customers

End-to-end customer churn prediction using AWS SageMaker, XGBoost, and Python - complete ML pipeline from data generation to real-time inference with 79% AUC score.

aws business-intelligence data-science data-visualization jupyter-notebook predictive-analytics

Last synced: 16 May 2026

https://github.com/rohanchawla972/business-analytics-and-visual-storytelling

Business analytics repository integrating multi-source datasets and applying advanced methods — EDA, clustering, panel regression, sentiment analysis, and data storytelling — to extract actionable insights across hospitality, innovation, and financial services.

business-analytics clustering data-visualization jupyter-notebook predictive-modeling python storytelling

Last synced: 18 May 2026

https://github.com/anandu-jpg/coffee-shop-sales-analysis

This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.

business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton

Last synced: 18 May 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/frankelavsky/security-dash-challenge

I had two 8-hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on line graph. Tab two: Data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative bar charts over time. I used d3.js, require, MVC pattern, and vanilla JS.

client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization

Last synced: 14 Apr 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 19 Jan 2026

https://github.com/mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/moustafamohamed01/san-francisco-salaries-data-analysis

Salary Data Analysis 💰📊 An analysis of salary data to uncover trends, distributions, and anomalies in employee compensation. This project involves data cleaning, visualization, and statistical analysis to explore salary trends, job roles, and correlations. Insights are presented using Python and Jupyter Notebook. 🚀

data-cleaning data-visualization jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 May 2026

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/tashi-2004/Apache-Spark-Geospatial-Air-Quality-Analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 13 Oct 2025