An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/vaxdata22/customer-churn-data-analytics-etl-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.

amazon-athena amazon-redshift apache-airflow aws-ec2 aws-glue aws-s3 business-intelligence customer-churn-analytics dags data-visualization etl-pipeline orchestration power-bi python3

Last synced: 18 Jun 2025

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/rmodi6/ieee-cis-fraud-detection

IEEE-CIS Fraud Detection Kaggle Competition notebooks

data-science data-visualization fraud-detection kaggle logistic-regression xgboost

Last synced: 15 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/alra-code/data-analytics-com-power-bi

Desafio de projetos do Boocamp Data Analytics realizado pela Dio Me em 2024

analytics data-visualization desafios-resolvidos dio-bootcamp powerbi pt-br

Last synced: 25 Jan 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/djsprenk/djsprenk.github.io

GitHub Pages site for DJ Sprenk

d3 d3-visualization data-visualization dj music python

Last synced: 20 May 2026

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/firyanulrizky/ubud-souvenir-center-v1.0

Undergraduate Thesis Project (Mobile Apps Management Sales & E-Marketplace using Apriori Algorithm) Dedicated to Ubud Art Market

apriori-algorithm data-analytics data-mining data-visualization php web-application

Last synced: 27 Jun 2025

https://github.com/hemanth094/netflix-dashboard

This project features a Power BI dashboard that visualizes Netflix data from the provided CSV file. The repository includes the main Power BI project file, the dataset, and a related image. It's a straightforward data visualization project that demonstrates how to create an interactive dashboard for analyzing Netflix content.

data-visualization powerbi

Last synced: 16 Feb 2026

https://github.com/mdalamin5/data-science-machine-learning-basics

This repository is a comprehensive guide to Machine Learning algorithms, Python OOP, data preprocessing, and visualization using Pandas, NumPy, Seaborn, Scikit-learn, and more. It includes hands-on Jupyter notebooks, modular Python scripts, and a structured ML pipeline for training and evaluating models. 🚀

data-visualization datapreprocessing machine-learning-algorithms object-oriented-programming

Last synced: 15 May 2026

https://github.com/jibbs1703/airline-data-analysis

This repository contains the Exploratory Data Analysis of the flight delay and cancellation for airline flights in the United States in the year 2015. With this EDA, insights and solutions are suggested for business owners and airport managers.

business-insights business-solution data-analysis data-visualization

Last synced: 20 Mar 2025

https://github.com/dhou22/pulmoscan-project

A collaborative project with PulmoScan company focused on developing an advanced deep learning system for automated detection and classification of pulmonary nodules in chest CT scans, aiming to enhance early lung cancer diagnosis.

computer-vision data-visualization deep-learning lung-cancer-detection python

Last synced: 16 Apr 2026

https://github.com/vlad1343/data-visualisation

Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.

anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python

Last synced: 08 Apr 2026

https://github.com/faizantkhan/python_matplotlib

Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more

data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python

Last synced: 20 May 2026

https://github.com/vzamboulingame/data-portfolio

This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.

data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql

Last synced: 20 May 2026

https://github.com/aniruddha-biswas/wavecon-telecom-analysis-report

Wavecon Telecom Analysis Report - A Internship Project of Codebasics

data-visualization dataanalysis powerbi powerpoint-presentations storytelling

Last synced: 11 Jan 2026

https://github.com/zhouzhuofei/juliadl

learning Julia, write some notebooks, like machine learning and data science, visualization.

data-science data-visualization julia mxnet

Last synced: 21 Apr 2026

https://github.com/maruf-hossen/kaggle-projects-and-learning

Comprehensive data science learning journey through Kaggle courses and exercises. Documenting progress in SQL, Python, ML, and data visualization with practical projects and business applications.

business-intelligence data-cleaning data-science data-visualization kaggle learning-journey machine-learning pandas python sql

Last synced: 05 May 2026

https://github.com/johnwalley/how-do-you-stack-up

Data visualisation and storytelling

data-visualization

Last synced: 11 Jan 2026

https://github.com/xmen3em/kaggle-competitions

This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.

data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit

Last synced: 09 Apr 2026

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 01 Jan 2026

https://github.com/htanh2003/datamate

DataMate là một công cụ phân tích dữ liệu thông minh, kết hợp sức mạnh của mô hình ngôn ngữ lớn (LLM) và giao diện trực quan, giúp người dùng dễ dàng tải lên tệp CSV, khám phá dữ liệu, và nhận các phân tích thông minh

agent data-visualization deployment docker docker-compose langchain nginx streamlit

Last synced: 08 Apr 2026

https://github.com/gui-sitton/timeseries-taxi

To attract more drivers during peak hours, we need to predict the amount of cab requests for the next hour. Build a model for this prediction.

data-science data-visualization machine-learning ml python time-series time-series-analysis time-series-prediction

Last synced: 20 May 2026

https://github.com/hemant-kumar786/heart-disease-prediction

Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.

correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis

Last synced: 02 Nov 2025

https://github.com/mondial7/echarts-wc-import

Webcomponent to import Echarts library - current version echarts 3.8.5

data-visualization echarts3 polymer2 webcomponents

Last synced: 03 May 2026

https://github.com/errea/vet_clinic_database

For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.

data data-analysis data-structures data-visualization database

Last synced: 21 May 2026

https://github.com/kashirin-alex/thither.direct-onamove

an android skeleton-example application for using data from Thither.Direct platform on mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 27 Apr 2026

https://github.com/matheusbcmelo/primeirorelatoriopowerbi

Primeiro relatório desenvolvido em PowerBI no curso DIO - Python Data Analytics

business-intelligence data-visualization powerbi report

Last synced: 25 Jan 2026

https://github.com/cassiofb-dev/covid-grafico

Gráficos da COVID-19 (Mortes e Casos) com Chart.js.

chartjs covid-19 data-visualization

Last synced: 25 Jan 2026

https://github.com/anonymo2239/big-data-churn-analyzer

Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.

big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark

Last synced: 21 May 2026

https://github.com/leandrocollares/long-range-brilliance

A responsive scatterplot showing minutes played and 3-point field goals made by the best 3-point shooters in NBA history

d3 data-visualization svelte

Last synced: 15 May 2026

https://github.com/eins51/restaurantanalytics

Comprehensive business analytics project using Python and Tableau. Features include data visualization, interactive dashboards, and data-driven insights for restaurant performance and consumer behavior.

business-analytics data-visualization python-dashboard restaurant-analysis tableau

Last synced: 27 Jun 2025

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/codeonthespectrum/web-scrap

Este projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro.

data-analysis data-visualization webscraping

Last synced: 16 Feb 2026

https://github.com/balajimohan18/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides

Last synced: 08 Mar 2026

https://github.com/gabboraron/plague-inc

Plague Inc: An Epidemic Forecast Concept and Data Visualization Tool. Previously accessible at http://20.234.177.167/. You are welcome to host it on your own server.

big-data data-mining data-science data-visualization epidemic-simulations hackathon

Last synced: 06 Apr 2025

https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard

This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.

dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit

Last synced: 18 Apr 2026

https://github.com/ancapitigoi/mushrooms-selection

In order to find which mushrooms are safe to eat, the decision tree data mining method is used.

classification-model data-cleaning data-exploration data-mining data-visualization decision-tree penalty-method r-programming

Last synced: 15 Jun 2026

https://github.com/simranshaikh20/credit-card-dashboard

A Data Visualization Project using Microsoft Power bi

data-analysis data-visualization powerbi

Last synced: 02 Jan 2026

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/rsc-labs/see-open-data

Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.

data data-visualization flourish government poland

Last synced: 04 Apr 2025

https://github.com/angchekar28/lung-cancer-prediction

This project builds and compares multiple machine learning models to predict lung cancer based on patient attributes. It evaluates classification models like Logistic Regression, Decision Tree, Random Forest, and SVM for early diagnosis.

data-science data-visualization jupyter-notebook lung-cancer-detection machine-learning model-comparison python

Last synced: 14 May 2026

https://github.com/amitkaps/vaccines

India COVID Vaccines Status Visualisation

data-visualization

Last synced: 25 Jan 2026

https://github.com/ledsouza/dataviz_estilizacao_tabelas

Desenvolver estilizações de tabelas utilizando o objeto Styler do pandas

data-science data-visualization pandas python vitrinedev

Last synced: 09 May 2026

https://github.com/hfagerlund/machine-learning-classifier-iris

Algorithm(s) for identifying/predicting type of iris

data-visualization machine-learning python-script python3

Last synced: 27 Jun 2025

https://github.com/deliprofesor/kizbasina_covid19-topic-modeling-nlp

NLP and topic modeling on COVID-19 scientific papers using LDA, visualizations, and metadata analysis from the CORD-19 dataset.

cord19 covid19 data-cleaning data-visualization gensim lda nlp pandas pyldavis seaborn

Last synced: 14 May 2026

https://github.com/hfagerlund/machine-learning-iris-analysis

No longer maintained. Moved to https://github.com/hfagerlund/machine-learning-classifier-iris/.

data-visualization jupyter-notebook machine-learning python37

Last synced: 22 Jul 2025

https://github.com/sjnims/chartjs-expert

Claude Code plugin for Chart.js v4.5.1 with 12 expert skills, interactive code generation, and React/Vue/Angular/Rails integrations

accessibility chart-js chartjs charts claude-code claude-code-plugin data-visualization javascript ng2-charts rails-chartjs react-chartjs stimulus typescript vue-chartjs

Last synced: 25 Jan 2026

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/jarif87/pokeinsights

A Selenium-Powered Data Scraping and Tableau Visualization Project

data-visualization python scraping selenium tableau

Last synced: 21 May 2026

https://github.com/maazie-khan/austin-housing-insights-powerbi

Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/saniyaacharya04/youtube-trending-video-analyzer

Modular Streamlit dashboard for analyzing trending YouTube videos by views, engagement, and category—powered by the YouTube Data API.

api-analysis clustering-engagement-metrics dashboard data-visualization modular-architecture streamlit trending youtube

Last synced: 21 May 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/omerdduran/riskfactor-heart

This ML project predicts heart disease using logistic regression on the Cleveland Heart Disease UCI dataset, featuring advanced preprocessing and medical feature engineering, achieving 82.1% accuracy with strong cross-validation.

cardiovascular-health data-science data-visualization heart-disease-prediction logistic-regression machine-learning medical-ai scikit-learn

Last synced: 14 May 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/omari-kd/transborder-freight-data-analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi

Last synced: 30 Mar 2025

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/foufou-exe/occitanie-report-rental-yields

This project aims to develop a datavisualization and reporting tool to analyze rental yields in the Occitanie region, for use by real estate investors.

data-visualization jasper java opendata python reporting

Last synced: 22 May 2026

https://github.com/rubayeaalketbi/real-time-text-sentiment-analysis-with-azure-functions

A serverless application that performs real-time sentiment analysis on text messages using Azure Functions.

azure azure-functions data-visualization python sentiment-analysis

Last synced: 22 May 2026

https://github.com/aran203/cricanalytics

ADSC Fall 24 Project for cricket analytics with hawkeye data

data-engineering data-visualization python streamlit

Last synced: 14 May 2026

https://github.com/al-ghaly/hotel-revenue-excel-analysis

Excel Dashboard to analyze data of a hotel over the past three years.

dashboard data-analysis data-visualization excel excel-analysis

Last synced: 02 Jan 2026

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026

https://github.com/casperkristiansson/finance-tracker

A project which solved an issue of mine which was tracking my finance. This Finance Tracking application gives overviews of expenses and income to give its users an easy way to explore their data.

dashboard data-visualization finance-management firebase-auth react

Last synced: 29 Dec 2025

https://github.com/benmar2406/rent-in-germany

Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.

charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte

Last synced: 26 Mar 2025

https://github.com/fazej99/u.s-climate-and-temperature-analysis

This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.

data-analysis data-science data-visualization gis machine-learning streamlit

Last synced: 22 May 2026

https://github.com/sadratehranian/pem-fuel-cell

The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.

anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code

Last synced: 26 Mar 2025