An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 20 Jan 2026

https://github.com/virajbhutada/coursera-google-data-analytics-capstone

A repository containing the Capstone project for the Google Data Analytics Professional Certificate, focusing on analyzing FitBit fitness tracker usage data to derive insights relevant to Bellabeat, a wellness technology company.

coursera data-analysis data-visualization fitbit fitness-tracker google google-data-analytics-capstone-project python-programming

Last synced: 22 Nov 2025

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 11 Mar 2025

https://github.com/seroviicai/pizza_analytics

The objective of this mini project is to predict how many ingredients and of what kind would be needed in the near future.

data-analysis data-science data-visualization pandas python tutorial

Last synced: 10 Jun 2025

https://github.com/czoido/conan-graph3d

Generate a Conan dependency graph visualization in 3d

conan cpp data-visualization

Last synced: 20 Jan 2026

https://github.com/jorgeatgu/apaga-luz

💡 ¿Cuánto cuesta la luz? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/guruakaashjn/te_project_microsoft_ai

AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/sreejabethu/sales-data-analysis-forecasting

Welcome to the Sales Data Analysis & Forecasting project! 🚀 This repository showcases my data analysis skills through exploratory data analysis (EDA), data cleaning, and visualization of sales and customer feedback data. The goal is to extract actionable insights to drive business decisions.

analysis barchart data-visualization datacleaning exploratory-data-analysis forecasting histogram matplotlib-pyplot numpy-library pandas-library pycharm-ide sales-analysis salesdata salesdataanalysis seaborn-plots transformation

Last synced: 31 Aug 2025

https://github.com/prem-ium/tax-merge

Consolidate multiple 1099 brokerage tax forms, visualize realized gains/losses, and track data for Fidelity, Chase, Vanguard, Schwab, and others!

1099 accounting chase data-science data-visualization data-wrangling desktop-application fidelity financial-analysis firstrade graphical-user-interface schwab tastytrade tax tax-forms taxes vanguard

Last synced: 02 Feb 2026

https://github.com/noturlee/imdb-dataanalysis

A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors. Using regression techniques to tackle this problem.

data-analysis data-cleaning data-modeling data-science data-visualization

Last synced: 08 Apr 2025

https://github.com/affec-ds/dashboard-ventas-vinilos

Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.

business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records

Last synced: 24 Mar 2025

https://github.com/vietdoo/real-estate-marketplace

a webapp that can visualize homes for sale on a cluster map. Data is continuously fetched from MongoDB, build filtering functions and APIs to find homes.

api bigdata data-visualization flask maps mongodb reactjs webscraping

Last synced: 31 Mar 2025

https://github.com/GZ430/global-christianity-dataviz-jp

A web app built in R Shiny for users to explore global Christianity data from Joshua Project, World Watch List, and others.

christianity data-visualization r shiny

Last synced: 10 Mar 2025

https://github.com/fatihilhan42/eda-on-data-science-salary-with-python

You can access the files of this project, which analyzes people working in the field of data science according to countries and working wages.

analysis data-science data-scientists data-visualization jupyter-notebook pyhton salary

Last synced: 23 Mar 2025

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions will be crucial in optimizing the user experience, reducing churn, and maximizing customer lifetime value. By using Machine learning model to predict the Customer Churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 11 Jun 2025

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project

This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.

apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres

Last synced: 03 Jul 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/reubenmathew/dataopsexperiments

Machine Learning, NLP and various data operations

data-science data-visualization python3

Last synced: 22 Apr 2025

https://github.com/natanast/immunovisual

A collection of charts made with the R programming language, focusing on immunogenetics analyses. Different charts types are being organized into multiple sections, each accompanied by its reproducible code. The gallery spotlights the utilization of prominent R packages such as tidyverse, data.table, and ggplot2.

data-visualization ggplot2 quarto r-programming

Last synced: 11 Mar 2026

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 30 Mar 2025

https://github.com/nrobledosagredo/lda-topic-analysis-news-chile

Topic analysis on news from Chile using LDA for extracting and visualizing relevant patterns.

data-visualization latent-dirichlet-allocation

Last synced: 16 Mar 2025

https://github.com/sathyasris27/time-series-and-spectral-analysis-

The aim of this project involves the analyses the data, removing trends and seasonal effects, identifying the underlying process, understanding the dominant frequencies, and using the residuals to make predictions.

data-analysis data-visualization forecasting r spectral-analysis time-series-analysis

Last synced: 27 Feb 2025

https://github.com/aubainmbk/optimisation-ventes-supermarche

Notre objectif est de ressortir des informations grâce aux données de ventes d'un Supermarché.

data-visualization dataanalysis excel powerbi

Last synced: 04 Feb 2026

https://github.com/qtle3/decision-tree-regression

This project implements **Decision Tree Regression** to predict the salary of an employee based on their position level. By using a dataset containing position levels and their corresponding salaries, the project highlights how decision trees can capture complex relationships in data through non-linear regression.

data-visualization decision-tree-regression prediction-model

Last synced: 08 Apr 2025

https://github.com/vara-co/python-api-challenge

Weather and Perfect Vacationing Spots Worldwide, by using APIs

api apis data-analysis data-visualization hvplot jupyter-notebook matplotlib pandas vacation weather

Last synced: 28 Mar 2025

https://github.com/ksmooi/mscs_data_visualization

This series of reports demonstrates the process of creating, refining, and evaluating interactive data visualizations using Altair, focusing on Seattle weather data, with an emphasis on accessibility, usability, and effectively communicating insights.

altair data-visualization

Last synced: 01 Apr 2025

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 12 Jun 2025

https://github.com/ineelhere/shinyDwight

A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office

bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components

Last synced: 01 Apr 2025

https://github.com/dadosdelaplace/hilostwitter

Códigos R de los hilos de Twitter (R codes of Twitter activity) https://twitter.com/dadosdelaplace

data-visualization dataviz divulgacion ggplot2 maths plotly r rstats statistical-analysis threads twitter

Last synced: 11 Mar 2026

https://github.com/samuelbarbosadev/roof_imoveis_data_analysis

The company hired you because they want to know what would be the 5 properties they should invest in and why, and which 5 you would not recommend investing in at all.

data-preparation data-understanding data-visualization pandas python

Last synced: 12 Apr 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repositorio de manuales y recursos del entrenamiento "Data Scientist en Python" realizado por Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 26 Feb 2025

https://github.com/mohagungnursalim/da-covid-data-cleansing

Identifying and correcting errors, inconsistencies, and inaccuracies within datasets related to the COVID-19 pandemic. This is essential for obtaining reliable and actionable insights from the data.

data-analytics data-visualization pandas-dataframe

Last synced: 25 Mar 2025

https://github.com/ax-va/interactive-data-visualization-with-d3-dale-2023

These examples on Interactive Data Visualization with D3.js in the web browser are compiled with some modifications from the book "Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data", Second Edition, written by Kyran Dale and published by O'Reilly Media in 2023.

ax-va d3 data-science data-visualization dataviz frontend javascript web

Last synced: 12 Jun 2025

https://github.com/yusuf-abol/alumni-interaction-and-conversation-dynamics-nlp

This Natural Language Processing (NLP) project took a dive into chat engagement dynamics within the University of Ilorin’s Class of 2018 Statistics alumni group. By applying Latent Dirichlet Allocation (LDA) for topic modeling and network analysis, I uncovered communication patterns, topic distributions, and member interactions.

alumni-network anonymization conversation data-science data-visualization engagement machine-learning network-analysis nlp python-3 sentiment-analysis statistics whatsapp

Last synced: 09 Apr 2025

https://github.com/kurosawaxyz/covid4eu-sorbonne

Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.

covid-19 data-analysis data-science data-visualization pandas

Last synced: 04 Jul 2025

https://github.com/avallecam/cdcper

Miscelanea de funciones customizadas a tareas de análisis en CDC Perú

data-manipulation data-mining data-visualization data-wrangling r tidyverse

Last synced: 29 Nov 2025

https://github.com/alexeyraspopov/vepr

🚧 Visualization protocol for complex and sizable data visualizations

data-visualization zero-dependency

Last synced: 04 Jul 2025

https://github.com/rani-sikdar/pwc-virtual-internship-powerbi

Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.

business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi

Last synced: 07 Jan 2026

https://github.com/cpscript/github-profile-monitor

A system for tracking GitHub followers with automated data collection, change detection, and a sleek dashboard interface. Using GitHub Actions for automation and GitHub Pages for hosting.

data-visualization github-actions github-analytics github-pages json-api responsive-design web-dashboard

Last synced: 04 Sep 2025

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 02 Apr 2025

https://github.com/geetisha/sales_insight_data_analysis_using_sql_and_tableau-etl-

Sales Insights - A Data Analysis Project performed on Tableau & SQL Topics

dashboard data-analysis data-visualization mysql project sales-analysis sql tableau

Last synced: 07 Jan 2026

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/aromoh/data-visualization-nobel-laureates

Data Visualization using a interactive dashboard create in Tableau. Data set has been obtained in Kaggle

dashboard data-visualization kaggle-dataset nobel-laureates tableau

Last synced: 07 Jan 2026

https://github.com/brianruizy/data-visualization

Personal track demo of pure Python data visualizations

data-visualization matplotlib pandas plotly python seaborn

Last synced: 27 Apr 2026

https://github.com/vincent-tran-94/dataviz_tweets_chatgpt

Une application Streamlit pour analyser et visualiser les données et les tweets sur la sortie de ChatGPT. Ce projet comprend la gestion des données, l'analyse des sentiments, les tendances émergentes et les applications potentielles de ChatGPT.

data-management data-visualization python streamlit text-mining twitter

Last synced: 08 Nov 2025

https://github.com/webuccinoco/mysql-pivot-tables

Build complex MySQL pivot tables without touching a single line of code. This free PHP tool lets you visually connect your database and map out your data sources with a few simple clicks.

business-analytics business-intelligence crosstab data-analysis data-analytics data-visualization mysql mysql-database mysql-pivot-table mysql-reports mysql-virtualization php php-pivot-table php-reports pivot-tables reporting-tools

Last synced: 04 Feb 2026

https://github.com/gappeah/nike_web_crawler

This project involves web scraping Nike's product pages to extract product names, prices and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.

beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup

Last synced: 04 Jul 2025

https://github.com/asifdotexe/air-quality-analysis-aqa

AQA is a data-driven project focused on analyzing air quality data sourced from data.gov.in. The project encompasses data preprocessing, analysis, and visualization to gain insights into air pollution levels across various locations in India. By examining six key pollutants, the project aims to raise awareness about the environmental issues

aqi-analysis data-analysis data-preprocessing data-science data-visualization presentation

Last synced: 02 Dec 2025

https://github.com/wadiebenabdouh/socialmedia-usage-pipeline

Data from Kaggle, containing wide range of users with different age, gender, and interest.

apache-spark data-visualization jupyter-notebook kaggle pyspark python spark

Last synced: 11 Mar 2026

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 13 Apr 2026

https://github.com/mxagar/statistics_with_python_coursera

My personal notes done while following the Coursera Specialization "Statistics with Python", from the University of Michingan, hosted by Dr. Brenda Gunderson.

data-modeling data-science data-visualization hypothesis-testing machine-learning pandas python statistics

Last synced: 09 Apr 2025

https://github.com/imageomics/emb-explorer

An interactive tool for classifying images with a pretrained model and exploring clustering results in 2D space.

clustering data-visualization dimensionality-reduction eda embedding exploratory-data-analysis exploratory-data-visualizations image-exploration

Last synced: 03 Mar 2026

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Apr 2026

https://github.com/lebrancconvas/streaming-service-price

Visualize the price from Streaming Service that available in Thailand.

data-visualization price-comparison streaming-video

Last synced: 13 Jun 2025

https://github.com/owengombas/r_place

🖼 Data analysis on r/place 2022

data-science data-visualization rplace

Last synced: 18 Jan 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/guidodipietro/fmcranks

Raw text input to nicely formatted XLSX file + PNG image of the range

data-analysis-python data-visualization python

Last synced: 26 Jan 2026

https://github.com/rayyan9477/coin-detection-project

This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.

computer-vision data-analysis data-science data-visualization machine-learning notebook python

Last synced: 27 Feb 2025

https://github.com/hhejo/pyviz

Data visualization using Python

data-visualization python

Last synced: 02 Apr 2025

https://github.com/shiweihe0713/data-science-for-business-techincal

TECH-GB_2336 will teach you how to think about data based problems in the business world through the lens of data analytics. We will focus on data-analytic thinking, how to approach problems, how to develop insights using data, how to apply machine learning and other analytic techniques...

data-science data-visualization nyu pandas python stern

Last synced: 02 Apr 2025

https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis

End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.

data-analysis data-visualization powerbi sql sql-server

Last synced: 16 Mar 2025

https://github.com/michelmf/projeto_tb

Applying machine learning models to detect tuberculosis during screening process.

boosting-tree classification-model data-mining data-visualization machine-learning python tuberculosis

Last synced: 06 Jul 2025

https://github.com/tejashah88/Visualflow-Unity

Visualizing Vitech insurance data with Unity and the Leap Motion (for HTN 2017).

data-visualization leap-motion unity unity3d virtual-reality vitech vr

Last synced: 11 Mar 2025

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/vi/csvdimreduce

Command-line tool to run a dimensionality reduction algorithm on CSV files

cli command-line-tool csv csv-files data-science data-visualization dimension-reduction dimensionality-reduction

Last synced: 14 Apr 2025

https://github.com/chauxvive/uschildpoverty

An interactive choropleth map visualizing U.S. state-level child poverty data using D3.js. Compare child poverty rates over time with data from KIDS COUNT and the US Census Bureau.

choropleth-map d3 d3js data-visualization dataviz

Last synced: 01 Mar 2025

https://github.com/2kabhishek/pyestate

Real estate price analysis 🏠💰

analysis data-visualization finance mapbox panel plotly pyviz real-estate

Last synced: 01 Mar 2025

https://github.com/gracysapra/heart-disease-prediction-using-logistic-regression

This project uses Logistic Regression to predict the likelihood of heart disease based on medical attributes such as age, cholesterol levels, and blood pressure. It includes model training, evaluation, and an interactive Gradio interface for real-time heart disease risk prediction.

classification data-preprocessing data-science data-visualization gradio-interface heart-disease-prediction logistic-regression machine-learning

Last synced: 27 Feb 2025

https://github.com/ssreeramj/binod-detector

Scrapes comments of a youtube video and shows distribution of comments having 'binod' in it

data-visualization heroku python streamlit youtube-api

Last synced: 21 Feb 2025

https://github.com/vetrivel07/data-visualization-portfolio

This repository showcases my Data Visualization projects using Power BI and Tableau, along with Python-based exploratory analysis. It includes dashboards, data storytelling, and business insights

dashboard data-cleaning data-schema data-visualization tableau

Last synced: 02 Mar 2026

https://github.com/jubinjacob03/heartdiseaseclassify-ml

Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.

data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine

Last synced: 18 Jan 2026

https://github.com/jubinjacob03/dvttviz_eqdataset-reactjs

Data Visualization Tools & Techniques - ReactJS project with Tableau Public APIv3 on Earthquake Dataset for analysis of seismic waves and activity

api data-visualization javascript kaggle-dataset reactjs tableau

Last synced: 13 Jul 2025

https://github.com/Rayyan9477/Calorie-Burnage-Exploratory-Data-Analysis

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 29 Apr 2025

https://github.com/is-leeroy-jenkins/sherpa

A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts

budget-management data-analysis data-science data-visualization federal-government

Last synced: 01 Mar 2025

https://github.com/epfromer/x2-angular

Angular app for data visualization and searching of Enron email via x2 server.

angular auth0 data-visualization graphql

Last synced: 13 Apr 2026

https://github.com/emilhvitfeldt/ggtetris

Create Tetris Chart Visualizations in R

data-visualization datavisualization dataviz ggplot2 r rstats

Last synced: 05 Apr 2025

https://github.com/paumatas/dpa-visual-analytics-tool

Visual analysis application for the data collected by a Formula Student car during driving, aiming to evaluate and compare the drivers of the team.

car-telemetry data-driven-decisions data-science data-visualization formula-student motorsport sports-analytics user-interface

Last synced: 17 Mar 2025