An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/chrka/d3-chessboard-count

Plot per-square frequencies on a chessboard

chess d3 data-visualization

Last synced: 30 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/tashi-2004/global-ecommerce-retail-trends-analysis

The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA, CLV calculation, and What-If analysis. Key insights include effective PCA for data reduction, detailed CLV analysis across segments, and the impact of pricing strategies on sales.

boxplot clv-analysis data-science data-visualization dataintegration deep-learning dimensionality-reduction ecommerce heatmap machine-learning normalization outlier-detection outlier-removal pca-analysis preprocessing python scatter-plot whatif-analysis

Last synced: 30 Apr 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, cleaning and preparing the data, and visualizing the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/mxagar/eda_fe_summary

An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.

data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/fernandesotero/project-data-exploration

Student Performance Prediction with Data Science

data-visualization jupyter-notebook python

Last synced: 30 Apr 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID-19 case numbers and deaths from the 2019/2020 coronavirus outbreak using pandas in a Jupyter notebook.

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/samuelpillai/machine-learning-classification-regression-nlp

A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.

classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining

Last synced: 30 Apr 2026

https://github.com/shefreenkaur/comp_430_project

A comprehensive, open-source business intelligence visualization tool designed for algorithmic trading systems. This application transforms complex trading data into intuitive visualizations, enabling traders and analysts to make data-driven decisions.

algorithmic-trading api-development business-intelligence data-analytics data-visualization etl-pipeline fastapi finance financial-analysis interactive-dashboard plotly streamlit

Last synced: 13 Apr 2026

https://github.com/garcane/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 10 May 2026

https://github.com/vassilevsky/4sq73

A demonstration of the inaccuracy of Foursquare venue coordinates, using Ulyanovsk as an example

data-visualization foursquare-api geo yandex-maps

Last synced: 15 Mar 2025

https://github.com/rayxiang03/indeed-job-scraping

Python toolkit for scraping Indeed job listings, preprocessing data, and generating visualizations for market analysis.

cloudscraper data-visualization indeed job-analysis nlp pandas python web-scraping

Last synced: 30 Apr 2026

https://github.com/ginalamp/covid_dashboard_twitternews

Coronavirus dashboard and report based on news from media outlets on Twitter.

dashboard data-analysis data-visualization twitter

Last synced: 28 Jan 2026

https://github.com/mansogf/datascience_introduction

Introductory data science practice exercises

data-analysis data-science data-visualization graph

Last synced: 04 Apr 2025

https://github.com/anilyigitsel/tourist-attraction-data-analysis

This project analyzes tourism trends from 2017 to 2021, focusing on visitor numbers, ratings, and attraction popularity during these years.

data-analysis data-visualization excel sql tourism

Last synced: 26 Jan 2026

https://github.com/srking501/futurelearn_mooc

A summative coursework for CSC8631 Data Management and Exploratory Data Analysis

crisp-dm data-mining data-preprocessing data-science data-visualization deployment eda exploratory-data-analysis

Last synced: 23 Mar 2025

https://github.com/wattyven/opencovidca-dashboard

A Jupyter notebook for making quick and dirty visualizations of Canadian COVID statistics using data from the OpenCOVID API. (API unfortunately deprecated)

canada-covid-19 covid-19 data-visualization jupyter jupyter-notebook statistics

Last synced: 14 Mar 2025

https://github.com/nahiyanhkhan/sales-insight-dashboard_powerbi

Build a dashboard that displays insights from a company's sales data over a 4-year period, including revenue and sales quantity in different regions over the years.

dashboard data-analysis data-analytics data-visualization powerbi salesdashboard

Last synced: 08 Jan 2026

https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql

Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.

analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language

Last synced: 16 Oct 2025

https://github.com/atharva309/car-sales-dashboard-powerbi

An advanced and interactive dashboard for car sales and more, built with PowerBI

dashboard data-visualization dax-query interactive-visualizations powerbi

Last synced: 11 Jan 2026

https://github.com/vinitgurjar/r_lang_exp

This is a collection of my college Data Analytics lab work and assignments; the files here contain programs written in R.

data-analysis data-visualization r

Last synced: 02 Jul 2025

https://github.com/shahbajtahershezan/heart-disease-prediction-ann

Heart Disease Prediction using ANN and ML models. This project performs detailed EDA on heart disease data and trains ANN, Random Forest, Decision Tree, and SVM models to predict heart disease likelihood using features like age, cholesterol, and blood pressure, with visualizations and performance metrics for evaluation.

classification confusion-matrix data-visualization deep-learning-algorithms f1-score heart-disease-prediction keras machine-learning-projects neural-network precision recall tensorflow

Last synced: 14 Apr 2026

https://github.com/wai-l/dataviz

Data visualisation with Excel, R and other tools.

data-visualisation data-visualization data-viz datavisualisation datavisualization excel ggplot2

Last synced: 05 Jan 2026

https://github.com/rgzafra11/excel_sales_analytics

This repository contains a comprehensive business intelligence report for AtliQ Hardware, focusing on sales performance and strategic insights. Explore data-driven analytics to enhance product offerings and optimize sales strategies for improved profitability.

businessanalytics businessinsights data-visualization dataquality eda excel excel-dashboard jupyter-notebook kpi pivot-table powerquery recommandation sales-insights seaborn

Last synced: 01 Jul 2025

https://github.com/adilshamim8/eda-on-health-and-sleep-data

Exploratory Data Analysis (EDA) on health and sleep data, uncovering patterns and insights using Python and visualization tools.

data-analysis data-visualization eda health healthcare sleep sleep-analysis

Last synced: 15 Mar 2025

https://github.com/darksoulnelson/json-to-excel-converter

This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.

automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools

Last synced: 05 Jul 2025

https://github.com/rios0rios0/investmate

Go-based application designed to scrape and analyze ETF (Exchange-Traded Fund) data, focusing on dividend cash amounts, average closing prices, and dividend yields over a specified number of years. The application uses the colly library for web scraping and the tablewriter library for displaying the data in a formatted table.

crawling data-visualization etf-investments financial-analysis golang

Last synced: 23 May 2026

https://github.com/neerajcodes888/data-science

This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science

data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit

Last synced: 01 May 2026

https://github.com/darkdk123/simple-heart-disease-classification

This experiment takes a comprehensive approach to forecasting heart disease risk through detailed data analysis, predictive modeling, and hyperparameter tuning. The result is a `LinearSVC` model with 90% accuracy.

classification-algorithm data-science data-visualization exploratory-data-analysis heart-disease-prediction machine-learning

Last synced: 17 Nov 2025

https://github.com/rajatdiptabiswas/iris-flower-dataset

Trying out data visualization and data science on the iris flower dataset

data-science data-visualization iris iris-dataset

Last synced: 15 Mar 2025

https://github.com/chrispsang/customerchurnanalysis

Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.

customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau

Last synced: 31 Jan 2026

https://github.com/ezrahsieh/academicdatabasedashboard

The Academic Faculty and Research Insight Dashboard utilizes SQL and NoSQL databases and is designed to support academic institutions, research departments, and individual students by providing comprehensive insights into faculty members and their research activities.

dashboard data-visualization database-management mongodb mysql neo4j sql

Last synced: 21 Feb 2026

https://github.com/adriangalvanzamora/ecommerce-analytics-olist

Data analysis project based on the Olist Brazilian E-Commerce dataset. Includes data cleaning, exploratory analysis, delivery performance metrics, customer satisfaction modeling, and geospatial insights. Built entirely in Python (Jupyter Notebook) using real-world data from Kaggle.

brazil customer-satisfaction data-analysis data-visualization ecommerce folium geospatial-analysis machine-learning matplotlib notebook pandas plotly python seaborn

Last synced: 06 May 2026

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 24 Mar 2025

https://github.com/katiesaund/jpm_healthcare_2019

Analysis of healthcare stock changes over JPM week versus rest of the year.

biotech data-science data-visualization finance jpmorgan r rstats stock

Last synced: 24 Mar 2025

https://github.com/gthomas08/data-visualization-project

Project to download and store files from a site and extract the necessary diagrams using Python libraries.

data-visualization python

Last synced: 14 Mar 2025

https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python

This project analyzes earthquakes in Turkey from 1915 to 2021. The data, taken from the data set included in the repo, was first organized using data cleaning algorithms. The cleaned data was then rendered as graphics and animations using data visualization algorithms.

data-analysis data-cleaning data-visualization jupyter-notebook

Last synced: 23 Mar 2025

https://github.com/eea/eea.reveal

Reveal hidden knowledge by visualizing network structure in your data.

data-analysis data-visualization graphviz network-visualization

Last synced: 18 Mar 2025

https://github.com/mmartin46/county-health-findings-project

Analyze the data set provided by United Health Group (UHG) to determine the impact of race, social, and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/victorlcastro-dsa/coping_struggles_prediction

Repository for predicting coping struggles based on mental health data. Includes analysis, visualization, and modeling using machine learning. Results reach 86.58% accuracy with a Voting Classifier.

classification-algorithm data-analysis data-science data-visualization machine-learning-algorithms problem-solving project-based-learning python

Last synced: 19 Apr 2025

https://github.com/mzprog/datatables

Create better tables with a few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 27 Apr 2026

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/relostar-devil/analyzing-naming-trends-using-python

Analyzes naming trends by processing baby names data from the Social Security Administration (SSA)

data-visualization python

Last synced: 30 Apr 2026

https://github.com/binjewarkunal/top-10-worlds-largest-economy-analysis

Top 10 largest economies by GDP and GDP per capita. Data collection: Forbes India. Data visualization: a horizontal bar chart, with documentation.

data-visualization datacollection microsoft-excel notion

Last synced: 27 Mar 2026

https://github.com/cube-m4st3r/sphere.discord-bot

The main Discord frontend project for sphere, intended for general-purpose commands and visualization.

data-visualization discord-bot discord-py discord-py-bot python3

Last synced: 16 Jan 2026

https://github.com/37743/ml-starterkit

This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.

data-preprocessing data-visualization exploratory-data-analysis machine-learning python

Last synced: 07 Apr 2025

https://github.com/abhiram-kandiyana/us-bikeshare-analysis

Exploratory analysis of a bike-share system (Motivate) to understand its pain points

data-analysis data-visualization

Last synced: 26 Mar 2025

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 20 Jan 2026

https://github.com/danielrosehill/data-projects-index

Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.

data-analysis data-science data-visualization

Last synced: 16 Mar 2026

https://github.com/ledsouza/curso_de_estatistica_parte_3

Statistics course project on distributions and hypothesis testing

data-science data-visualization pandas scipy seaborn statsmodels vitrinedev

Last synced: 29 Apr 2026

https://github.com/the-clone-xyz/stats-lapas-pakam

Automated visualization of inmate data by gender at Lubuk Pakam Prison (Lapas Lubuk Pakam), using BPS Deli Serdang data, via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/sumanadithan/react-admin-dashboard

A modern React Admin Dashboard built with React, TypeScript, Vite, Tailwind CSS, Framer Motion, TanStack Table, Zustand, and Recharts. It features fast performance, responsive design, dynamic data handling, and testing with Vitest.

admin-dashboard data-visualization framer-motion frontend react recharts responsive-design state-management tailwind-css tanstack-table testing typescript vite vitest web-development zustand

Last synced: 12 Apr 2026

https://github.com/riju18/data-analysis-and-visualizaton

Complex data analysis for data science: clustering, data preparation, complex calculations, joins, cross-overs, and more.

data-analysis data-mining data-science data-visualization powerbi tableau

Last synced: 04 Jan 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

In this project I apply data preprocessing techniques to clean the dataset using libraries, derive insights and analyses to find the hot picks among shopping items, and use data visualization libraries to surface trends and other aspects, as a small contribution to the field of data science.

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/lytello/data-visualizations

Assortment of data visualizations I have created

data-visualization r

Last synced: 28 May 2026

https://github.com/abdul-aa/drug-sentiment-analysis

Extracting Themes and Sentiments in Birth Control Drug Reviews with DistilBert and LDA Topic Modeling

data-visualization distilbert lda-topic-modeling natural-language-processing sentiment-analysis tableau

Last synced: 22 Apr 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analyses of NASDAQ data, such as a dashboard and a time series forecast.

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/gitchaell/computer-scrapping

Tool that extracts data from the web pages of companies that sell computers in the city of Trujillo, Peru, exports it to an XLSX file according to a relational data model, and displays it on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026

https://github.com/fazatholomew/marlboroplan

This project aims to contribute to a more inclusive sustainable energy program in Massachusetts. It is part of my work for a nonprofit organization called All In Energy and of the undergraduate thesis for my degree.

data-analysis data-visualization energy jupyter-notebook massachusetts python

Last synced: 01 May 2026

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://github.com/fbarffmann/tornado-damage-dashboard

Built a Flask dashboard visualizing 1,000+ US tornadoes from 2023 using Leaflet.js and MongoDB. Interactive maps show tornado magnitude, damage, and frequency.

api data-visualization flask geospatial leaflet mongodb pandas python tornado-dashboard

Last synced: 11 Apr 2026

https://github.com/balajimohan18/loan-classification-datascience-project

This project uses machine learning algorithms to predict loan status. The dataset is loaded and transformed using SQL to produce a proper dataset containing valid information.

classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning

Last synced: 03 Sep 2025

https://github.com/hfzdzakii/dicoding-solvinghrproblem

This repo is the master submission for my Dicoding Final Project. The Employee Attrition & Performance dataset was used for the submission. Feel free to explore; I hope my work gives you some insight!

data-analysis data-visualization

Last synced: 16 May 2025

https://github.com/jimohola/streamlit_ml

How to build a Web App using Streamlit for Machine Learning Algorithms

data-visualization exploratory-data-analysis machine-learning streamlit webapp

Last synced: 09 May 2026

https://github.com/shafaq-aslam/data-analytics-dairy

A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.

data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau

Last synced: 20 Jan 2026

https://github.com/misaghmomenib/shop-revenue-analysis

A data analysis project aimed at analyzing and forecasting shop revenue based on sales and other business metrics. It helps identify trends, patterns, and key factors influencing revenue to support data-driven decisions for business growth.

data-analysis data-visualization python

Last synced: 24 Mar 2025

https://github.com/corndogit/dataspaceart

A generative art project which generates stylized patterns from weather data

data-visualization python weather

Last synced: 06 Oct 2025

https://github.com/noturlee/iris-dataanalyis

This project aims to classify Iris flowers into three species—setosa, versicolor, and virginica—based on their sepal and petal measurements using machine learning techniques. The dataset comprises 150 samples evenly distributed among these species

data-analysis data-modeling data-science data-structures-and-algorithms data-visualization

Last synced: 08 Apr 2025

https://github.com/noturlee/sales-dataanalysis

This project aims to predict product sales based on advertising expenditures, focusing on 'TV advertising'. Machine learning techniques are employed to analyze and interpret data, enabling businesses to optimize advertising strategies and maximize sales potential.

data-modeling data-science data-structures-and-algorithms data-visualization linear-regression

Last synced: 08 Apr 2025

https://github.com/codyguru/energy-monitoring-dashboard

Energy monitoring dashboard that helps the client quickly find the sensor with an issue

data-visualization mock-server react tailwindcss typescript

Last synced: 10 Apr 2026

https://github.com/dheyhasan/echo-trends

EchoTrends is a data visualization app that analyzes your Spotify playlists and reveals insightful patterns—such as track duration, popularity, and statistical correlations—using interactive charts and statistical tests. Built with React (frontend) and FastAPI (backend), it offers both functional analysis and a demo landing

correlation-analysis data-visualization fastapi javascript music-analysis python react recharts spotify-api tailwindcss

Last synced: 11 Apr 2026

https://github.com/pythoncoderunicorn/gi-joe

Dataset of GI Joe action figures from the 1980s and 1990s. My dataset for TidyTuesday

data-science data-visualization gi-joe r toy-project

Last synced: 27 May 2026

https://github.com/ledsouza/dataviz_vendas_lojas

Data visualization project for making comparisons and showing distributions

data-science data-visualization matplotlib pandas pyplot python seaborn vitrinedev

Last synced: 11 Apr 2026

https://github.com/stopyransky/wdvp

World Government Data Visualisation Prize - submitted work

d3 d3js data-visualization dataviz react svg

Last synced: 18 May 2026