An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/fatihilhan42/cyclistic_bike_share_data_analysis

This repo contains the Google Data Analytics Capstone - Case Study 1 project, which is the final stage of the Google Data Analytics course on Coursera. The description of the code and analysis is posted on my Kaggle account. I hope this repo will help everyone who wants to do this project. Thanks.

bike-share capstone-project cyclistic data-science data-visualization google rprogramming

Last synced: 25 Jun 2025

https://github.com/karthikarajagopal44/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

data-cleaning data-visualization matplotlib numpy pandas python python3 scikit-learn seaborn

Last synced: 07 Apr 2026

https://github.com/autumnchris/treemap-sales-diagram

A D3.js treemap built in React.js that presents the top 100 sold video games grouped by their associated gaming platform.

babel css3 d3 d3-js d3js data-visualization freecodecamp javascript react reactjs sass scss treemap treemap-diagram treemap-diagram-challenge treemap-sales-diagram webpack

Last synced: 07 Apr 2026

https://github.com/drkenreid/steam-stats-visualized

🎮 Spotify Wrapped, but for your Steam library. Paste your profile, get roasted.

data-science data-visualization gaming plotly portfolio python steam steam-api streamlit

Last synced: 16 May 2026

https://github.com/julianjuko/subset-prompter

Reduce large datasets down to unique subsets - quickly.

data-structures data-visualization

Last synced: 11 Apr 2025

https://github.com/abhishekyadav915/data-analytics-projects

This project focuses on performing comprehensive data analysis to extract valuable insights from a given dataset. By leveraging various data manipulation, cleaning, and visualization techniques, the project aims to uncover patterns, trends, and correlations that can inform decision-making and strategy.

data-analysis data-visualization dataset

Last synced: 05 Apr 2025

https://github.com/sitek94/react-d3-bar-chart

A bar chart made with React and D3.

d3 data-visualization react

Last synced: 10 May 2026

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/lucycatherine/healthinsuranceproject

This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.

data-analysis data-science data-visualization jupyter-notebook machine-learning python

Last synced: 18 May 2026

https://github.com/stam/corona-map

Corona measures visualized per country

corona data-visualization ecdc

Last synced: 11 Apr 2025

https://github.com/htsandaruvan/attrition-analytics-suite-by-hello-green

I have created a comprehensive data analytics dashboard to identify factors contributing to attrition.

data-analysis data-analytics data-visualization powerbi

Last synced: 20 Jan 2026

https://github.com/sreyashidey/scrape-analyze-visualize

A project for web scraping, data analysis, and visualization using Selenium, BeautifulSoup, and Python.

bs4 data-visualization selenium

Last synced: 03 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfield and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/eins51/moiveanalytics

Comprehensive analysis of movie data using Python and Tableau. Features interactive dashboards and insights into content distribution, genre trends, and platform preferences in the streaming industry.

data-visualization movie-analysis python streaming-platforms tableau-dashboard

Last synced: 16 May 2026

https://github.com/niniola-creator/niniola-creator

This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.

bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql

Last synced: 07 Apr 2026

https://github.com/lucashomuniz/project-15

[Dashboard] Enhancing Business Intelligence: Leveraging SQL, Python, and DAX for Strategic Insights in Sales Analysis

business-analytics business-intelligence data-analysis data-science data-visualization dax-languague machine-learning powerbi python

Last synced: 12 Jul 2025

https://github.com/madhuresh2011/kulturehire-internship

☺️ Hi folks, during my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql

Last synced: 17 Feb 2026

https://github.com/gfav-cybergeek/prodigy_ml_01

A linear regression model to predict house prices based on square footage, number of bedrooms, and bathrooms. Includes feature engineering, preprocessing, and model evaluation.

ai airtificialintelligence algorithms algorithms-and-data-structures data-structures data-visualization jupyter jupyter-notebook jupyterlab machine-learning machine-learning-algorithms machine-learning-models python

Last synced: 05 Apr 2025

https://github.com/cyprianfusi/world-happiness-report-for-2015-2019

World Happiness Report for 2019 with strange and unexpected results for Sub-Saharan African countries! But it's the data speaking...

data-visualization pandas-python

Last synced: 21 Mar 2025

https://github.com/namratagulati/fraud_detection

A fraud detection model built on linear regression, using feature scaling and feature engineering, and tested with the AUC-ROC curve and other metrics.

data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python

Last synced: 04 Jun 2026

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/cyprianfusi/uk-covid-19-data-via-opendata-api

Includes a recommendation to the UK government to halt all mandatory testing: tests should only be conducted on patients as part of diagnosis and treatment. When the prevalence of the disease is low, most positive test results are false positives, owing to irreducible error in the test.

api covid-19 data-visualization pandas-python uk

Last synced: 21 Mar 2025

https://github.com/gohsato/playlistobservatory

Spotify Playlist Visualizer. Demo: https://gohsato.github.io/PlaylistObservatory/

data-visualization spotify spotify-api

Last synced: 04 Apr 2025

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/jarif87/pokeinsights

A Selenium-Powered Data Scraping and Tableau Visualization Project

data-visualization python scraping selenium tableau

Last synced: 21 May 2026

https://github.com/emilyjspencer/coronavirus-daily-deaths-chart.js

Visualizing the UK's daily deaths from Coronavirus between March and June 2020

chartjs data-visualization

Last synced: 04 Apr 2025

https://github.com/alfioma/ada-xtq

🔗 Simplify data transfer with ada-xtq, a lightweight tool for seamless integration and efficient handling of data between platforms.

ada algorithms api-development artificial-intelligence automation data-analysis data-visualization docker machine-learning neural-networks open-source programming python software-development xtq

Last synced: 01 May 2026

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots in R. How to analyze and visualize data with the help of funny cats with the "ggcats" package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 22 Jul 2025

https://github.com/radhikareddy-chintareddy/big-data-analysis-ny-weather-air-quality-2022

End-to-end workflow showcasing database setup, API development, and interactive data retrieval of large datasets. Includes integration and analysis of 2022 SURFACE HOURLY weather data (global, US, and NY) merged with NY air pollution data from the EPA to uncover actionable insights.

big-data-analytics data-integration data-visualization flask-restful jupyter-notebook pymysql python

Last synced: 18 May 2026

https://github.com/vit0r/trino-datavirtualization

Trino POC with several catalogs: MariaDB, PostgreSQL, MongoDB, and MinIO

data-visualization

Last synced: 07 Mar 2026

https://github.com/prakhar-code/netflix_dashboard

Analysis of Netflix titles based on their upload history, distribution, ratings, etc. The dashboard contains filters to give specific information on a movie or TV show.

data-cleaning data-visualization excel tableau-dashboards tableau-public tableau-visualization

Last synced: 17 Feb 2026

https://github.com/m-dadej/excess_deaths_poland

Estimation of excess deaths during COVID-19 pandemic in Poland

covid-19 data-science data-visualization rstats time-series

Last synced: 14 May 2026

https://github.com/mfakhriazhar/housing-price-analysis

Determining the price of a house also depends on various factors such as building area, exterior quality, and amenities. This dataset provides information on properties for sale, and through Exploratory Data Analysis (EDA), patterns and key factors affecting house prices can be identified.

data-analysis data-science data-visualization eda exploratory-data-analysis python

Last synced: 16 May 2026

https://github.com/kylemit/livedataisbeautiful

A casual attempt at data visualizations

data-visualization highcharts

Last synced: 20 May 2026

https://github.com/andersoncrs/analisis-de-texto-tweets

In this project I explore text analysis of tweets to discover trends, opinions, and relevant topics on social networks. Using natural language processing tools, I turn large volumes of messages into clear and visually appealing information.

data-analysis data-visualization eda text-mining

Last synced: 21 Jul 2025

https://github.com/nadamarei/data-analyzer

The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns.

data-analysis data-visualization python-3 streamlit

Last synced: 18 May 2026

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 30 Jun 2026

https://github.com/nadahamdy217/skincaresentinel

This project analyzes customer feedback for skincare products by predicting sentiment using an unsupervised model. It includes a web application for real-time sentiment analysis, an ETL pipeline built with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics, and a Power BI dashboard for visualizing review trends.

azure customer-feedback data-engineering data-science data-visualization database databricks etl-pipeline flask machine-learning powerbi python sentiment-analysis synapse-analytics unsupervised-learning web-application

Last synced: 07 Apr 2026

https://github.com/j5py/py4e

Python for Everybody Specialization (from University of Michigan on Coursera).

api data-visualization json python sql sqlite xml

Last synced: 05 May 2026

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn

Last synced: 13 Apr 2025

https://github.com/usman619/data-science-project

COVID-19 detection using Supervised Learning and Data Science techniques.

data-science data-visualization elt machine-learning

Last synced: 11 Jun 2026

https://github.com/katarinatmb/serbia-protest-analysis

This project analyzes the frequency, regional distribution, and group characteristics of protests that emerged across Serbia following the fatal collapse of the Novi Sad train station roof in November 2024. The analysis explores how different communities responded in the aftermath of the disaster, using data visualization in RStudio.

data-analysis data-visualization r r-mark rstudio

Last synced: 10 Jul 2025

https://github.com/kishorereddypudi/pizza-sales-data-analysis

Pizza Sales Analysis project demonstrates proficiency in using SQL and Power BI to analyze and visualize data effectively.

dashboard data-visualization dataanalysis database powerbi salesanalysis sql

Last synced: 28 Mar 2025

https://github.com/ireneflorez/exploration_r

Data exploration on the 'White Wine Quality' dataset using R

data-analysis data-visualization r

Last synced: 16 Jun 2026

https://github.com/annaanastasy/regression-project-flood-prediction

This project uses machine learning regression models to predict flood risks based on environmental and historical data, employing techniques such as linear regression, polynomial regression, SGDRegressor, and XGBoost for accurate flood prediction.

data-preprocessing data-science data-visualization feature-engineering machine-learning-algorithms regression xgboost-regression

Last synced: 05 Apr 2025

https://github.com/annaanastasy/clustering-fish-species

A comprehensive project demonstrating the use of various clustering techniques to analyze and group fish data effectively.

clustering-algorithm data-science data-visualization machine-learning-algorithms unsupervised-clustering unsupervised-machine-learning

Last synced: 05 Apr 2025

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/omar7001-b/listenify-cost-metrics-and-estimation

A comprehensive cost estimation and metrics analysis tool for the Listenify speech transcription application. Features function point analysis, complexity calculations, timeline projections, and automated report generation with visualizations.

cost-estimation data-visualization function-point-analysis metrics nodejs pdf-generation project-management project-metrics project-planning python software-development software-estimation

Last synced: 30 Apr 2025

https://github.com/analyticalnahid/matplotlib-tutorial

A complete Notebook on Matplotlib for Data Science

data-visualization matplotlib matplotlib-python matplotlib-tutorial

Last synced: 28 Mar 2025

https://github.com/another-guy/use-d3

React hooks for D3.js data visualization library.

d3 d3js d3js-hook d3js-hooks data-visualization data-viz react react-hook react-hooks reactjs

Last synced: 16 Jan 2026

https://github.com/shubhammittal-data/sales-customer_dashboard_tableau

An interactive Tableau project showcasing advanced data visualization techniques for sales performance and customer analytics. This dashboard provides key business insights using KPIs, trend analysis, and customer segmentation. Designed for executives, sales managers, and marketing teams to drive data-driven decision-making.

customer-behavior-analysis customer-segmentation data-analysis data-visualization product-analytics sales-analysis tableau tableau-dashboards tableau-public

Last synced: 07 Mar 2026

https://github.com/vshelke/databot

🤖 A databot made for the Jaano India initiative.

data-visualization flask flask-application portfolio python search-engine

Last synced: 03 Apr 2025

https://github.com/jlee9503/defense-risk-prediction

Build a machine learning pipeline that ingests defense procurement data, identifies high-risk contracts, and visualizes the results in an interactive dashboard.

data-analysis data-visualization exploratory-data-analysis python

Last synced: 25 Jan 2026

https://github.com/hassanislam463/british-airways-data-science

Analyze Skytrax reviews to uncover customer sentiments and key themes while predicting booking behavior using machine learning. This repository includes data collection, analysis, and modeling scripts alongside concise, visualized insights to improve customer experience and operational efficiency.

data-analysis data-science data-visualization

Last synced: 28 Mar 2025

https://github.com/hassanislam463/sentiment_analysis_of_financial_news_headlines_and_affect_on_stock_price_prediction

This project analyzes financial news sentiment using a fine-tuned RoBERTa model and integrates it with stock data to predict price movements using LSTM and GRU. It highlights the role of sentiment in enhancing stock market forecasting.

data-analysis data-science data-visualization deep-learning lstm-neural-networks nlp-machine-learning

Last synced: 28 Mar 2025

https://github.com/chahelgupta/fitness-data-analysis-r-project

This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.

data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio

Last synced: 18 May 2026

https://github.com/zmyzheng/browserassistant

Big Data & Cloud Computing project for recommendation, cluster analysis, and data visualization with Hadoop and Spark, deployed in an auto-scaling cloud environment, YouTube link:

angular big-data-analytics cloud cluster-analysis data-visualization elasticsearch flask hadoop recommendation-system spark spring-boot

Last synced: 14 Apr 2026

https://github.com/phette23/treemap-koha-facets

d3.js treemap example with library catalog facet usage

d3js data-visualization treemap

Last synced: 21 May 2026

https://github.com/takk8is/datasetanalysiseda

A robust Python tool for comprehensive dataset analysis and machine learning model evaluation. This project automates the process of data preprocessing, exploratory data analysis (EDA), and predictive modeling, with a focus on handling common data inconsistencies.

analytics analyzer chart csv-files data-science data-visualization datascience dataset datasets davidccavalcante eda fjallstoppur graphics machine-learning python python3 takk-ag takk-design takk8is xlsx-files

Last synced: 02 Sep 2025

https://github.com/satyacoder29/smartfinance-dynamic-financial-dashboard

SmartFinance: Dynamic Financial Dashboard is an interactive tool designed to visualize key financial metrics like revenue, expenses, and profit. It features real-time data updates, charts, slicers, and navigation for easy analysis. This dashboard helps businesses make data-driven decisions and optimize financial performance.

data-analysis data-cleaning data-modeling data-visualization powerbi powerbi-desktop powerbi-visuals powerquerym

Last synced: 13 Feb 2026

https://github.com/singhdivyank/visualization

Wrangling NYPD data and visualising using graphs and maps in Python, Tableau, and R

data-visualization data-wrangling geopandas ggplot2 plotly pygwalker

Last synced: 13 Jun 2026

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/majajuri/vizualizacija-podataka

Lab exercises from the course Vizualizacija podataka (Data Visualization) at FER

d3js data-visualization jupyter-notebook tableau

Last synced: 05 Apr 2025

https://github.com/nick-peter-marcus/detect-fake-job-postings

Detecting Fake Job Postings - Data Visualization, TF-IDF, XGBoost, SVC

cross-validation data-visualization machine-learning svc tf-idf xgboost

Last synced: 09 Jul 2025

https://github.com/aryasoni98/github-repo-analysis

An analysis of the most popular repos on GitHub.

analytics data-visualization github

Last synced: 30 Jun 2026

https://github.com/smahala02/materials-science-data-analysis

Analysis of diffraction and spectrum data in materials science using Python for data visualization and interpretation.

data-visualization diffraction-analysis materials-science python spectrum-analysis

Last synced: 18 May 2026

https://github.com/nick-peter-marcus/chocolate-bar-analysis

Analyzing Chocolate Bar Features and Ratings - Data Visualization, Decision Trees, Random Forest

data-analysis data-visualization decision-trees python random-forest seaborn sklearn

Last synced: 10 May 2026

https://github.com/jigyasag18/data-analysis-using-ms-excel

This project analyzes real-time data from Ambuvians Healthcare, a health products startup. It includes data cleaning, such as removing duplicates and addressing missing values, followed by analyses that reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS Excel include bar and pie charts.

analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization

Last synced: 07 Mar 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026