An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sreyashidey/scrape-analyze-visualize

A project for web scraping, data analysis, and visualization using Selenium, BeautifulSoup, and Python.

bs4 data-visualization selenium

Last synced: 03 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/niniola-creator/niniola-creator

This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.

bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql

Last synced: 07 Apr 2026

https://github.com/gfav-cybergeek/prodigy_ml_01

A linear regression model to predict house prices based on square footage, number of bedrooms, and bathrooms. Includes feature engineering, preprocessing, and model evaluation.

ai airtificialintelligence algorithms algorithms-and-data-structures data-structures data-visualization jupyter jupyter-notebook jupyterlab machine-learning machine-learning-algorithms machine-learning-models python

Last synced: 05 Apr 2025

https://github.com/cyprianfusi/world-happiness-report-for-2015-2019

World Happiness Report for 2019 with strange and unexpected results for Sub-Sahara African Countries! But it's data speaking...

data-visualization pandas-python

Last synced: 21 Mar 2025

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/cyprianfusi/uk-covid-19-data-via-opendata-api

With recommendation to the UK government to halt all mandatory testing! Tests should only be conducted on patients as part of diagnosis and treatment. This is because with low prevalence of the disease most positive test results are false positives. This is due to irreducible error in the test.

api covid-19 data-visualization pandas-python uk

Last synced: 21 Mar 2025

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 22 Jul 2025

https://github.com/radhikareddy-chintareddy/big-data-analysis-ny-weather-air-quality-2022

End-to-end workflow showcasing database setup, API development, and interactive data retrieval of large datasets. Includes integration and analysis of 2022 SURFACE HOURLY weather data (global, US, and NY) merged with NY air pollution data from the EPA to uncover actionable insights.

big-data-analytics data-integration data-visualization flask-restful jupyter-notebook pymysql python

Last synced: 18 May 2026

https://github.com/vit0r/trino-datavirtualization

POC trino - some catalogs, mariadb,postgresql,mongodb and minio

data-visualization

Last synced: 07 Mar 2026

https://github.com/kylemit/livedataisbeautiful

A casual attempt at data visualizations

data-visualization highcharts

Last synced: 20 May 2026

https://github.com/andersoncrs/analisis-de-texto-tweets

En este proyecto exploro el análisis de texto de tweets para descubrir tendencias, opiniones y temas relevantes en redes sociales. Usando herramientas de procesamiento de lenguaje natural, convierto grandes volúmenes de mensajes en información clara y visualmente atractiva.

data-analysis data-visualization eda text-mining

Last synced: 21 Jul 2025

https://github.com/nadamarei/data-analyzer

The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns

data-analysis data-visualization python-3 streamlit

Last synced: 18 May 2026

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 29 Oct 2025

https://github.com/nadahamdy217/skincaresentinel

This project analyzes customer feedback for skincare products by predicting sentiment using an unsupervised model. It includes a web application for real-time sentiment analysis, an ETL pipeline built with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics, and a Power BI dashboard for visualizing review trends.

azure customer-feedback data-engineering data-science data-visualization database databricks etl-pipeline flask machine-learning powerbi python sentiment-analysis synapse-analytics unsupervised-learning web-application

Last synced: 07 Apr 2026

https://github.com/j5py/py4e

Python for Everybody Specialization (from University of Michigan on Coursera).

api data-visualization json python sql sqlite xml

Last synced: 05 May 2026

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn

Last synced: 13 Apr 2025

https://github.com/ireneflorez/exploration_r

Data exploration on the 'White Wine Quality' dataset using R

data-analysis data-visualization r

Last synced: 16 Jun 2026

https://github.com/annaanastasy/regression-project-flood-prediction

This project uses machine learning regression models to predict flood risks based on environmental and historical data, employing techniques such as linear regression, polynomial regression, SGDRegressor, and XGBoost for accurate flood prediction.

data-preprocessing data-science data-visualization feature-engineering machine-learning-algorithms regression xgboost-regression

Last synced: 05 Apr 2025

https://github.com/annaanastasy/clustering-fish-species

A comprehensive project demonstrating the use of various clustering techniques to analyze and group fish data effectively.

clustering-algorithm data-science data-visualization machine-learning-algorithms unsupervised-clustering unsupervised-machine-learning

Last synced: 05 Apr 2025

https://github.com/another-guy/use-d3

React hooks for D3.js data visualization library.

d3 d3js d3js-hook d3js-hooks data-visualization data-viz react react-hook react-hooks reactjs

Last synced: 16 Jan 2026

https://github.com/shubhammittal-data/sales-customer_dashboard_tableau

An interactive Tableau project showcasing advanced data visualization techniques for sales performance and customer analytics. This dashboard provides key business insights using KPIs, trend analysis, and customer segmentation. Designed for executives, sales managers, and marketing teams to drive data-driven decision-making.

customer-behavior-analysis customer-segmentation data-analysis data-visualization product-analytics sales-analysis tableau tableau-dashboards tableau-public

Last synced: 07 Mar 2026

https://github.com/jlee9503/defense-risk-prediction

Build a machine learning pipeline that ingests defense procurement data, identifies high-risk contracts, and visualizes the results in an interactive dashboard.

data-analysis data-visualization exploratory-data-analysis python

Last synced: 25 Jan 2026

https://github.com/chahelgupta/fitness-data-analysis-r-project

This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.

data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio

Last synced: 18 May 2026

https://github.com/singhdivyank/visualization

Wrangling NYPD data and visualising using graphs and maps in Python, Tableau, and R

data-visualization data-wrangling geopandas ggplot2 plotly pygwalker

Last synced: 13 Jun 2026

https://github.com/majajuri/vizualizacija-podataka

Labosi iz predmeta Vizualizacija podataka (FER)

d3js data-visualization jupyter-notebook tableau

Last synced: 05 Apr 2025

https://github.com/smahala02/materials-science-data-analysis

Analysis of diffraction and spectrum data in materials science using Python for data visualization and interpretation.

data-visualization diffraction-analysis materials-science python spectrum-analysis

Last synced: 18 May 2026

https://github.com/jigyasag18/data-analysis-using-ms-excel

This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.

analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization

Last synced: 07 Mar 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026

https://github.com/as16082023/data-professional-survey-breakdown-

Created a dashboard to visualize survey data of data professionals

alex-the-analyst dashboard data-visualization guided-project power-bi power-query

Last synced: 20 Mar 2026

https://github.com/lparham2/factors-driving-ev-adoption-charging-station-deployment

This project explores factors driving EV adoption and charging station deployment using Python-based data analysis. It examines sales trends, infrastructure growth, and socioeconomic influences to uncover key insights. The goal is to aid policymakers and businesses in optimizing EV infrastructure and accelerating sustainable transportation.

data-analysis data-visualization electric-vehicle-charging-station electric-vehicles powerpoint-presentations python

Last synced: 18 May 2026

https://github.com/aidanv22/kagglecompetitions2024

These are coding projects I worked on during my 2024 Fall Semester. Each of these was ranked either in the 18pt or 20pt benchmark.

data-visualization data-wrangling dplyr embeddings feature-engineering feature-selection linear-regression logistic-regression models pca-analysis r xgboost

Last synced: 06 Apr 2025

https://github.com/neuraladitya/polynomial_regression_c

A high-performance polynomial regression implementation in pure C with gradient descent optimization and visualization support.

algorithm-implementation c-programming csv-processing data-science data-visualization high-performance-computing machine-learning numerical-computing polynomial-regression regression-analysis

Last synced: 05 Apr 2025

https://github.com/ayax537/codsoft-task2

Second task on CodSoft Internship Transaction Fraud Detection! During my CodSoft internship, I worked on a challenging project focused on detecting fraudulent credit card transactions

data-modelling data-visualization eda machine-learning model-evaluation

Last synced: 29 Oct 2025

https://github.com/sean-doody/gmu-chss-degrees

Analysis of labor market outcomes for humanities and social science college degree holders.

data-visualization r statistics

Last synced: 05 Jul 2025

https://github.com/lucs1590/triathlon-dashboard

This is a repository that shows some graphics and makes a dashboard related to triathlon data.

angular dashboard data-visualization data-viz graphs plotly plotly-dash plotlyjs storytelling triathlon

Last synced: 12 May 2026

https://github.com/dkoh2018/car_shopping

A car price analysis tool with brand comparisons, trend tracking, and interactive visualizations. Built with Python and Streamlit

automotive car-market data-visualization price-analysis price-tracker python streamlit web-scraping

Last synced: 18 May 2026

https://github.com/parvatijay2901/homelessness-in-the-us

Data511: Data Visualization for Data Scientists (Final Project)

data-visualization python tableau

Last synced: 18 May 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/adhit-r/holocron

A 3D Star Wars universe explorer. Galaxy, timeline, Force-lineage, datapad — one selection drives all four. Next.js + R3F + Tailwind. Real public data, no backend.

3d data-visualization embeddings force-graph good-first-issue interactive knowledge-graph nextjs react react-three-fiber semantic-search shaders star-wars tailwindcss threejs transformers-js typescript webgl webgpu wookieepedia

Last synced: 18 May 2026

https://github.com/kevinwood15/python_twitter_datawrangling_project

The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.

cleaning-data data-science data-visualization python wrangling-data

Last synced: 18 May 2026

https://github.com/sanatladkat/superstore-dashboard

This repository contains a PowerBI dashboard analyzing sales data. The dataset provides insights into various aspects such as sales performance, profitability, customer segments, and regional trends. The dashboard was created to facilitate a clear understanding of key metrics and to guide decision-making processes.

analytics data-visualization powerbi

Last synced: 08 Apr 2026

https://github.com/ddihora1604/social_media_analysis

A powerful, interactive dashboard for analyzing social media conversations, trends, and network dynamics. This tool allows researchers and analysts to explore patterns in social media data, identify key trends, and detect coordinated behavior.

aiml css data-analysis data-visualization html javascript python

Last synced: 30 Oct 2025

https://github.com/leonardoberlatto/tableau-life-expectancy

Animation to present the development of life expectancy and fertility rate over the years in different countries

charts data-science data-visualization tableau

Last synced: 12 Jan 2026

https://github.com/sayamalt/titanic-survival-prediction

Successfully developed a Logistic Regression model for predicting the survival of a passenger aboard the Titanic ship based on his/her various features such as gender, age, passenger class, no. of siblings, embarkation location, etc.

data-cleaning data-preprocessing data-visualization exploratory-data-analysis logistic-regression machine-learning sklearn

Last synced: 18 May 2026

https://github.com/adriangalvanzamora/ecommerce-analytics-olist

Data analysis project based on the Olist Brazilian E-Commerce dataset. Includes data cleaning, exploratory analysis, delivery performance metrics, customer satisfaction modeling, and geospatial insights. Built entirely in Python (Jupyter Notebook) using real-world data from Kaggle.

brazil customer-satisfaction data-analysis data-visualization ecommerce folium geospatial-analysis machine-learning matplotlib notebook pandas plotly python seaborn

Last synced: 06 May 2026

https://github.com/drisskhattabi6/meteo-data-mining

This repo contains using Data Mining Techniques to analyze meteorological (meteo) data. The objective is to extract meaningful insights and patterns from the data that can aid in understanding weather phenomena and predicting future weather conditions.

cart data-analysis data-mining data-visualization decision-making decision-tree extract-data extract-insights insights-analytics insights-data k-means knn machine-learning svm

Last synced: 21 Mar 2025

https://github.com/veydantkatyal/carbon-emission-analysis

explore and visualize global carbon emissions trends and their environmental impact.

carbon-emissions climate-change data-visualization

Last synced: 12 Apr 2025

https://github.com/ahmedmmahrous/movie-recommendation-and-analysis

Perform analysis and Basic Recommendations based on Similar Genres and Movies which Users prefer.

data-visualization feature-engineering nu pan py recommender-system seaborn

Last synced: 03 Feb 2026

https://github.com/batthulavinay/basic-linear-regression

This project demonstrates Basic Linear Regression using Python. The notebook includes dataset loading, exploratory data analysis, model training, evaluation, and visualization of results.

data-visualization datapreprocessing exploratory-data-analysis linear-regression matplotlib modelevaluation pandas-library

Last synced: 13 Apr 2025

https://github.com/bamresearch/sofa

SOftware for Force Analysis - A graphical user interface to analyze Atomic Force Microscopy Force Spectroscopy data

atomic-force-microscopy data-science data-visualization

Last synced: 17 Jan 2026

https://github.com/rathod-shubham/google-data-analytics

Learning a wide range of skills that are useful in everyday life as well as being a data analyst.

data-analysis data-analysis-in-r data-analyst data-analyst-nanodegree data-analytics data-visualization google

Last synced: 03 Feb 2026

https://github.com/omarsalemdmet/multidimensional_visualization_in_opengl

This project demonstrates two distinct techniques for visualizing multidimensional data using C++ and OpenGL

cpp data-visualization opengl visualization

Last synced: 07 May 2026

https://github.com/muhammadmoiz367/powerbi-loan-dashboard

Interactive Power BI dashboard analyzing loan amounts, year-over-year changes, and regional default trends.

business-intelligence dashboard data-visualization financial-analysis loan-analysis powerbi reporting

Last synced: 24 Feb 2026

https://github.com/code-with-zeeshan/remote-work-productivity-analyzer

ProductivityAnalyzer v2.0 — A desktop application that tracks and analyzes user activity with smart categorization, focus mode, goal setting, visual reports, AI suggestions, and dark mode. Built with Python, PyQt5, PostgreSQL.

activity-tracker data-visualization desktop-app focus-mode postgresql productivity pyqt5 python

Last synced: 27 Jun 2026

https://github.com/tralahm/octave-matlab

Using matlab and octave for machine learning and numerical computing

data-science data-visualization machine-learning matlab octave-functions octave-scripts tralahm tralahtek

Last synced: 13 Jun 2026

https://github.com/vaxdata22/zillow-rapid-api-end-to-end-etl-data-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2 as well as AWS Lambda. It demonstrates how to build ETL data pipeline that would perform data transformation using Lambda function as well as loading into a Redshift cluster table. The data would then be visualized using Amazon QuickSight.

amazon-quicksight amazon-redshift apache-airflow aws-ec2 aws-lambda aws-s3 business-intelligence dags data-visualization etl-pipeline orchestration python3 rapid-api zillow-house-listings

Last synced: 19 May 2026

https://github.com/bjornmelin/data-analytics-playground

🧐 Collection of academic data analytics projects showcasing exploratory data analysis, geographic visualization, and interactive dashboards.

data-analysis data-analytics data-visualization geographic-analysis ggplot interactive-maps leaflet r r-programming shiny tidyverse

Last synced: 06 Apr 2025

https://github.com/abdoomohamedd/data-science-projects

A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.

data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms

Last synced: 14 May 2025

https://github.com/curatorcodicis/reddit-sentiment-analyzer

A Python-based tool that analyzes sentiment trends in Reddit discussions. Fetch posts, analyze sentiment using NLP, and visualize trends in an interactive Streamlit dashboard.

data-visualization docker mongodb nlp praw python reddit sentiment-analysis streamlit

Last synced: 13 Apr 2026

https://github.com/shivabajelan/squamous_cell_carcinoma_treatment_analysis

The study involved treating 249 mice with SCC tumors using a range of drug regimens, including Pymaceuticals' drug of interest, Capomulin. Over 45 days, tumor development was observed and measured to compare the performance of Capomulin against other treatments. My task was to generate tables and figures for the technical report of the study.

data-visualization matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/mindrones/dualagn

Active galaxies: emission line diagrams (https://github.com/astroboxio/d3.js_dAGN_BPT)

astronomy chart d3js data-visualization galaxy interactive

Last synced: 16 May 2026

https://github.com/shivam5992/bokeh-vis

Visualising the acquisitions made by Google using python - Bokeh

bokeh bokeh-server data-visualization eda exploratory-data-analysis python

Last synced: 26 Jun 2025

https://github.com/touppercase78/salary-prediction-collection

Salary predictions with ML models and analyses on datasets from several other GitHub repos

data-analysis data-visualization datasets machine-learning python3 regression-models

Last synced: 02 May 2026

https://github.com/ramonanf/tc1002s_semanatec

Herramientas computacionales: El arte de la analítica

data-analysis data-visualization jupiter-notebook pandas-python

Last synced: 15 Jun 2025

https://github.com/amlanmohanty1/fannie-mae-borrower-behavior-and-characteristics-2007vs2019

Analysis using R and tidyverse to compare borrower behavior and characteristics between the years 2007 and 2019, focusing on key financial metrics such as credit scores, interest rates, debt to income ratios, and loan to value ratios.

data-visualization fannie-mae r tidyverse

Last synced: 13 Sep 2025

https://github.com/mimi-netizen/python-and-machine-learning-in-financial-analysis

This comprehensive repository covers financial data analysis using Python and machine learning techniques, including time series modeling, portfolio optimization, risk assessment, credit risk prediction, and deep learning applications in finance.

data-analysis data-science data-visualization finance financial-analysis financial-data financial-modeling

Last synced: 19 May 2026

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice graph showing and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/emircanakyuzz/logistic_regression_ile_tahmin-estimating_with_logistic_regression

Çeşitli şirketlerin yaptığı harcamaları makine öğrenmesi algoritmalarından biri olan logistic regression kullanarak sınıflandırmaya çalışıyoruz. Eğittiğimiz model ile şirketlerin gelecek yıllardaki harcamalarının hangi alana ait olduğunu tahmin ediyoruz (sınıflandırıyoruz) ve performansını değerlendiriyoruz.

artificial-intelligence data-science data-visualization jupyter-notebook logistic-regression machine-learning prediction

Last synced: 29 Mar 2025

https://github.com/sarvamm/zeno-chat

Chat with your data in natural language and get insights and plots without any writing any code

chatbot data-science data-visualization large-language-models streamlit

Last synced: 19 May 2026

https://github.com/Lightning-Chart/lcjs-example-0053-dataGaps

Example showcasing how data gaps can be handled XY series. Particularly highlights line and area series in a trading use case

data-visualization demo example lightningchart-js template trading

Last synced: 22 Jul 2025

https://github.com/technologiestiftung/ihk-gewerbedaten-time-travel

Data visualization that allows time travelling through the Berlin business landscape of the past decades, highlighting business registrations for each year

businesses data-visualization map maplibre-gl-js open-data stimulus-js

Last synced: 22 Jul 2025

https://github.com/jabulente/tanzania-geographical-zones

This project provides a geospatial visualization of Tanzania's geographical zones and regions. It uses geospatial data to map each zone, display regions, and annotate them for easy identification. The visualizations include simulated data to demonstrate thematic mapping techniques.

ai data-analysis data-science data-visualization geopandas geospatial location matplotlib ml python tanzania tanzania-geographic tanzania-locations

Last synced: 19 May 2026