An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/shuyib/london_weather_prediction

The London Weather Project aims to predict the mean temperature in London using historical weather data, involving data cleaning, feature engineering, and modeling with techniques like imputation, transformation, scaling, and the use of Mlflow for tracking model performance and hyperparameters.

data-cleaning data-lab data-science data-visualization datacamp-projects environmental-science feature-engineering forecasting jupyter-notebook machine-learning mlflow open-data python random-forest regression-analysis time-series weather-prediction

Last synced: 29 Mar 2025

https://github.com/no-country-simulation/c21-55-n-data-bi

Trabajo de análisis estadístico en Power Bi, sobre la deserción de alumnos en carreras culturales universitarias de argentina.

data-visualization

Last synced: 18 Feb 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/saroshfarhan/irish_hospital_data_anaysis

Irish hospital's patient discharge data for four counties analysis

data-analysis data-science data-visualization healthcare irish-data r-programming-language

Last synced: 18 Feb 2026

https://github.com/abdoomohamedd/data-science-projects

A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.

data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms

Last synced: 14 May 2025

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/analyticalnahid/plotly-tutorial

A intro of Plolty for Data Science

data-science data-visualization ploty python3

Last synced: 28 Mar 2025

https://github.com/shellynagar27/marketing-content-performance-analysis

Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.

data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing

Last synced: 26 Jun 2025

https://github.com/vaxdata22/zillow-rapid-api-end-to-end-etl-data-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2 as well as AWS Lambda. It demonstrates how to build ETL data pipeline that would perform data transformation using Lambda function as well as loading into a Redshift cluster table. The data would then be visualized using Amazon QuickSight.

amazon-quicksight amazon-redshift apache-airflow aws-ec2 aws-lambda aws-s3 business-intelligence dags data-visualization etl-pipeline orchestration python3 rapid-api zillow-house-listings

Last synced: 19 May 2026

https://github.com/harmonicode/filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 22 Mar 2025

https://github.com/Akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 Feb 2026

https://github.com/omarsalemdmet/multidimensional_visualization_in_opengl

This project demonstrates two distinct techniques for visualizing multidimensional data using C++ and OpenGL

cpp data-visualization opengl visualization

Last synced: 07 May 2026

https://github.com/bamresearch/sofa

SOftware for Force Analysis - A graphical user interface to analyze Atomic Force Microscopy Force Spectroscopy data

atomic-force-microscopy data-science data-visualization

Last synced: 17 Jan 2026

https://github.com/ahmedmmahrous/movie-recommendation-and-analysis

Perform analysis and Basic Recommendations based on Similar Genres and Movies which Users prefer.

data-visualization feature-engineering nu pan py recommender-system seaborn

Last synced: 03 Feb 2026

https://github.com/veydantkatyal/carbon-emission-analysis

explore and visualize global carbon emissions trends and their environmental impact.

carbon-emissions climate-change data-visualization

Last synced: 12 Apr 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/karanch10/fraudshield

FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.

data-analysis data-science data-visualization machine-learning python3

Last synced: 20 May 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/leonardoberlatto/tableau-life-expectancy

Animation to present the development of life expectancy and fertility rate over the years in different countries

charts data-science data-visualization tableau

Last synced: 12 Jan 2026

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/ranxi2001/predicting-mental-health-risk

数据分析案例-精神健康预测(数据来源kaggle)

data-analysis data-visualization eda

Last synced: 27 Jun 2025

https://github.com/catalina2820/inteligencia-de-negocios

This repository contains materials and resources for the Business Intelligence course. It includes notes, workshops, and practical exercises that cover essential concepts and applications in data science, data visualization, machine learning, and big data.

bigdata data-cleaning data-science data-visualization web-scraping

Last synced: 04 Apr 2025

https://github.com/hannahgsimon/halmodeling2024

Developed code using the Hybrid Automata Library (HAL) to create a spatial agent-based model of radio-immune response to spatially fractionated radiotherapy. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/arosas17/mapping_earthquakes

Created a map to demonstrate the correlation between the tectonic plates and earthquakes. Circle were made on a map to indicate earthquakes, changing colors and size based on magnitude of the earthquake.

data-visualization javascript map

Last synced: 20 May 2026

https://github.com/kevinwood15/python_twitter_datawrangling_project

The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.

cleaning-data data-science data-visualization python wrangling-data

Last synced: 18 May 2026

https://github.com/kmohamedalie/excel-data-visualization

Visualizing and publishing chats with excel

data-visualization excel github html

Last synced: 23 Jun 2026

https://github.com/vzamboulingame/data-portfolio

This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.

data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql

Last synced: 20 May 2026

https://github.com/fvdavid/d3-in-action

Angular 19 Data Visualization D3js

angular d3-visualization d3js data-visualization typescript

Last synced: 08 May 2026

https://github.com/zhouzhuofei/juliadl

learning Julia, write some notebooks, like machine learning and data science, visualization.

data-science data-visualization julia mxnet

Last synced: 21 Apr 2026

https://github.com/maruf-hossen/kaggle-projects-and-learning

Comprehensive data science learning journey through Kaggle courses and exercises. Documenting progress in SQL, Python, ML, and data visualization with practical projects and business applications.

business-intelligence data-cleaning data-science data-visualization kaggle learning-journey machine-learning pandas python sql

Last synced: 05 May 2026

https://github.com/parvatijay2901/homelessness-in-the-us

Data511: Data Visualization for Data Scientists (Final Project)

data-visualization python tableau

Last synced: 18 May 2026

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 01 Jan 2026

https://github.com/htanh2003/datamate

DataMate là một công cụ phân tích dữ liệu thông minh, kết hợp sức mạnh của mô hình ngôn ngữ lớn (LLM) và giao diện trực quan, giúp người dùng dễ dàng tải lên tệp CSV, khám phá dữ liệu, và nhận các phân tích thông minh

agent data-visualization deployment docker docker-compose langchain nginx streamlit

Last synced: 08 Apr 2026

https://github.com/aidanv22/kagglecompetitions2024

These are coding projects I worked on during my 2024 Fall Semester. Each of these was ranked either in the 18pt or 20pt benchmark.

data-visualization data-wrangling dplyr embeddings feature-engineering feature-selection linear-regression logistic-regression models pca-analysis r xgboost

Last synced: 06 Apr 2025

https://github.com/mondial7/echarts-wc-import

Webcomponent to import Echarts library - current version echarts 3.8.5

data-visualization echarts3 polymer2 webcomponents

Last synced: 03 May 2026

https://github.com/kashirin-alex/thither.direct-onamove

an android skeleton-example application for using data from Thither.Direct platform on mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 27 Apr 2026

https://github.com/as16082023/data-professional-survey-breakdown-

Created a dashboard to visualize survey data of data professionals

alex-the-analyst dashboard data-visualization guided-project power-bi power-query

Last synced: 20 Mar 2026

https://github.com/cassiofb-dev/covid-grafico

Gráficos da COVID-19 (Mortes e Casos) com Chart.js.

chartjs covid-19 data-visualization

Last synced: 25 Jan 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/ivangrana/data-visualization

Data visualization repository made with Chart.Js, D3,Plotly and Rstudio

d3js data-visualization

Last synced: 20 Jul 2025

https://github.com/eins51/restaurantanalytics

Comprehensive business analytics project using Python and Tableau. Features include data visualization, interactive dashboards, and data-driven insights for restaurant performance and consumer behavior.

business-analytics data-visualization python-dashboard restaurant-analysis tableau

Last synced: 27 Jun 2025

https://github.com/smahala02/materials-science-data-analysis

Analysis of diffraction and spectrum data in materials science using Python for data visualization and interpretation.

data-visualization diffraction-analysis materials-science python spectrum-analysis

Last synced: 18 May 2026

https://github.com/tapas-gope/pizza-sales

This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.

business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis

Last synced: 02 Jan 2026

https://github.com/muneeb706/nei_pm2.5_data_analysis

Exploratory Data Analysis of PM2.5 Emission records from EPA National Emission Inventory

data-visualization exploratory-data-analysis r-programming

Last synced: 27 Jun 2025

https://github.com/balajimohan18/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides

Last synced: 08 Mar 2026

https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn

Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation

data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis

Last synced: 08 Apr 2026

https://github.com/gustavo-b-morales-s/footwear-market-data-pipeline

This project uses Python, Scrapy, SQLite3 and Streamlit to extract sports shoe data from Mercado Livre, perform transformations using Pandas, store the data in an SQLite database and create a data visualization interface emphasizing the main KPIs using Streamlit.

data-engineering data-visualization etl-pipeline webscraping

Last synced: 06 Apr 2025

https://github.com/ancapitigoi/mushrooms-selection

In order to find which mushrooms are safe to eat, the decision tree data mining method is used.

classification-model data-cleaning data-exploration data-mining data-visualization decision-tree penalty-method r-programming

Last synced: 15 Jun 2026

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights

Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.

data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis

Last synced: 19 Mar 2026

https://github.com/rizz1406/crypto-price-tracker

A user-friendly Crypto Price Tracker built using Python and Streamlit. Track top 20 cryptocurrencies in INR and USD, visualize historical trends, and set up price alerts easily.

coingecko-api crypto-tracker cryptocurrency cryptocurrency-prices data-visualization pandas price-alerts python streamlit

Last synced: 11 May 2026

https://github.com/samirelanduk/quickplots

An object-oriented plotting library for Python, with simplicity as its principal aim.

charting-library charts data-visualization dataviz graphs visualization

Last synced: 15 Sep 2025

https://github.com/annaanastasy/regression-project-flood-prediction

This project uses machine learning regression models to predict flood risks based on environmental and historical data, employing techniques such as linear regression, polynomial regression, SGDRegressor, and XGBoost for accurate flood prediction.

data-preprocessing data-science data-visualization feature-engineering machine-learning-algorithms regression xgboost-regression

Last synced: 05 Apr 2025

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn

Last synced: 13 Apr 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 21 May 2026

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 29 Oct 2025

https://github.com/kylemit/livedataisbeautiful

A casual attempt at data visualizations

data-visualization highcharts

Last synced: 20 May 2026

https://github.com/divyashah0510/sales-insights-for-retail-company

This project is a data analysis project for a retail company. The company has dataset: sales_data.csv that contains the sales data for the company. The objective of this project is to analyze the sales data and provide insights to the company to improve their sales.

dash data-visualization pandas plotly sales sales-insights streamlit visualization

Last synced: 02 Jan 2026

https://github.com/jarif87/pokeinsights

A Selenium-Powered Data Scraping and Tableau Visualization Project

data-visualization python scraping selenium tableau

Last synced: 21 May 2026

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/gfav-cybergeek/prodigy_ml_01

A linear regression model to predict house prices based on square footage, number of bedrooms, and bathrooms. Includes feature engineering, preprocessing, and model evaluation.

ai airtificialintelligence algorithms algorithms-and-data-structures data-structures data-visualization jupyter jupyter-notebook jupyterlab machine-learning machine-learning-algorithms machine-learning-models python

Last synced: 05 Apr 2025

https://github.com/maazie-khan/austin-housing-insights-powerbi

Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/amanssur-tech/d3-visualizations

Modern React + D3 data visualization dashboard built with Vite, Tailwind & Framer Motion.

d3 dashboard data-visualization framer-motion react tailwindcss typescript vite

Last synced: 08 Apr 2026

https://github.com/lucycatherine/healthinsuranceproject

This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.

data-analysis data-science data-visualization jupyter-notebook machine-learning python

Last synced: 18 May 2026

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/omari-kd/transborder-freight-data-analysis

This project analyses transportation data from the Bureau of Transportation Statistics (BTS) to uncover insights into cross-border freight's efficiency, safety and environmental impacts across road, rail, air and water modes.

data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-science data-visualization powerbi

Last synced: 30 Mar 2025

https://github.com/timjjting/escaping-flatland-slides

Slides for techniques behind escaping flatland

data-visualization glsl lod octree threejs

Last synced: 14 May 2025

https://github.com/al-ghaly/hotel-revenue-excel-analysis

Excel Dashboard to analyze data of a hotel over the past three years.

dashboard data-analysis data-visualization excel excel-analysis

Last synced: 02 Jan 2026

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026