An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/hassanislam463/british-airways-data-science

Analyze Skytrax reviews to uncover customer sentiments and key themes while predicting booking behavior using machine learning. This repository includes data collection, analysis, and modeling scripts alongside concise, visualized insights to improve customer experience and operational efficiency.

data-analysis data-science data-visualization

Last synced: 28 Mar 2025

https://github.com/hassanislam463/sentiment_analysis_of_financial_news_headlines_and_affect_on_stock_price_prediction

This project analyzes financial news sentiment using a fine-tuned RoBERTa model and integrates it with stock data to predict price movements using LSTM and GRU. It highlights the role of sentiment in enhancing stock market forecasting.

data-analysis data-science data-visualization deep-learning lstm-neural-networks nlp-machine-learning

Last synced: 28 Mar 2025

https://github.com/omerdduran/riskfactor-heart

This ML project predicts heart disease using logistic regression on the Cleveland Heart Disease UCI dataset, featuring advanced preprocessing and medical feature engineering, achieving 82.1% accuracy with strong cross-validation.

cardiovascular-health data-science data-visualization heart-disease-prediction logistic-regression machine-learning medical-ai scikit-learn

Last synced: 14 May 2026

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/amanssur-tech/d3-visualizations

Modern React + D3 data visualization dashboard built with Vite, Tailwind & Framer Motion.

d3 dashboard data-visualization framer-motion react tailwindcss typescript vite

Last synced: 08 Apr 2026

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/takk8is/datasetanalysiseda

A robust Python tool for comprehensive dataset analysis and machine learning model evaluation. This project automates the process of data preprocessing, exploratory data analysis (EDA), and predictive modeling, with a focus on handling common data inconsistencies.

analytics analyzer chart csv-files data-science data-visualization datascience dataset datasets davidccavalcante eda fjallstoppur graphics machine-learning python python3 takk-ag takk-design takk8is xlsx-files

Last synced: 02 Sep 2025

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/analyticalnahid/seaborn-tutorial

A complete Notebook on Seaborn for Data Science

data-visualization seaborn seaborn-tutorial

Last synced: 23 Aug 2025

https://github.com/analyticalnahid/plotly-tutorial

A intro of Plolty for Data Science

data-science data-visualization ploty python3

Last synced: 28 Mar 2025

https://github.com/saketr3/voting-policy-impact-visualizer

Data visualization web app where users can compare voter turnout of different demographics with states’ voting policy fairness scores

data-visualization voting

Last synced: 14 Mar 2025

https://github.com/shellynagar27/marketing-content-performance-analysis

Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.

data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing

Last synced: 26 Jun 2025

https://github.com/zmyzheng/stack_overflow_qa_assistant

Big Data Analysis project with recommendation, cluster analysis and graph database

big-data-analytics cluster-analysis data-visualization graph-database hadoop mahout recommendation-system

Last synced: 30 Mar 2025

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/alan-oliveir/previsao_cartao_fidelidade

Projeto de ciência de dados para previsão de plano de fidelidade para clientes de uma companhia aérea.

data-science data-visualization database gradio python sql

Last synced: 04 May 2026

https://github.com/aran203/cricanalytics

ADSC Fall 24 Project for cricket analytics with hawkeye data

data-engineering data-visualization python streamlit

Last synced: 14 May 2026

https://github.com/ax-va/interactive-data-visualization-dale-2023

These examples on Interactive Data Visualization in the browser using Flask and D3.js are compiled with some modifications from the book "Data Visualization with Python and Javascript: Cleaning, Cleaning, Exploring, and Transforming Your Data" by Kyran Dale, published by O'Reilly Media in 2023.

ax-va d3 d3-visualization d3js data-science data-visualization dataviz javascript python

Last synced: 13 Mar 2025

https://github.com/grascya/sleep-health_-lifestyle-dataset

Classifier to predict the presence of a sleep disorder based on the other columns in the dataset.

data-visualization exploratory-data-analysis joblib machine-learning-algorithms pickle python statistical-analysis

Last synced: 20 May 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/casperkristiansson/finance-tracker

A project which solved an issue of mine which was tracking my finance. This Finance Tracking application gives overviews of expenses and income to give its users an easy way to explore their data.

dashboard data-visualization finance-management firebase-auth react

Last synced: 29 Dec 2025

https://github.com/benmar2406/rent-in-germany

Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.

charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte

Last synced: 26 Mar 2025

https://github.com/the-ethan-hunt/dekh-data

Playground for data visualization notebooks

data-visualization jupyter-notebook python

Last synced: 28 Mar 2025

https://github.com/eerkela/data-science-specialization

Coursework for Johns Hopkins University's Data Science Specialization, provided through Coursera

data-science data-visualization exploratory-data-analysis machine-learning nlp r regression statistics

Last synced: 31 May 2026

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/saisathvik07/e-commerce-sales-analysis-using-sql-and-powerbi

This repository provides an extensive examination of Amazon Sales Data utilizing SQL

analytics data-science data-visualization mysql-database powerbi

Last synced: 22 Mar 2025

https://github.com/satyacoder29/smartfinance-dynamic-financial-dashboard

SmartFinance: Dynamic Financial Dashboard is an interactive tool designed to visualize key financial metrics like revenue, expenses, and profit. It features real-time data updates, charts, slicers, and navigation for easy analysis. This dashboard helps businesses make data-driven decisions and optimize financial performance.

data-analysis data-cleaning data-modeling data-visualization powerbi powerbi-desktop powerbi-visuals powerquerym

Last synced: 13 Feb 2026

https://github.com/mindlessmuse666/features-scaling

Проект по масштабированию признаков датасета Iris с использованием Python, Pandas, Scikit-learn, Seaborn и Plotly. Включает визуализацию данных, применение различных методов масштабирования и оценку производительности модели логистической регрессии.

data-scaling data-visualization feature-engineering iris-dataset machine-learning pandas plotly python scikit-learn seaborn student-project

Last synced: 16 Jun 2025

https://github.com/prsdthkr/viz-design-demo

📈 This repo houses my experiments with D3 (mostly inspired by other's work) for information visualization class project and demo.

d3 data-visualization lgbtq

Last synced: 06 May 2026

https://github.com/harmonicode/filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 22 Mar 2025

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/Akhil-krishnan-r/super_market_analysis

The growth of supermarkets in most populated cities are increasing and market competitions are also high. This dataset is one of the historical sales of supermarket company which has recorded in 3 different branches for 3 months data. Predictive data analytics methods are easy to apply with this dataset

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 Feb 2026

https://github.com/sadratehranian/pem-fuel-cell

The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.

anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code

Last synced: 26 Mar 2025

https://github.com/piras-s/oceananalysisprocess

This project models expected temperature drop across the ocean's cool skin layer behavior from meteorological inputs, then compares predictions to 2021 satellite data to identify anomalies. Includes uncertainty quantification and spatial-temporal analysis.

bayesian-inference data-visualization gaussian-process-regression gaussian-processes machine-learning model-evaluation ocean python3 satellite-data

Last synced: 01 Jul 2026

https://github.com/holy-angel-university/student-performance-analysis

This project analyzes student data to understand factors affecting final exam scores. Data includes study habits, extracurriculars, family background, school environment, and demographics. The goal is to identify key contributors to academic success.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 06 Apr 2025

https://github.com/shreedata/data-analysis-using-python-libraries-

The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.

covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn

Last synced: 28 Mar 2025

https://github.com/jatin-mehra119/insurance_dataset

The objective of this project is to predict insurance charges based on various factors.

data-visualization dataanalysis prediction-model python regression-models

Last synced: 15 May 2026

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/maazie-khan/power-bi-projects

Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

dashboard data-analysis data-science data-visualization database excel powerbi

Last synced: 02 Jan 2026

https://github.com/nick-peter-marcus/detect-fake-job-postings

Detecting Fake Job Postings - Data Visualization, TF-IDF, XGBoost, SVC

cross-validation data-visualization machine-learning svc tf-idf xgboost

Last synced: 09 Jul 2025

https://github.com/vaxdata22/customer-churn-data-analytics-etl-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This orchestration uses Apache Airflow on AWS EC2 as well as AWS Glue. It demonstrates how to build ETL pipeline that would perform data transform using Glue job/crawler as well as loading into a Redshift table. It also shows how to connect Amazon Athena to Glue Data Catalog, and Power BI to Redshift.

amazon-athena amazon-redshift apache-airflow aws-ec2 aws-glue aws-s3 business-intelligence customer-churn-analytics dags data-visualization etl-pipeline orchestration power-bi python3

Last synced: 18 Jun 2025

https://github.com/arosas17/bikesharing

Use of Tableau Public to create unique graphs to visualize clear patterns in a bike-sharing program and possibly be applied to a new bike-sharing program.

data-visualization tableau-public

Last synced: 25 Apr 2026

https://github.com/srinibas-masanta/hotel-revenue-analysis-dashboard

This project focuses on analyzing hotel booking data to uncover key metrics and insights that drive revenue management decisions. By creating an interactive Power BI dashboard, the project aims to improve strategic decision-making, optimize occupancy rates, and enhance overall financial performance within the hospitality industry.

business-analytics data-analysis data-science data-visualization dax-functions hospitality powerbi

Last synced: 12 Jan 2026

https://github.com/nick-peter-marcus/chocolate-bar-analysis

Analyzing Chocolate Bar Features and Ratings - Data Visualization, Decision Trees, Random Forest

data-analysis data-visualization decision-trees python random-forest seaborn sklearn

Last synced: 10 May 2026

https://github.com/prakhar-code/house_sales_analysis

House Sales Analysis Of King County, Washington, USA and Clean Visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/sanikaptl/snooze-monitor

SnoozeMonitor is your essential app for enhancing and managing your sleep habits. Offering personalized sleep recommendations, detailed sleep tracking, and insightful analysis of your sleep patterns, it’s designed to help you achieve better rest every night.

data-visualization insomnia machine-learning taipy-gui

Last synced: 26 Mar 2025

https://github.com/zborovskaanna/grosery_store_sales_analysis

Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau

data-analysis data-visualization matplotlib numpy pandas python seaborn tableau

Last synced: 08 Apr 2026

https://github.com/carol-neto/sprint-3-data-manipulation

In this project I analyzed the behavior of users of an online market.

data-visualization graphics matplotlib python

Last synced: 15 May 2026

https://github.com/iankitnegi/ms-data-analyst-professional-certificate

Journey through the Microsoft Power BI Data Analyst Certificate with notes, projects, and exercises. 🚀

data-visualization microsoft powerbi

Last synced: 24 Jan 2026

https://github.com/janashanaa/flightanalysis

This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 15 May 2026

https://github.com/karishmagupta05/e-commerce-sales-dashboard

This project is an interactive E-Commerce Sales Dashboard built using Power BI. It provides key insights into sales, profit, and customer behavior through visually engaging charts and graphs.

data-analysis data-visualization powerbi

Last synced: 09 Feb 2026

https://github.com/archanakokate/bank_term_deposit_prediction

Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.

data-analysis data-visualization exploratory-data-analysis machine-learning

Last synced: 14 Sep 2025

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/obacreativity/trading-gpt

TradeGPT is an intelligent trading bot built with ChatGPT and AI to automate and optimize trading strategies. It analyzes market data, predicts trends, and executes trades in real-time, providing traders with tools to enhance efficiency and profitability.

ai-trading algorithmic-trading automated-trading backtesting chatgpt crypto-trading data-visualization financial-analysis machine-learning market-data openai portfolio-management real-time-data risk-management sentiment-analysis stock-market technical-indicators trade-execution trading-strategies

Last synced: 27 Mar 2025

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/asimpson/is-steph-mvp

🏀 Compare the 2018 NBA MVP contenders against Steph Curry's historic, unanimous, 2016 MVP season.

data-visualization nba reactjs

Last synced: 20 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/alra-code/data-analytics-com-power-bi

Desafio de projetos do Boocamp Data Analytics realizado pela Dio Me em 2024

analytics data-visualization desafios-resolvidos dio-bootcamp powerbi pt-br

Last synced: 25 Jan 2026

https://github.com/karanch10/fraudshield

FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.

data-analysis data-science data-visualization machine-learning python3

Last synced: 20 May 2026

https://github.com/youssef-saaed/activity-recognition-using-various-ml-algorithms

This project involves a comprehensive comparative analysis of various machine learning models to classify activities based on a given dataset. The analysis follows a structured approach, including data exploration, model training, model evaluation, and results interpretation to identify the best performing model.

activity-recognition comparative-analysis cross-validation data-exploration data-visualization machine-learning model-evaluation model-training neural-networks

Last synced: 22 Mar 2025

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 01 Nov 2025

https://github.com/erpix3lt/threejs-data-visualisation

Take on Jer Thorps data visualisation of 138 Years of Popular Science, only in Threejs.

138-years-of-popular-science data-visualization react-three-drei react-three-fiber three-js

Last synced: 12 Mar 2025

https://github.com/kaladabrio2020/dataanalysiswith-r

Analise de dados no R, estatistica etc

analise-de-dados data-visualization r

Last synced: 16 May 2026

https://github.com/kaustubh-indulkar/te-it-dsbda-assignmnets

This repository contains the solutions for a series of assignments covering Data Science And Big Data Analytics concepts.

big-data big-data-analytics data-analytics data-science data-visualization sppu-2019-pattern sppu-it-dept

Last synced: 29 Mar 2025

https://github.com/satyacoder29/comparison-of-region-based-sales-tableau

The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.

data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions

Last synced: 02 Feb 2026

https://github.com/patricksferraz/aqw-madrid-data-analysis

Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.

air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data

Last synced: 30 Jan 2026

https://github.com/djsprenk/djsprenk.github.io

GitHub Pages site for DJ Sprenk

d3 d3-visualization data-visualization dj music python

Last synced: 20 May 2026

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/theshashanksinha/deloitte-au

Analyzed telemetry and salary equality data using Tableau and Excel to identify machine downtime patterns and assess gender pay equity, translating raw data into actionable business insights.

data-analytics data-visualization microsoft-excel tableau

Last synced: 06 Mar 2026

https://github.com/leonardoberlatto/1000-startups-analytics

Data analytics on startups data using Tableau

analytics data-science data-visualization tableau

Last synced: 11 Jan 2026

https://github.com/sayamalt/tmdb-movies-end-to-end-etl-and-ml-pipeline

This project encompasses end-to-end ETL and ML pipeline development. Data ingestion from TMDB API covered top-rated, current, upcoming, and popular movies with genres. Performed EDA to derive several valuable insights and observations. Developed a regression model with 97% r2 score to predict average movie ratings accurately.

azure-databricks azure-key-vault data-ingestion data-transformation data-visualization etl-pipeline exploratory-data-analysis extract-transform-load feature-engineering mlflow mlflow-tracking model-training-and-evaluation pyspark-mllib regression-models spark

Last synced: 15 May 2026

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/nmfs-opensci/nmfs-repos

A list of NMFS repos using the GitHub API

data-visualization github-api

Last synced: 11 Apr 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/vinay-ram1999/srdbv5-analytics

Exploratory data analysis and visualisation of a global Soil Respiration Database (SRDB) using R, MySQL and Tableau.

analytics data-visualization exploratory-data-analysis mysql soil-respiration tableau

Last synced: 01 May 2026