An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/gustavo-b-morales-s/footwear-market-data-pipeline

This project uses Python, Scrapy, SQLite3 and Streamlit to extract sports shoe data from Mercado Livre, perform transformations using Pandas, store the data in an SQLite database and create a data visualization interface emphasizing the main KPIs using Streamlit.

data-engineering data-visualization etl-pipeline webscraping

Last synced: 06 Apr 2025

https://github.com/faizantkhan/automated-eda

This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.

automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz

Last synced: 18 Apr 2026

https://github.com/simranshaikh20/credit-card-dashboard

A Data Visualization Project using Microsoft Power bi

data-analysis data-visualization powerbi

Last synced: 02 Jan 2026

https://github.com/suryavamsi-p/customer-churn-prediction

This project is made in order to predict any Customer exit possibility for any Telecom Industries like AT & T, T - mobile and Verizon.

artificial-intelligence data-visualization decision-tree-classifier machine-learning r random-forest-classifier rstudio

Last synced: 06 Apr 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights

Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.

data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis

Last synced: 19 Mar 2026

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 14 May 2025

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 08 Apr 2026

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 17 Jul 2025

https://github.com/divyashah0510/sales-insights-for-retail-company

This project is a data analysis project for a retail company. The company has dataset: sales_data.csv that contains the sales data for the company. The objective of this project is to analyze the sales data and provide insights to the company to improve their sales.

dash data-visualization pandas plotly sales sales-insights streamlit visualization

Last synced: 02 Jan 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/maazie-khan/austin-housing-insights-powerbi

Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/saniyaacharya04/youtube-trending-video-analyzer

Modular Streamlit dashboard for analyzing trending YouTube videos by views, engagement, and category—powered by the YouTube Data API.

api-analysis clustering-engagement-metrics dashboard data-visualization modular-architecture streamlit trending youtube

Last synced: 21 May 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/amanssur-tech/d3-visualizations

Modern React + D3 data visualization dashboard built with Vite, Tailwind & Framer Motion.

d3 dashboard data-visualization framer-motion react tailwindcss typescript vite

Last synced: 08 Apr 2026

https://github.com/maazie-khan/power-bi-projects

Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

dashboard data-analysis data-science data-visualization database excel powerbi

Last synced: 02 Jan 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/foufou-exe/occitanie-report-rental-yields

This project aims to develop a datavisualization and reporting tool to analyze rental yields in the Occitanie region, for use by real estate investors.

data-visualization jasper java opendata python reporting

Last synced: 22 May 2026

https://github.com/andryadsm/pizza-sales-report

🍕 Project Pizza Sales Report (MySQL, Tableau)

dashboards data-analysis data-visualization database-management mysql sales sql tableau

Last synced: 14 May 2025

https://github.com/rubayeaalketbi/real-time-text-sentiment-analysis-with-azure-functions

A serverless application that performs real-time sentiment analysis on text messages using Azure Functions.

azure azure-functions data-visualization python sentiment-analysis

Last synced: 22 May 2026

https://github.com/timjjting/escaping-flatland-slides

Slides for techniques behind escaping flatland

data-visualization glsl lod octree threejs

Last synced: 14 May 2025

https://github.com/gabe-zhang/cf-dataviz

A visual data exploration of campaign finance data

data-visualization ggplot2 r

Last synced: 06 Apr 2025

https://github.com/andrew-dev-p/chartjs-showcase

Interactive data visualizations using Chart.js with smooth animations and dynamic updates

bar-chart chartjs charts css data-visualization html interactive-graphs javascript line-chart pie-chart

Last synced: 18 Feb 2026

https://github.com/namratagulati/fraud_detection

This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.

data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python

Last synced: 04 Jun 2026

https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.

bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization

Last synced: 05 Apr 2025

https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends

Last synced: 27 Apr 2026

https://github.com/bala-1409/loan-clustering-datascience-projects

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning

Last synced: 22 Mar 2025

https://github.com/beolawork-art/novabank-churn-analysis

NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.

data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql

Last synced: 08 Apr 2026

https://github.com/rfonod/tableau-dashboard

A Tableau dashboard visualizing the change in the Summary Innovation Index (SII) from 2012 to 2019, relative to selected academic research trends in Europe. It includes a bar chart for country comparisons, a scatter plot for trend analysis, and a map to show geographic patterns, with interactive features for enhanced insights.

dashboard data-visualization innovation-index tableau visualization

Last synced: 25 Jan 2026

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/nmatthews2203-del/rent-affordability-explorer

Interactive housing analytics dashboard using Zillow rent data and Census income data to analyze affordability, rent trends, and geographic housing differences across U.S. counties.

altair data-analytics data-visualization housing-data interactive-dashboard pandas plotly python real-estate sql sqlite streamlit

Last synced: 03 May 2026

https://github.com/samuelbarbosadev/walrmart_data_analysis

You have been hired by Walmart to survey the revenue of their stores in the USA and point out which store would be best to expand its size. It is necessary to analyze the weekly sales of each store, calculate some important information that will be asked, and at the end of it all, indicate which store should be invested in.

data-preparation data-understanding data-visualization pandas python

Last synced: 08 May 2026

https://github.com/muthukumar0908/phonepe-pulse-data-visualization-and-exploration

Creating a dashboard by using streamlit application. in this app visualizing the data taken from Phonepe pulse Github repository.

data-visualization github-config mysql-connector-python mysql-database pandas plotly python streamlit-webapp

Last synced: 29 Apr 2026

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/dbolotov/ts_smoothing_visualizer

Streamlit app for visualizing and comparing time series smoothing methods on real and synthetic datasets.

data-science data-visualization streamlit time-series

Last synced: 24 Jul 2025

https://github.com/bris0yzbekaye/json-to-excel-converter

This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.

automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools

Last synced: 25 Jul 2025

https://github.com/raufjatoi/data-vis-

Data visualization đź’» learning from kaggle

data-visualization matplotlib matplotlib-pyplot seaborn

Last synced: 24 Jul 2025

https://github.com/shinjimc/simulated_annealing_tsp

This project applies Simulated Annealing to solve the Traveling Salesman Problem using Peru's departments as nodes. Through iterative refinement, it finds the shortest route visiting each department once. Visual feedback enhances understanding and debugging, resulting in an optimal solution displayed with total distance.

data-visualization geospatial-analysis simulated-annealing simulated-annealing-algorithm simulated-annealing-edge-detection traveling-salesman-problem traveling-salesman-problem-solver

Last synced: 24 Jul 2025

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ML_PM_PSD_residential_QFF

Sharing all the data pipelines and processing codes, statistical modellings, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air) Project Miestone: 2017 - 2020 Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 17 Sep 2025

https://github.com/jaguzmana/colombia-covid-analysis

A project proposed to enhance SQL proficiency and develop skills in data visualization using Tableau.

data-visualization mssql-database tableau

Last synced: 08 Mar 2026

https://github.com/prachi005748/website-performance-data-analysis-project

Briefly describe the objective of the project—e.g., analyzing website performance metrics over time, uncovering trends in user engagement, or evaluating channel-wise traffic quality.

data-analyst data-cleaning data-preprocessing data-visualization data-visualization-python exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn storytelling

Last synced: 01 May 2026

https://github.com/samiksha29-patil/e-commerce-sales-insights-dashboard

This project focuses on analyzing e-commerce sales data through data visualization. It highlights customer behavior, popular sales channels, product category trends, and city-wise performance to provide actionable business insights.

analytics customer-insights data-visualization ecommerce matplotlib numpy pandas python sales-analysis seaborn

Last synced: 03 May 2026

https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python

This project analyzes Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.

data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 04 May 2026

https://github.com/ptdewey/spotipy-wrapped

Make sense out of Spotify personal data

data-visualization jupyter-notebook python spotify

Last synced: 01 Aug 2025

https://github.com/hecatops/ad_libs

A real time advertisement data analytics platforming, displaying important metrics in easy to understand language.

dashboard data-analysis data-visualization kpi plotly-dash python

Last synced: 07 Nov 2025

https://github.com/ornl/covid19vis

Visualizations of COVID-19 case data

data-visualization scientific-visualization

Last synced: 03 Jan 2026

https://github.com/sharoonjoseph321/indian-liver-diseases

Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models

data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn

Last synced: 27 Jul 2025

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/smpotts/dash-live-updates

Figuring out how to do live updates in Dash Plotly.

dash-plotly data-visualization python

Last synced: 27 Jul 2025

https://github.com/easonlai/covid19_hk_analysis

This is code sample of data analysis (with visualization) for COVID-19 cases in Hong Kong. Data is obtained from official data.gov.hk.

covid-19 data-analytics data-science data-visualization matplotlib pandas python seaborn seaborn-plots

Last synced: 12 Apr 2026

https://github.com/akhi07rx/petals-using-r

This R code generates a plot of a flower. It uses polar coordinates and the sine function to create the petal shapes and then plots them.

data-visualization graphics opensource plot r trignometry

Last synced: 23 May 2026

https://github.com/christos-pelekis/harsourcerer

An inclusive MERN stack-based platform for comprehensive analysis and exploration of HTTP traffic data extracted from HAR (HTTP Archive) files.

data-visualization har-files http-traffic mern-stack

Last synced: 29 Jul 2025

https://github.com/shreedata/covid-da-dasboard-using-powerbi

This repository showcases a PowerBI dashboard focused on visually representing COVID-19 data for Indian states and Union Territories in an easily understandable way. The dataset is sourced from Kaggle.

data-cleaning data-visualization datanalaysis microsoft microsoft-powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/swethajoseph/statistical-stock-performance-analysis

Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.

advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis

Last synced: 03 Jan 2026

https://github.com/yash22222/literacy-exploration-analysis

Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.

csv data-analysis data-visualization government-data india literacy literacy-analysis states

Last synced: 29 Jul 2025

https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds

With recommendations to UK Department for Education of 10 Local Authorities where National Tutoring Programme (NTP) should be intensified and a response to UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target which seems far-fetched.

data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics

Last synced: 21 Sep 2025

https://github.com/zborovskaanna/e-commerce-web-events-analysis

SQL project based on the Big Query public database 'The Look e-Commerce' and a dashboard in Looker Studio

analysis bigquery dashboard data-visualization looker-studio sql

Last synced: 03 Jan 2026

https://github.com/farseenmanekhan1232/analyse-economic-cycle

A Python-based CLI tool for analyzing economic cycles and making data-driven investment decisions in the Indian stock market using Kite Connect API.

data-visualization investment matplotlib portfolio-optimization python stock-market

Last synced: 30 Jul 2025

https://github.com/sinsunsan/earth-survival-kit

Global warning data visualisation app to make everyone understand global warning and take actions that matter

angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts

Last synced: 05 May 2026

https://github.com/shaheerazam-dev/cyclistic-case-study-google-data-analytics-certificate

This case study simulates the real-world experience of a junior data analyst at Cyclistic, a fictional company. We will leverage the data analysis process framework (Ask, Prepare, Process, Analyze, Share, Act) to address critical business questions and provide data-driven insights to guide strategic decision-making.

bigquery data-science data-visualization spreadsheet sql tableau

Last synced: 06 Feb 2026

https://github.com/hooopo/ossinsight-pick

Handpicks, features, or highlights a selection of open-source repositories each week. We cherry-pick the best, trending, or otherwise interesting repositories, providing an in-depth analysis you won't find elsewhere, thus enabling developers to discover, learn from, and contribute to these noteworthy projects.

analytics data-visualization github open-source trending-repositories visualization

Last synced: 30 Jul 2025

https://github.com/sanveed-adnan/supermarket-sales-sql-project

SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.

business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite

Last synced: 08 Nov 2025

https://github.com/teamtigers/echartify

A web application built with .net core 2.2 that has come with the idea of reading the National Election's Data-set of Bangladesh in a fastest possible time and then representing the data-set with different statistical charts.

bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages

Last synced: 16 Apr 2026

https://github.com/alrza2003/google-data-analysis-case-study-cyclistic

This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.

business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau

Last synced: 09 May 2026

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 31 Jul 2025

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/darkdk123/handwashing-discovery-analysis

A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.

data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots

Last synced: 09 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-

This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.

airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server

Last synced: 09 Sep 2025

https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown

This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice

A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.

data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks

Last synced: 02 Aug 2025

https://github.com/takshak26/predict_blood_donations-

About The title of the project is “Predict Blood Donations”. It uses python as language, data science, and machine learning as the field of operation, TPOT library for model selection, logistic regression for model building, and jupyter notebook as the code editor.

data-analysis data-visualization datascience machine-learning python3

Last synced: 16 May 2026

https://github.com/thedevreda/jadaerospace

A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking extra formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/aaryan-agr/options-visualizer

A visualizer for profit and loss regions of different options strategies, allowing users to input various option parameters and visualize the resulting profit/loss zones interactively

data-visualization interactive-visualizations options options-pricing plotly python stock-options stock-options-calculator

Last synced: 22 Aug 2025

https://github.com/tashi-2004/apache-flink-spark-data-streaming

This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.

apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3

Last synced: 09 Feb 2026

https://github.com/drisskhattabi6/text-to-sql

Chat with DB : A powerful web application that transforms natural language questions into executable SQL queries against a PostgreSQL or MySQL database and visualizes the results, Using Langchain (Ollama and ChromaDB), LangGraph and Streamlit

ai-agent chat-with-db chromadb data-visualization gemini langchain langgraph mysql ollama openai postgresql streamlit text-to-sql text2sql txt2sql

Last synced: 09 Apr 2026

https://github.com/cecoeco/networks-r-project

Visualizing static networks with R (Coursera)

data-visualization igraph network-analysis r

Last synced: 04 Aug 2025

https://github.com/hari00887/analysis-of-global-terrorism

Analysis of Global Terrorism Using AHP A quantitative study of GTD data to assess attack severity and evolution across time and space.

data-analysis data-visualization powerbi

Last synced: 02 Mar 2026