Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/fatihilhan42/wnba-draft-player-dataanalysis-1997-2022-with-python
In this project, the statistics of the players in the WNBA drafts from 1997 to 2022 were examined. The dataset, which you can find in the repo, was first cleaned using data cleaning algorithms. The cleaned data were then plotted using data visualization algorithms.
data-analysis data-analysis-python data-visualization jupyter-notebook python
Last synced: 17 May 2026
https://github.com/dhruvil-26/sql-projects
This repository contains SQL projects focusing on data analysis and insights. Currently, it includes: 1. RSVP Movies Analysis - SQL queries to analyze movie trends, ratings, and genres. 2. Pizza Sales Analysis - SQL queries to explore sales patterns, customer behavior, and profitability in a pizza business.
analysis data-analysis database mysql pizza-sales-analysis rdbms rsvp sql
Last synced: 17 May 2026
https://github.com/leoz0214/foodhygieneanalysis
Data analysis regarding Food Hygiene Ratings in England, Wales and Northern Ireland.
data-analysis food-hygiene-ratings pandas python
Last synced: 17 May 2026
https://github.com/iamber12/stack-overflow-analysis-using-stack-exchange-api
This Python-based project uses the Stack Exchange API to analyze Stack Overflow data, focusing on the 'R' and 'Dot Net' programming tags.
data-analysis data-visualization python stack-exchange-api
Last synced: 20 Jul 2025
https://github.com/blankscreen-exe/tsf_datascience
Repo for all TSF internship tasks
data-analysis data-mining data-mining-algorithms python
Last synced: 17 May 2026
https://github.com/dsite42/simple_data_visualizer
This is a simple tool to visualize data for a quick Exploratory Data Analysis (EDA). You can create various plot types as seaborn or plotly plots via a GUI in multiple windows (RelPlot, PairPlot, JointPlot, DisPlot, CatPlot, LmPlot, 3DPlot).
data-analysis data-science data-visualisation data-visualization eda exploratory-data-analysis plotly seaborn
Last synced: 12 May 2026
https://github.com/jaymax01/website-performance-analysis
Analyzing retail performance
data-analysis data-visualization feature-engineering google-colab metrics presentations python
Last synced: 19 May 2026
https://github.com/abdullahashfaqvirk/PowerBI-Dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data-driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 27 Sep 2025
https://github.com/josafary-ds/curso_dnc
Repository for storing study files and projects for DNC - Data Scientist
data-analysis data-science data-visualization machine-learning powerbi python
Last synced: 13 Mar 2025
https://github.com/alansteinbarth/eksploracyjna-analiza-danych-o-pasazerach-statku-titanic
🔍 Titanic EDA: discovering survival patterns through data analysis. A professional project with visualizations and insights.
analytics csv data-analysis data-science data-visualization dataset eda exploratory-data-analysis jupyter-notebook kaggle machine-learning matplotlib numpy pandas portfolio python seaborn statistics titanic visualization
Last synced: 11 Apr 2026
https://github.com/rachkat/random-foresst-analysis-r-studio-plotting-classification-tree
Classification analysis in R using the birthwt dataset. Built and compared Decision Tree and Random Forest models to predict low birth weight. Both achieved 71.05% accuracy, with Random Forest reducing overfitting and confirming maternal weight and age as key predictors.
classification data-analysis decision-trees machine-learning predictive-modeling r random-forest
Last synced: 04 Oct 2025
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/jm199504/data-analysis-practice
Data analysis practice (Titanic / BankCustomers)
Last synced: 02 May 2026
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 07 Apr 2026
https://github.com/sadratehranian/pem-fuel-cell
The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.
anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code
Last synced: 26 Mar 2025
https://github.com/pylena/movies-prediction
This project focuses on clustering movies based on their genres using machine learning techniques. By analyzing genre data, the model groups similar movies together, facilitating recommendations and insights into genre-based patterns.
data-analysis machine-learning render streamlit unsupervised-learning
Last synced: 18 May 2026
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/judyway2/de-data
A brief analysis of schools' ARR data
data-analysis jupyter-notebook
Last synced: 11 May 2025
https://github.com/natanel567/university_machine_learning_project
Machine Learning final project, Tel Aviv University
data-analysis jupyter-notebook machine-learning
Last synced: 11 May 2025
https://github.com/victoorv/detection_malwares
The goal of this project is to develop a classifier capable of distinguishing malware from goodware.
classification data-analysis data-science machine-learning machine-learning-algorithms malware-analysis malware-detection oversampling-algorithms python scikit-learn supervised-learning undersampling-algorithms
Last synced: 28 Apr 2026
https://github.com/prakhar-code/british_airways_review_analysis
Analysis of the British Airways Reviews by Customers, filtered by several different factors such as food, entertainment, services, etc.
data-analysis data-cleaning excel tableau-dashboards tableau-public tableau-visualization
Last synced: 15 Jan 2026
https://github.com/preciousclement/maternal-experiences-in-nigeria
This repository contains a Python-based project that generates realistic synthetic data simulating the maternal health journey of 5,000 women in Nigeria.
data-analysis data-generation maternal-health nigeria public-health python
Last synced: 08 May 2025
https://github.com/chen0040/python-data-analytics-feature-selection
Python project on feature selection
data-analysis feature-engineering feature-selection
Last synced: 03 Apr 2025
https://github.com/malexandersalazar/tools-python-mssql-statistics-descriptor
A lightweight tool based on sweetviz that generates high-density visualizations to kickstart Exploratory Data Analysis within Microsoft Azure SQL Database using ODBC with just one line of code
azure-sql-database data-analysis data-visualization eda python
Last synced: 16 May 2026
https://github.com/ziaeemehr/neuro_toolbox
Single Header File C++ library for analysis of neurophysiological and simulated data.
data-analysis data-science signal-processing synchronization
Last synced: 21 Jul 2025
https://github.com/rafinha0rafinha/web-analyzer-backend
(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.
azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer
Last synced: 10 Apr 2026
https://github.com/mfakhriazhar/stock-price-prediction
Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.
data-analysis data-science deep-learning python recurrent-neural-networks
Last synced: 18 May 2026
https://github.com/mh0386/motorcycle_data_analysis
Data analysis applied to motorcycle dataset.
Last synced: 19 Jul 2025
https://github.com/alexjackson1/commons-indicative-votes
A cluster analysis of the House of Commons' Indicative Brexit Voting Process on 27 March 2019
Last synced: 19 Jul 2025
https://github.com/spring-0/netflix-media-data-analysis
Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.
Last synced: 27 Mar 2025
https://github.com/jasonsu131/cps188-term-project
A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.
c data-analysis data-statictics file-reading
Last synced: 28 Mar 2025
https://github.com/kwonnayeon/urban-parks-childrens-happiness
Grad thesis on urban parks’ impact on children’s happiness – data, results, and code
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 17 Feb 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/sidsin0809/hmdb-endo-flagger
A Python toolkit to identify and score endogenous human metabolites from HMDB XML metadata
data-analysis hmdb metabolomics ontology pipeline python-3 streaming-parser xml-parsing
Last synced: 06 Jul 2025
https://github.com/velut/thesis-sw
Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis
bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis
Last synced: 19 Jun 2025
https://github.com/mae776569/weratedogs-wrangling
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations
data-analysis data-science data-visualization tweets twitter-api
Last synced: 25 Jan 2026
https://github.com/simranjeet97/covid-19
Covid-19 data analysis, covering the important topics needed to understand the impact and possible solutions.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/faint-liebfraumilch101/fraud-detection-sql-unsupervised
🕵️♂️ Detect fraud in bank transactions using SQL for feature engineering and Python's Isolation Forest for unsupervised anomaly detection.
anomaly-detection banking-data data-analysis data-science financial-analytics fraud-detection isolation-forest machine-learning portfolio-project python sql sqlite unsupervised-learning
Last synced: 07 May 2026
https://github.com/bhaveshbhakta/mobile-price-prediction-using-xgboost
Mobile Price Prediction
data-analysis data-visualization machine-learning mobile-price-prediction xgboost
Last synced: 19 Jul 2025
https://github.com/prasannnnn/real-time-share-price-scraping-and-analysis
The Stock Sentiment Analyzer is a web-based application built with Streamlit, BeautifulSoup, and Pandas to help users analyze the sentiment of a stock (BUY, SELL, or HOLD) based on its financial data. The tool extracts key financial metrics like Market Cap, Stock P/E, Dividend Yield, ROCE, ROE, and the 52-week High/Low from Screener.in.
beautifulsoup4 data-analysis python sentiment-analysis streamlit streamlit-dashboard webscraping
Last synced: 03 Aug 2025
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnb listings
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 18 Jul 2025
https://github.com/mfakhriazhar/ecom-qtt-prediction
In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.
data-analysis data-science data-visualization e-commerce-project eda machine-learning python
Last synced: 19 May 2026
https://github.com/hadjiprocopis/histocurse
A Java implementation of a multidimensional histogram backed by either a dense (conventional) or a sparse array. Extremely efficient when the number of dimensions is large and the backing store is a sparse array. This module depends on other projects, which can be found on my repo here. See README below to see what you need to download.
data-analysis data-structures histogram multidimensional
Last synced: 03 Jul 2026
https://github.com/kenwuqianghao/scotiabank-datathon-2023
Code and data analysis done for 2023 Scotiabank Datathon
data-analysis fraud-detection jupyter-notebook python
Last synced: 18 May 2026
https://github.com/lulloooo/bizdata-nexus
Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊
business-analysis data-analysis economics etl excel finance mysql python r risk-analysis
Last synced: 05 Apr 2026
https://github.com/abhinavhariyal/diwali-sales-analysis
This project covers data visualization and analysis of Diwali sales data using Python and Jupyter Notebook.
data-analysis data-visualization jupyter python
Last synced: 19 May 2026
https://github.com/yash22222/british-airways-data-science-internship
Both tasks assigned by the British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 16 May 2026
https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data
Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.
data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping
Last synced: 30 May 2026
https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset
This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations
business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis
Last synced: 07 Apr 2026
https://github.com/annaanastasy/classification-project-student-grades
A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.
catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling
Last synced: 29 Mar 2025
https://github.com/stynw7/asa-international-data-quest-2025
ASA: International Data Quest 2025 🔥
data-analysis data-mining data-visualization jupyter-notebook python
Last synced: 17 May 2026
https://github.com/manuelgil/vscode-data-pack
This extension pack includes the essential extensions for data analysts.
data-analysis data-science data-structures data-visualization vscode-extension
Last synced: 07 Apr 2026
https://github.com/theveryhim/massive-text-processing
Cleaning, processing, and analysis of a papers dataset using the PySpark (RDD) framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 18 Jul 2025
https://github.com/prarthana-singh/heart-attack-prediction-model
A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.
classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn
Last synced: 25 Jun 2025
https://github.com/sparkerdata/hockeyshotmap
Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).
data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics
Last synced: 18 May 2026
https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql
In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.
cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql
Last synced: 18 May 2026
https://github.com/ivanayala96/end-to-end-business-intelligence-solution-logistics-financial-performance-dashboard
Project Overview: This project features a comprehensive Power BI solution developed for Ayala's Consultancy. It transforms raw operational data (generated via Python) into a strategic decision-making tool, managing a dataset of $7.71M in total sales and over 2,500 transactions.
anlytics bussines-report bussiness-intelligence data-analysis dax power-bi powerbi python
Last synced: 22 Apr 2026
https://github.com/theveryhim/web-scraping-and-statistical-tests
Crawling the web for data and performing statistical tests to verify judgments
data-analysis hypothesis-testing web-scraping
Last synced: 18 Jul 2025
https://github.com/dacosmicgiant/marketing-sms-analyser
Mini project for R language SEM - V
Last synced: 21 Mar 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/stefagnone/unsupervised-analysis-project
This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.
data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics
Last synced: 05 Apr 2025
https://github.com/stefagnone/data_storyboarding_visualization
Data Storyboarding and Visualization Techniques for Effective Communication
data-analysis data-visualization ggplot2-analysis r tableau-dashboards
Last synced: 05 Apr 2025
https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis
Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.
business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing
Last synced: 03 May 2026
https://github.com/stefagnone/moneyball_project
Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.
baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics
Last synced: 05 Apr 2025
https://github.com/rorrell/rightwhaledata
A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization
data-analysis data-visualization jupyter-notebook python3
Last synced: 11 May 2026
https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi
"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."
business-intelligence data-analysis data-visualization dax powerbi python sql
Last synced: 18 May 2026
https://github.com/jatin-mehra119/car_price_prediction
Predicting car prices using a small dataset.
data-analysis data-visualization jupyter-notebook machine-learning python regression-models sklearn sklearn-pipeline
Last synced: 07 Apr 2026
https://github.com/tarasbln/big-quant
Official public repository of the Berlin Investment Group (BIG) Quant Team, featuring quantitative finance research, algorithmic trading strategies, market analyses, educational materials, and open-source projects.
data-analysis education finance investment investment-club python3 quantative-finance quantative-trading quantitative-research research
Last synced: 21 Mar 2025
https://github.com/misszeferino/netflix-exploratory-analysis
Netflix exploratory analysis using Python
data-analysis data-visualization pandas plotly python
Last synced: 07 Apr 2026
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 07 Apr 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/theveryhim/basic-data-analysis
Working with basic Python tools frequently used in data science
data-analysis data-processing visualization
Last synced: 18 Jul 2025
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 02 Jan 2026
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/oshinrathor/Data-Science-Systems-and-Analytics-Projects
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 12 Sep 2025
https://github.com/cyblx/clustering
This project explores clustering techniques and supervised learning applied to World Cup team performance analysis. The methodologies include K-Means, DBSCAN, K-Nearest Neighbors, Gaussian Mixture Models (GMM), and Agglomerative Clustering.
clustering data-analysis dbscan gmm kmeans supervised-learning unsupervised-learning world-cup
Last synced: 18 Jul 2025
https://github.com/ahmeddhus/exploring-football-data-analysis
Learning and exploring data analysis through real-world datasets using Python, the StatsBomb APIs, and the mplsoccer library
data-analysis jupyter-notebook mplsoccer python statsbomb
Last synced: 17 May 2026
https://github.com/nikbarb810/covid_growth_rate_390.51
Exploring Covid Growth Rate of European Population using genetic data analysis
bioinformatics data-analysis r rcpp
Last synced: 07 Apr 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/hoxo-m/blog
HOXO-M Blog
data-analysis data-science r-package
Last synced: 30 Oct 2025
https://github.com/karlyndiary/bellabeat-eda
Bellabeat Case Study - Google Data Analytics Capstone using Python.
bellabeat bellabeat-case-study bellabeat-eda bellebeat-data-analysis case-study case-study-analysis data-analysis data-visualization eda python reports
Last synced: 17 May 2026
https://github.com/akash1070/predicting-zomato-restaurant-ratings
Perform extensive Exploratory Data Analysis (EDA) on the Zomato dataset, build a machine learning model that helps Zomato restaurants predict their ratings based on certain features, and deploy the model via Flask.
data-analysis extratreesregressor flask linear-regression machine-learning random-forest zomato-bangalore zomato-data-analysis
Last synced: 18 May 2026
https://github.com/huynhtanphatt/diagnosing-uk-railway-performances
This project analyzes UK railway ticket and operation data to show how revenue, passenger demand, and on-time performance are connected.
data-analysis data-visualization datastorytelling python railway sql ticketing transportation
Last synced: 24 Apr 2026
https://github.com/sbera01/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 24 Dec 2025
https://github.com/0xnu/england-house-prices
Predict house prices for the next five years across all English local authorities.
data-analysis england england-house-prices housing-market housing-market-analysis predictive-modeling regression
Last synced: 03 Aug 2025
https://github.com/ofir-frd/predict-success-of-a-restaurant
Apply machine learning to a restaurant database. Study and analyse the data to predict whether a restaurant will be successful.
data-analysis data-science machine-learning visualization
Last synced: 11 Jun 2026
https://github.com/dinamohsin/toman-bikeshare-data-analysis-sql-power-bi
This project involves data analysis using SQL, Power BI, and CSV datasets to extract insights and visualize key business metrics.
csv-files data-analysis data-visualization database powerbi sql sql-server
Last synced: 22 Apr 2026
https://github.com/jerinpious/house-price-prediction
This project is a machine learning-based application to predict house prices. A frontend interface has been developed using Streamlit to make the prediction process user-friendly for regular customers. The project is structured
data-analysis data-engineering data-science eda machine-learning pandas python random-forest scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/harmanveer-2546/movie-industry
Investigate the film industry to understand what contributes to success, and use this analysis to create actionable recommendations for companies entering the industry.
business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python
Last synced: 10 Apr 2026
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 29 Sep 2025
https://github.com/venkat-023/thyroid-cancer-prediction
This project aims to develop a machine learning pipeline to predict thyroid cancer based on patient data. The dataset was sourced from multiple public repositories, cleaned, and merged to create a comprehensive dataset for modeling. Various classification algorithms were implemented, including Random Forest, Logistic Regression, K-Nearest Neighbors
data-analysis data-cleaning deep-learning ensembling-methods hyperparameter-tuning machine-learning-algorithms nueral-networks
Last synced: 17 May 2026
https://github.com/namratha2301/starbucks_global_presence
Exploring the global presence of Starbucks.
business-analysis data-analysis data-science data-visualization matplotlib pandas pycountry
Last synced: 19 May 2026
https://github.com/sreejabethu/smart-report-analyzer
An AI-powered app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.
data-analysis huggingface llm nlp pdf-analysis python question-answering streamlit summarization
Last synced: 18 May 2026
https://github.com/maheera421/pandas
Implementation of essential Pandas functions.
data-analysis data-manipulation pandas-dataframes pandas-datareader pandas-python
Last synced: 17 Jul 2025
https://github.com/amr-yasser226/interactive-sales-analytics-dashboard
An interactive web-based dashboard for visualizing multinational electronics sales data. This project for the DSAI 203 course integrates a Python/Flask backend with an amCharts frontend to provide dynamic insights into product revenues, sales distribution, and employee statistics across different countries.
am5charts amcharts business-intelligence css dashboard data-analysis data-analytics data-visualization flask html javascript python sqlalchemy sqlite web-application
Last synced: 13 Apr 2026
https://github.com/priyadarshinijain/air-quality-data-analysis-and-visualization
🌍 Air Quality Data Analysis and Visualization
data-analysis jupyter-notebook python visualization
Last synced: 06 Feb 2026
https://github.com/cowboymrzamo2380/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 05 Apr 2025
https://github.com/mhkamel/ecommerce-targeting-system
A Flask-based E-Commerce Targeting System that provides customer segmentation and personalized product recommendations. Users can upload structured interaction data for analysis, receive AI-driven recommendations, and gain insights into user behavior. The application is built with Flask, Pandas, Scikit-Learn, and integrates an interactive web inter
ai bootstrap csv-processing customer-segmentation data-analysis data-science e-commerce flask machine-learning pandas python recommendation-system scikit-learn user-behavior web-application
Last synced: 09 Apr 2026