Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/mboula/mboula.github.io
GitHub portfolio + interactive resume | Showcasing data projects in civil rights (housing), cannabis, and analytics
cannabis case-study civil-rights compliance dashboards data-analysis data-cleaning data-vizualization excel google-data-analytics housing open-data pattern-analysis portfolio pro-se public-data r sql tableau
Last synced: 10 Jul 2025
https://github.com/katarinatmb/serbia-protest-analysis
This project analyzes the frequency, regional distribution, and group characteristics of protests that emerged across Serbia following the fatal collapse of the Novi Sad train station roof in November 2024. The analysis explores how different communities responded in the aftermath of the disaster, using data visualization in RStudio
data-analysis data-visualization r r-mark rstudio
Last synced: 10 Jul 2025
https://github.com/colindean/allegheny_voter_reg_analysis
Allegheny County Voter Registration Analysis Tools
data-analysis data-science elections pandas polars python voting
Last synced: 16 May 2026
https://github.com/bhaveshbhakta/autistic-patients-classification-using-ann
Autistic Patients Classification
ann artificial-neural-networks autistic-patients-classification data-analysis data-visualization deep-learning
Last synced: 25 Feb 2025
https://github.com/gaurav-van/data-analysis-projects
Collections of Projects that involves Data Analysis and Informed Decision Making
data-analysis database powerbi sql
Last synced: 06 Sep 2025
https://github.com/245839/automobile-analysis
Analysis of data on imported cars to the USA performed in Python using libraries for data analysis in the Jupyter environment.
data-analysis jupyter-notebook python
Last synced: 20 May 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/vlad1343/data-visualisation
Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.
anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python
Last synced: 08 Apr 2026
https://github.com/hugo-hattori/mini_dataanalysis_project
Data Analysis Project.
data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe python
Last synced: 04 May 2026
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 20 May 2026
https://github.com/vzamboulingame/data-portfolio
This repository showcases my projects in Python and SQL, highlighting my skills in data analysis & visualization.
data-analysis data-portfolio data-science data-science-portfolio data-science-projects data-visualization jupyter-notebook portfolio python sql
Last synced: 20 May 2026
https://github.com/farhad-here/tegenx
TeGenX: Multilingual Text Generation App.TeGenX is a lightweight, interactive text generation application built with Streamlit. It leverages multiple pre-trained transformer models to generate text in both English and Persian.
data-analysis data-science deep-learning happytransformer huggingface nlp python stream text-generation text-generator textgeneration transformer web-application
Last synced: 25 Jan 2026
https://github.com/gabrielramirezv/rnaseq_2025_notas
Repository for RNA-seq class from the Undergraduate Program in Genomic Sciences.
Last synced: 29 Mar 2025
https://github.com/aakk23/netflix_sql_project
This SQL project provides an analytical overview of Netflix's movies and TV shows dataset, uncovering key insights related to content types, ratings, release trends, and geographic distribution. It helps explore patterns in content availability, audience targeting, and regional preferences to support data-driven decisions.
data-analysis netflix-data-analysis postgresql sql
Last synced: 10 Apr 2025
https://github.com/samruddhi3012/health-care-analytics
Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.
dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup
Last synced: 29 Mar 2025
https://github.com/saravanansuriya/energy-consumption-analysis
Project will analyze energy usage and greenhouse gas (GHG) emissions of Ontario's Broader Public Sector (BPS) organizations, leveraging a comprehensive database of reported data in Power Bi
data-analysis data-cleaning powerbi python-script
Last synced: 22 Mar 2025
https://github.com/ManuMoolimani/Data-Analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 10 Jul 2025
https://github.com/mkoeppe/jiawei-computations
Computations supporting Chapters 2 and 3 of Jiawei Wang's dissertation "Subadditivity of Piecewise Linear Functions", UC Davis, Ph.D. program in Mathematics, 2020
benchmark-framework branch-and-bound cluster cutting-planes data-analysis hpc integer-programming reproducible-research sagemath
Last synced: 10 Aug 2025
https://github.com/lotfiferaga/sig_explore
3d-graphics api data-analysis data-visualization openstreetmap python
Last synced: 06 Mar 2026
https://github.com/engraulleite/local-data-warehousing-with-docker
Creating a DW from 0 to hero. Starting with logical and physical modeling to valuable reports.
airbyte data-analysis datawarehouse docker etl-pipeline metabase pgadmin4 postgresql
Last synced: 01 May 2026
https://github.com/adriens/declaration-solennelle-louis-mapou-2024-06-08-data
Dataviz DECLARATION SOLENNELLE DU 2024-06-08 - LOUIS MAPOU
data-analysis datavisualization dataviz dataviz-tools gephi gephi-visualizations new-caledonia nouvelle-caledonie storytelling
Last synced: 18 Feb 2026
https://github.com/s-narasimman/zepto_inventory_sql_data_analysis
This project focuses on data cleaning, exploration, and analysis of product information from the Zepto dataset using SQL. It provides actionable insights into pricing, stock availability, discounts, and category-level performance.
aggregation categorization csv data-analysis data-cleaning kaggle postgresql sql zepto
Last synced: 16 May 2026
https://github.com/deborangueira/campeonado_kaggle_2025
Desenvolvimento de um modelo de machine learning para prever o sucesso de startups. O objetivo é identificar quais empresas têm maior probabilidade de se tornarem casos de sucesso no mercado.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/pabi1234810/data_analysis_zepto
A comprehensive SQL-based business intelligence solution for analyzing grocery store product data, inventory management, and pricing strategies. This project demonstrates end-to-end data analysis workflow from raw data exploration to actionable business insights.
analytics csv data-analysis data-science database excel kaggle kaggle-dataset mathematics pgadmin4 sql utf-8 zepto
Last synced: 01 Nov 2025
https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm
Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.
artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression
Last synced: 01 Jan 2026
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/hemangsharma/hotel-revenue-booking-analysis
This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.
analysis data-analysis data-visualization hotel
Last synced: 10 Aug 2025
https://github.com/datalopes1/fifa21_datacleaning
Neste projeto será feito o processo de limpeza e manipulação a partir do dataset FIFA 21 messy, raw dataset for cleaning/ exploring, que pode ser encontrado no Kaggle, com licensa CC0: Public Domain e enviado por Rachit Toshniwal.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/edanur-y/abalone-age-prediction-with-regression-models
Comparing the performances of simple linear, multiple linear, multi-layer perceptron and k-nearest neighbors regressions on abalone data to predict the age.
data-analysis hyperparameter-tuning missing-values-analysis outlier-analysis python recursive-feature-elimination
Last synced: 20 May 2026
https://github.com/ksharma67/eda-on-ipl
In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 07 May 2026
https://github.com/kunalkumar2001/data-analytics-python-project
Data Analyst Python Project for Portfolio
data-analysis data-anaytics matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/nafisrayan/decentai
A comprehensive platform built using ReactJS and Flask, combining blockchain technology with AI to create a secure and intelligent space for community engagement and policy discussions. Leverages NLP and LLM for meaningful interactions and sentiment analysis while ensuring data security and user privacy.
chatbot data-analysis data-visualization flask gemini gemini-ai gemini-ai-chatbot gemini-api government government-tech llm mongodb nlp polls python react tailwind voting-systems winknlp
Last synced: 12 Apr 2026
https://github.com/martachesnova/sql
Performing data modeling (ERD) and data engineering. Then, writing series of SQL queries to analyze Employee Database of a company.
data-analysis data-engineering data-modeling erd postgresql sql
Last synced: 16 May 2026
https://github.com/hassanislam463/british-airways-data-science
Analyze Skytrax reviews to uncover customer sentiments and key themes while predicting booking behavior using machine learning. This repository includes data collection, analysis, and modeling scripts alongside concise, visualized insights to improve customer experience and operational efficiency.
data-analysis data-science data-visualization
Last synced: 28 Mar 2025
https://github.com/swouf/ntds_imdb_team4
data-analysis data-visualization datascience graph-theory
Last synced: 13 May 2025
https://github.com/aicorsair/python-case-study-365-data-science-customer-segmentation-in-marketing
This repository contains a detailed case study on the segmentation of 365 Data Science customers using real-world data from an onboarding survey.
customer-segmentation data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-selection hierarchical-clustering k-means-clustering machine-learning marketing marketing-analytics matplotlib pandas python scikit-learn social-media social-media-analysis
Last synced: 13 Apr 2026
https://github.com/hassanislam463/sentiment_analysis_of_financial_news_headlines_and_affect_on_stock_price_prediction
This project analyzes financial news sentiment using a fine-tuned RoBERTa model and integrates it with stock data to predict price movements using LSTM and GRU. It highlights the role of sentiment in enhancing stock market forecasting.
data-analysis data-science data-visualization deep-learning lstm-neural-networks nlp-machine-learning
Last synced: 28 Mar 2025
https://github.com/adrianlardies/multi-asset-financial-analysis
Comparative analysis of bitcoin, gold and S&P 500 in relation to macroeconomic indicators (VIX, interest rate, CPI). We explore the evolution of a $100 monthly investment in these assets, presenting visualizations to evaluate their performance and potential as financial diversification tools.
data-analysis data-science matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/hemant-kumar786/heart-disease-prediction
Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.
correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis
Last synced: 02 Nov 2025
https://github.com/dongdong7048/newtaipei-housing-trend
新北市房價趨勢分析專案
data-analysis housing new-taipei python real-estate
Last synced: 28 Mar 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026
https://github.com/satyacoder29/smartfinance-dynamic-financial-dashboard
SmartFinance: Dynamic Financial Dashboard is an interactive tool designed to visualize key financial metrics like revenue, expenses, and profit. It features real-time data updates, charts, slicers, and navigation for easy analysis. This dashboard helps businesses make data-driven decisions and optimize financial performance.
data-analysis data-cleaning data-modeling data-visualization powerbi powerbi-desktop powerbi-visuals powerquerym
Last synced: 13 Feb 2026
https://github.com/nick-peter-marcus/chocolate-bar-analysis
Analyzing Chocolate Bar Features and Ratings - Data Visualization, Decision Trees, Random Forest
data-analysis data-visualization decision-trees python random-forest seaborn sklearn
Last synced: 10 May 2026
https://github.com/karishmagupta05/e-commerce-sales-dashboard
This project is an interactive E-Commerce Sales Dashboard built using Power BI. It provides key insights into sales, profit, and customer behavior through visually engaging charts and graphs.
data-analysis data-visualization powerbi
Last synced: 09 Feb 2026
https://github.com/waynejz/heart-disease-analysis
COMP9321 19T1 Assignment 3
data-analysis machine-learning web-application
Last synced: 04 Apr 2025
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Repositorio de trabajo de la asignatura Tratamiento Inteligente de Datos del Máster en Ingeniería Informática de la Universidad de Granada (UGR)
Last synced: 26 Apr 2026
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/habiburrahman-mu/data-wrangling
Data Wrangling is the process of converting data from the initial format to a format that may be better for analysis.
data-analysis data-mining data-science
Last synced: 21 May 2026
https://github.com/nurulashraf/linear-regression-spotify
Data Science - Spotify Linear Regression Analysis
data-analysis data-preprocessing data-visualization dataset-exploration feature-selection linear-regression machine-learning matplotlib mean-squared-error model-evaluation multiple-regression music-analytics numpy predictive-modeling python regression-analysis root-mean-squared-error scikit-learn seaborn spotify-data
Last synced: 01 May 2026
https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal
Este repositorio contiene un análisis detallado sobre la implementación de la regularización Lasso en modelos de regresión lineal para predecir el precio de vehículos. Se parte de un conjunto de datos limpio y se aplican diversas transformaciones y modelados para mejorar la precisión de las predicciones.
data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn
Last synced: 16 May 2026
https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights
The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.
chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization
Last synced: 17 Feb 2026
https://github.com/dcs-training/regressionandmixedeffectsmodelling
This course will introduce you to regression and linear mixed-effects models (LMMs). It will help to develop your theoretical understanding and practical skills for running such models in R. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 25 Feb 2025
https://github.com/dcs-training/introtodatabases
This repository host the material connected to a training developed by Dave Elsmore (Edina) for CDCS. Go to the readme file
data-analysis data-wrangling databases sql
Last synced: 10 Jun 2026
https://github.com/mahmoudwal27/brazilian_ecommerce
This project explores and cleans the Olist Brazilian E-Commerce dataset using Python (Pandas) to prepare it for Power BI visualization. The process includes loading data, performing exploratory analysis, handling missing values and duplicates, formatting key columns, and exporting clean datasets.
analytics data-analysis data-analysis-python google-cloud python
Last synced: 16 May 2026
https://github.com/natnaelhhaile/Text-Similarity-Analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 11 Apr 2025
https://github.com/ifigeneiatsiflidou/applied-statistics-project
Project for an Applied Statistics course, involving exploratory data analysis and predictive modeling of movie revenue using engineered features and multiple linear regression.
correlation-analysis data-analysis linear-regression python scikit-learn visualization
Last synced: 29 Apr 2026
https://github.com/jihoonerd/national_health_insurance_sharing_service_project
국민건강보험 데이터를 활용한 EDA
data-analysis exploratory-data-analysis health insurance
Last synced: 18 Jul 2025
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/daniel-jcvv/daniel-jcvv
👨💻 Data Engineer | 3+ years enterprise experience with Telcel & Citi Banamex Develop ETL pipelines, data governance, and cloud solutions. Building scalable data architectures and automated workflows for Fortune 500 clients. Tech Stack: Python, SQL Server, Oracle, Apache Airflow, PySpark
agentic-ai apache-airflow apache-kafka apache-spark automation business-intelligence citi-bank-apis data-analysis data-engineering data-lake data-warehouse etl-pipeline medallion-architecture mlops n8n-workflow python rag sql-server
Last synced: 15 Apr 2026
https://github.com/netcodez/analysing-unicorn-companies---sql
Analysing Unicorn Companies using SQL
data-analysis data-structures database postresql sql
Last synced: 16 May 2026
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/mvharsh/blinkit-sales-dashboard
An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.
blinkitdashboard data-analysis data-visualization powerbi
Last synced: 25 Jan 2026
https://github.com/shafaq-aslam/data-gathering
A hands on collection of notebooks exploring multiple techniques of data gathering, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real time data through APIs. This repository documents my complete learning journey of data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/tathithienthanh/majorproject_womenfashionproductrecommendationsystem
Build a recommendation system for recommending woman fashion's products on e-commerce platforms
content-based data-analysis data-collection data-processing data-visualization dfd e-commerce erd jupyter-notebook lazada nlp python recommender-system scraping-websites sql system-design tiki vietnamese visualization
Last synced: 20 Mar 2025
https://github.com/as16082023/global-electronics-retailer
Analyzed Maven Electronics' performance data to identify factors driving revenue decline since 2020.
advanced-excel data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/anonymo2239/big-data-churn-analyzer
Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.
big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark
Last synced: 21 May 2026
https://github.com/grindelfp/two-data-manipulative-tasks
Two simple tasks on data analysis and processing.
Last synced: 17 Feb 2026
https://github.com/dcs-training/much-ado-about-nothing-missing-data-in-research
Repo for the Much ado about nothing workshop. Go to the Readme file
data-analysis data-cleaning data-wrangling r
Last synced: 15 Jun 2025
https://github.com/nuraj250/datainsighthub
A Node.js backend application that processes and analyzes personal user data to generate personalized insights and recommendations. It features secure user authentication, data upload and storage, custom algorithms for data analysis, and optional real-time notifications and third-party API integrations. Perfect for showcasing backend development
api-development backend-development bcrypt data-analysis data-analytics data-insights dotenv express jwt-authentication mongodb nodejs passport secure-api user-authentication
Last synced: 09 Apr 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/abhipatel35/moviematcher-movie-recommender-system
A robust movie recommendation system using the MovieLens dataset, employing Collaborative Filtering, Matrix Factorization, and Hybrid Models to enhance recommendation accuracy and diversity.
collaborative-filtering content-based-filtering data-analysis eda hybrid-models machine-learning matrix-factorization movie-recommendations movielens-dataset python recommender-system surprise-library
Last synced: 21 May 2026
https://github.com/srikarveluvali/dataanalysis
The "Dataset - Extraction, Analysis, and Visualization" project is a Python-based data analysis venture that focuses on exploring and interpreting the "Video Game Sales Analysis" dataset.
css data-analysis html javascript matplotlib numpy pandas python seaborn tableau
Last synced: 09 Apr 2026
https://github.com/michael-angelo-mootoo/quanta-app
Quanta is an open source statistical package app / toolkit for neuroscience and general computational descriptive and inferential statistics.
computational-statistics customtkinter data-analysis descriptive-statistics gui-application inferential-statistics neuroscience python r statistical-analysis statistics tkinter-python
Last synced: 16 May 2026
https://github.com/pdiegel/currencytracker
A Python application that fetches real-time currency exchange rates from an API, securely stores the data in an SQLite database, and includes error handling, logging, and good programming practices for reliable and periodic data capturing.
analysis api currency data-analysis data-capture logging python python3 sqlite3 tracker
Last synced: 09 Sep 2025
https://github.com/ggarciajavier/udacity-dalf-project3-test-perceptual-phenomenom
Work performed for the 3rd project of Udacity Data Analyst Nanodegree: statistical testing of a perceptual phenomenom (Stroop task).
data-analysis python statistical-inference udacity-data-analyst-nanodegree
Last synced: 18 May 2026
https://github.com/tapas-gope/pizza-sales
This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.
business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis
Last synced: 02 Jan 2026
https://github.com/kaushik-puttaswamy/food-delivery-time-prediction-using-machine-learning
The Food Delivery Time Prediction Model estimates delivery times using regression algorithms, with XGBoost as the best performer, and is deployed as a real-time application via Streamlit.
data-analysis data-science delivery food-delivery geolocation machine-learning modeldeployment predictive-modeling python realtimeproject regression-models streamlit xgboost
Last synced: 16 Apr 2026
https://github.com/yasir-arafah/nyc-trip-fare-prediction-using-tcn
"NYC Trip Fare Prediction Using Temporal Convolutional Networks (TCN)" is a Data Analytics Project where the trip and fare data of NYC taxi are combined and then analyzed using Pyspark and visualized using Matplotlib library. The project predicts the fare by using Temporal Convolutional Neural Network.
colab data-analysis matplotlib nyc-taxi-dataset pyspark python
Last synced: 29 Apr 2026
https://github.com/shivani8136/bellabeat-smart-device-data-analysis
This project analyzes smart device fitness data to uncover insights into user behavior, engagement, and wellness patterns. Conducted for Bellabeat, a high-tech company specializing in health-focused smart products for women, this analysis supports strategic decisions around product development and feature prioritization.
data-analysis data-visualization r-programming-language
Last synced: 08 Feb 2026
https://github.com/sgb31/covid-19-data-analysis
"In this project, I analyzed COVID-19 data to explore trends, case growth, and key patterns. I worked on cleaning the data, performing exploratory analysis, and visualizing infection rates, recoveries, and fatalities. The goal was to gain insights into how the pandemic evolved and its overall impact.
data-analysis data-visualization matplotlib pandas python seaborn
Last synced: 13 May 2026
https://github.com/fealt/databricks_incremental_data_project
Databricks project showcasing incremental data ingestion with industry best practices.
data-analysis data-engineering data-ingestion databricks delta-lake etl lakehouse medallion-architecture python spark spark-sql sql streaming-data
Last synced: 08 May 2026
https://github.com/touradbaba/multi-page_dash_application
This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.
dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere
Last synced: 27 Jul 2025
https://github.com/balajimohan18/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides
Last synced: 08 Mar 2026
https://github.com/jwt218/isonq
MATLAB package for Qtegra-generated data file processing.
data-analysis geochemistry isotopes matlab
Last synced: 03 Apr 2025
https://github.com/kheriberto/logistic_regression_project
A project that analyses dummie data from an advertising company using logistic regression
data-analysis logistic-regression pandas python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data
This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.
data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost
Last synced: 14 May 2026
https://github.com/alejandrolara11/desafio_latam_introduccion_analisis_de_datos
Repositorio del curso "Introducción al Análisis de Datos" de Desafío Latam. Ejercicios prácticos realizados durante el curso, enfocados en análisis de datos con Python, Pandas, y visualización básica.
data-analysis data-science data-visualization matplotlib numpy pandas python seaborn statsmodels
Last synced: 29 Apr 2026
https://github.com/hayatiyrtgl/cryptocurrency_time_series_rnn
Python script for training a Simple RNN model on cryptocurrency price data to predict future prices, including data exploration and evaluation
data-analysis data-science data-visualization keras pandas pandas-python prediction predictive-modeling python python-script rnn rnn-tensorflow tensorflow time-series time-series-analysis
Last synced: 08 Apr 2026
https://github.com/malucor/analise_exploratoria_dados
Programa em Python para fazer uma Análise Exploratória de Dados de Logística.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis ebac exploratory-data-analysis ipynb jupyter-notebook python
Last synced: 16 May 2026
https://github.com/l0rd-inquisit0r/data-analytics
A repository of data analytics implementations in Python
ai data-analysis data-analysis-python data-analytics
Last synced: 18 Jun 2025
https://github.com/rajesh9943/visualizing-global-development-trends-an-animated-analysis-of-life-expectancy-and-fertility-rates
To clean and analyze data to find trends in global population, fertility, and life expectancy from 1960 to 2016. This idea was inspired by hans rosling . To analyze the data, I used a scatter bubble chart, which clearly shows how's the population increased and the fertility rate decreased from 1960 to 2016.
data-analysis data-cleaning-and-preprocessing data-exploration expolatory-data-analysis identify-patterns reporting vizualisation
Last synced: 08 Oct 2025
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/saidulalimallick04/smart-traffic-violation-pattern-detector-dashboard
This project is a Streamlit web application designed to analyze traffic violation data. It provides a user-friendly interface to explore, visualize, and gain insights from traffic violation datasets. Users can upload their own data, perform analysis, and view summaries and trends.
dashboard data-analysis data-visualization internship-project pandas python smart-traffic streamlit
Last synced: 18 Apr 2026