Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/techshot25/graduateadmissions
Looking at the probability of being accepted in a graduate program using a machine learning model
bayesian-regression correlation-matrices data-analysis data-science linear-regression machie-learning random-forest-regression regression ridge-regression
Last synced: 25 Feb 2025
https://github.com/karanch10/fraudshield
FraudShield is a machine learning credit card fraud detection system that analyzes transaction attributes to identify suspicious activities in real time. Built with Python, SQL, and Django, it provides a user-friendly interface for fraud prediction using OpenBanking APIs and advanced detection techniques. Ideal for businesses and individuals.
data-analysis data-science data-visualization machine-learning python3
Last synced: 20 May 2026
https://github.com/tabibyte/azerbaijani-rapper-lyrics-data-analysis
Lyrics Data Analysis of Azerbaijani Rappers
azerbaijan data-analysis rappers
Last synced: 22 Jul 2025
https://github.com/patricksferraz/aqw-madrid-data-analysis
Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.
air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data
Last synced: 30 Jan 2026
https://github.com/iwasakiyuuki/data-analysis-platform-airflow-dag
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 13 May 2025
https://github.com/ahnaf19/clean_bankingdata
Here I tried to practice simple ETL tasks. I know how to perform these tasks in SQL, here just explored my way around using pandas as well.
data-analysis data-cleaning pandas python
Last synced: 19 Apr 2026
https://github.com/samruddhi3012/rfm-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 27 Jun 2025
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 20 May 2026
https://github.com/farhad-here/tegenx
TeGenX: Multilingual Text Generation App.TeGenX is a lightweight, interactive text generation application built with Streamlit. It leverages multiple pre-trained transformer models to generate text in both English and Persian.
data-analysis data-science deep-learning happytransformer huggingface nlp python stream text-generation text-generator textgeneration transformer web-application
Last synced: 25 Jan 2026
https://github.com/saravanansuriya/energy-consumption-analysis
Project will analyze energy usage and greenhouse gas (GHG) emissions of Ontario's Broader Public Sector (BPS) organizations, leveraging a comprehensive database of reported data in Power Bi
data-analysis data-cleaning powerbi python-script
Last synced: 22 Mar 2025
https://github.com/s-narasimman/zepto_inventory_sql_data_analysis
This project focuses on data cleaning, exploration, and analysis of product information from the Zepto dataset using SQL. It provides actionable insights into pricing, stock availability, discounts, and category-level performance.
aggregation categorization csv data-analysis data-cleaning kaggle postgresql sql zepto
Last synced: 16 May 2026
https://github.com/deborangueira/campeonado_kaggle_2025
Desenvolvimento de um modelo de machine learning para prever o sucesso de startups. O objetivo é identificar quais empresas têm maior probabilidade de se tornarem casos de sucesso no mercado.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/pabi1234810/data_analysis_zepto
A comprehensive SQL-based business intelligence solution for analyzing grocery store product data, inventory management, and pricing strategies. This project demonstrates end-to-end data analysis workflow from raw data exploration to actionable business insights.
analytics csv data-analysis data-science database excel kaggle kaggle-dataset mathematics pgadmin4 sql utf-8 zepto
Last synced: 01 Nov 2025
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/edanur-y/abalone-age-prediction-with-regression-models
Comparing the performances of simple linear, multiple linear, multi-layer perceptron and k-nearest neighbors regressions on abalone data to predict the age.
data-analysis hyperparameter-tuning missing-values-analysis outlier-analysis python recursive-feature-elimination
Last synced: 20 May 2026
https://github.com/swouf/ntds_imdb_team4
data-analysis data-visualization datascience graph-theory
Last synced: 13 May 2025
https://github.com/hemant-kumar786/heart-disease-prediction
Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.
correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis
Last synced: 02 Nov 2025
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Repositorio de trabajo de la asignatura Tratamiento Inteligente de Datos del Máster en Ingeniería Informática de la Universidad de Granada (UGR)
Last synced: 26 Apr 2026
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/habiburrahman-mu/data-wrangling
Data Wrangling is the process of converting data from the initial format to a format that may be better for analysis.
data-analysis data-mining data-science
Last synced: 21 May 2026
https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal
Este repositorio contiene un análisis detallado sobre la implementación de la regularización Lasso en modelos de regresión lineal para predecir el precio de vehículos. Se parte de un conjunto de datos limpio y se aplican diversas transformaciones y modelados para mejorar la precisión de las predicciones.
data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn
Last synced: 16 May 2026
https://github.com/dcs-training/regressionandmixedeffectsmodelling
This course will introduce you to regression and linear mixed-effects models (LMMs). It will help to develop your theoretical understanding and practical skills for running such models in R. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 25 Feb 2025
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/shafaq-aslam/data-gathering
A hands on collection of notebooks exploring multiple techniques of data gathering, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real time data through APIs. This repository documents my complete learning journey of data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/anonymo2239/big-data-churn-analyzer
Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.
big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark
Last synced: 21 May 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/tapas-gope/pizza-sales
This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.
business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis
Last synced: 02 Jan 2026
https://github.com/kaushik-puttaswamy/food-delivery-time-prediction-using-machine-learning
The Food Delivery Time Prediction Model estimates delivery times using regression algorithms, with XGBoost as the best performer, and is deployed as a real-time application via Streamlit.
data-analysis data-science delivery food-delivery geolocation machine-learning modeldeployment predictive-modeling python realtimeproject regression-models streamlit xgboost
Last synced: 16 Apr 2026
https://github.com/shivani8136/bellabeat-smart-device-data-analysis
This project analyzes smart device fitness data to uncover insights into user behavior, engagement, and wellness patterns. Conducted for Bellabeat, a high-tech company specializing in health-focused smart products for women, this analysis supports strategic decisions around product development and feature prioritization.
data-analysis data-visualization r-programming-language
Last synced: 08 Feb 2026
https://github.com/balajimohan18/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides
Last synced: 08 Mar 2026
https://github.com/kheriberto/logistic_regression_project
A project that analyses dummie data from an advertising company using logistic regression
data-analysis logistic-regression pandas python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/l0rd-inquisit0r/data-analytics
A repository of data analytics implementations in Python
ai data-analysis data-analysis-python data-analytics
Last synced: 18 Jun 2025
https://github.com/ejw-data/tableau-drug-study
Brief analysis of drug treatments that were also analyzed with pandas
Last synced: 02 Jan 2026
https://github.com/gkn-tech/brisecheck_website
Web Crawler, Visualizations and Game
choropleth-map contact-form data-analysis data-visualization game-development pygame python-flask scatter-plot web-crawler web-scraping
Last synced: 25 Feb 2025
https://github.com/anamakarevich/suicide_rates_factors
Female suicide rates analysis for Udacity Hacathon
data-analysis data-cleaning linear-regression suicide
Last synced: 21 May 2026
https://github.com/simranshaikh20/credit-card-dashboard
A Data Visualization Project using Microsoft Power bi
data-analysis data-visualization powerbi
Last synced: 02 Jan 2026
https://github.com/josericodata/josericodata
Adding a cool README file
big-data data-analysis data-science dublin hadoop hadoop-mapreduce hadoop-spark ireland jobsearch jobseeker portfolio portfolio-data-science portfolio-website python sql
Last synced: 26 Aug 2025
https://github.com/ginga1402/demand-supply-analysis
Demand- Supply Analysis using Python
data-analysis data-science demand-supply-management driver-rider-relationship
Last synced: 30 Mar 2025
https://github.com/abhishekyadav915/diwali_sales_analysis
This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python
Last synced: 05 Apr 2025
https://github.com/azizbekavazov/eda-uci-retail-dataset
Exploratory Data Analysis (EDA) on UCI Online Retail Dataset. Customer insights, product trends, sales patterns and product recommendations.
customer-insights data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook matplotlib pandas personalized-recommendations product-recommendation python recommendation-system retail-analytics seaborn uci-online-retail
Last synced: 23 Jul 2025
https://github.com/thesfinox/mltools
A collection of simple tools for data science and machine learning projects.
ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox
Last synced: 14 May 2025
https://github.com/bhiogade/tlc-trip-analysis
NYC Taxi and Limousine Commission (TLC) Trip Analysis
data-analysis data-cleaning data-collection data-visualization pandas-python tableau tableau-desktop
Last synced: 30 Mar 2025
https://github.com/leticiamilan/dashboard-analitico-de-vendas-globais
Dashboard Analítico de Vendas Globais - DSA - Desenvolvido com Power BI
dashboard dashboard-power-bi data-analysis power-bi powerbi
Last synced: 03 Feb 2026
https://github.com/mr-chang95/datascience_airbnb
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn
Last synced: 08 Apr 2026
https://github.com/kunalkumar2001/sales-project-using-excel-and-sql
Comprehensive sales analysis using SQL, Excel, and PowerPoint to uncover insights on top-sellers, peak times, and branch performance.
data-analysis data-analytics excel mssql sql
Last synced: 03 Nov 2025
https://github.com/borjamome/visualization_supermarkets_with_r
Visualization using R and OpenStreetMaps
data-analysis datavisualization openstreetmap r
Last synced: 02 Jan 2026
https://github.com/myke003/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 14 Mar 2025
https://github.com/cano1998/data-visualization-project
A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.
bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot
Last synced: 17 Jul 2025
https://github.com/garcane/credit-card-transactions-fraud-detection-project
The Credit Card Transactions Fraud Detection Project repository is designed to analyse and detect fraudulent transactions in credit card data.
Last synced: 03 Feb 2026
https://github.com/ishmal793/basic-python-
Beginner-friendly Python code examples and exercises – a strong foundation for aspiring data analysts.
data-analysis data-analytics learning-python-code problem-solving python-basics python-for-beginners
Last synced: 23 Jul 2025
https://github.com/shrutiii1109/diwali-sales-analysis-through-python
Data analysis project on Diwali sales using Python (Pandas, NumPy, Matplotlib, Seaborn). The goal is to analyze customer behavior, identify sales trends, and provide insights to improve marketing and business strategies.
data-analysis jupyer-notebook matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/sahilmaurya28/youtube-data-analysis
YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.
analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube
Last synced: 13 Apr 2026
https://github.com/mamtapanda088/dataanalaysis-warmup-
Tasks: Create a DataFrame: Convert the dictionary into a pandas DataFrame. Top and Bottom Rows: Display the top 3 bottom ,3 rows of the DataFrame. Summary Statistics: Generate summary statistics for the dataset. Gender Count: Count the occurrences of each gender. Marks Analysis: Calculate the average, maxi, and min marks. Tools Used: Python ,pandas
data-analysis data-science jupyter-notebook visualization
Last synced: 04 Apr 2025
https://github.com/lucashomuniz/project-01
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 30 Mar 2025
https://github.com/lucashomuniz/project-10
Optimizing Sales Forecast Accuracy: Exploratory Analysis and Insights
data-analysis data-munging data-visualization dax-languague exploratory-data-analysis language-r power-bi sales-forecast statistics-modules
Last synced: 30 Mar 2025
https://github.com/shibbir-ahmad24/a-data-driven-approach-to-food-security-and-supermarket-accessibility
A Data-Driven Approach to Food Security and Supermarket Accessibility
data-analysis matplotlib numpy pandas python3 seaborn
Last synced: 05 Apr 2025
https://github.com/sdley/logiciel-de-deliberation-uam-2022
Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités
data-analysis pandas python tkinter-gui
Last synced: 08 May 2026
https://github.com/rudra-g-23/power-bi-custom-visual
A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.
custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization
Last synced: 02 Jan 2026
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 06 Apr 2025
https://github.com/tushar2704/employee-distribution
This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.
data-analysis data-visualization excel postgresql powerbi sql tushar2704
Last synced: 04 Nov 2025
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/zborovskaanna/grosery_store_sales_analysis
Python data analysis project. Analysis of grocery store sales using visualizations and reporting in Tableau
data-analysis data-visualization matplotlib numpy pandas python seaborn tableau
Last synced: 08 Apr 2026
https://github.com/shreshthvashisht/abc-call-volume-trend-analysis
Customer Experience Analysis
advanced-excel call-centre-analysis call-volume-trend data-analysis data-visualisation experience-analytics pivot-tables
Last synced: 01 Mar 2026
https://github.com/aditiagrawal04/netflix-insights-mysql-
SQL-based analytical project exploring Netflix’s dataset to extract insights about content type, genre, ratings, country-based distributions, and release trends. Ideal for understanding business intelligence using SQL.
business-intelligence data-analysis data-exploration mysql netflix sql sql-project
Last synced: 28 Jun 2025
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 14 May 2025
https://github.com/mmfava/analises-papers
Script base de alguns papers publicados entre 2019 e 2021.
Last synced: 22 May 2026
https://github.com/al-ghaly/hotel-revenue-excel-analysis
Excel Dashboard to analyze data of a hotel over the past three years.
dashboard data-analysis data-visualization excel excel-analysis
Last synced: 02 Jan 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/fazej99/u.s-climate-and-temperature-analysis
This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.
data-analysis data-science data-visualization gis machine-learning streamlit
Last synced: 22 May 2026
https://github.com/data-edd/california_population_projection
This project demonstrates a population projection analysis for the state of California using MySQL
Last synced: 30 Mar 2025
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 04 Jun 2026
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repositório das análises realizadas para o paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/smohanta23/ev-trendanalytics-24
This Tableau project analyzes EV adoption trends using data up to May 2024. Visualizations cover growth, geography, market share, CAFV eligibility, and consumer preferences, supporting data-driven decisions with detailed drill-downs. Data is meticulously cleaned, offering stakeholders valuable insights into EV market dynamics and trends for future.
business-intelligence data-analysis data-engineering electric-vehicles feature-engineering kpianalysis predictive-analytics tableau trendanalysis
Last synced: 27 Mar 2026
https://github.com/vladstudennikov/diabetes-prediction-app
ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior
cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs
Last synced: 08 Apr 2026
https://github.com/yousef-jaber-abdelaziz/electrical-vehicles-data-analysis-project
A full stack Data Engineering f\project from Getting the data to the Data warehousing and then the Dashboard using Power BI
data-analysis data-engineering data-modeling data-visualization data-warehouse data-warehousing fabric microsoft-azure microsoft-fabric-data-engineer powerbi sql-server
Last synced: 23 Jun 2026
https://github.com/esr-style/stylegrid
A free alternative to AG grid built by me for personal use case.
aggrid data-analysis grid pivot-chart pivot-grid table
Last synced: 16 Sep 2025
https://github.com/faris771/identify_customer_segments
This project is part of the Palestine Launchpad by Spark, and Udacity with Google. It uses unsupervised learning to identify customer segments for a mail-order company in Germany. The goal is to direct marketing campaigns towards the most promising audiences. The data is provided by Bertelsmann Arvato Analytics.
clustering data-analysis decomposition feature-engineering machine-learning unsupervised-learning
Last synced: 08 Aug 2025
https://github.com/hatamiarash7/ir-system
IR System for Reuters DB
data-analysis data-mining ir python
Last synced: 29 Mar 2025
https://github.com/mindlessmuse666/eda-explorer
Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
csv-visualization data-analysis data-science data-visualization exploratory-data-analysis json-visualization matplotlib oop pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/tejaswirupa/data-analysis-of-departure-delays-at-united-airlines
Explored how weather and time factors influence delays in 58,000+ UA flights. Used permutation testing and visual analytics to show how temperature, visibility, and time of day affect departure punctuality.
Last synced: 25 Jan 2026
https://github.com/leosimoes/datascienceacademy-python-analisededados
Atividades do curso Análise de Dados com Linguagem Python da DataScienceAcademy.
data-analysis data-science jupyter-notebook python sql
Last synced: 29 Apr 2026
https://github.com/shz-code/diwali_sales_data_analysis
Customer Product Purchase Behavior Analysis
behavior-analysis data-analysis matplotlib ml sales seaborn
Last synced: 14 Mar 2025
https://github.com/sivkri/shiny-scatter-plot-app
This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.
data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny
Last synced: 22 Mar 2025
https://github.com/habiburrahman-mu/exploratory-data-analysis
Methods to see if certain characteristics or features can be used to predict.
data-analysis data-mining data-science data-visualization
Last synced: 20 Jan 2026
https://github.com/hasnathjami/data-analysis-of-covid-19
An Oracle PL/SQL-based project on COVID-19 data analysis. It is my CSE 4.1 project of Distributive Database Management System LAB.
data-analysis naive-bayes-classifier oracle-database probability-statistics sqlplus
Last synced: 08 Mar 2026
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 24 Jan 2026
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 24 Jul 2025
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 24 Jul 2025