Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/abelarduu/power_bi_analyst
Power BI project for financial data reporting, with intuitive navigation and interactive features. It offers a complete user experience, combining sophisticated presentation with effective functionality for data analysis.
dashboard data-analysis data-analytics modelagem-de-dados powerbi tratamento-de-dados
Last synced: 08 Sep 2025
https://github.com/sidsin0809/hmdb-endo-flagger
A Python toolkit to identify and score endogenous human metabolites from HMDB XML metadata
data-analysis hmdb metabolomics ontology pipeline python-3 streaming-parser xml-parsing
Last synced: 06 Jul 2025
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group similar loans together.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 27 Jul 2025
https://github.com/adeebkhan25/dataset_suicide_susceptible
The "Student Suicide Risk Factors Dataset" is a comprehensive collection of data aimed at understanding and mitigating the factors contributing to student suicides.
data-analysis dataset machine-learning supervised-learning
Last synced: 24 Dec 2025
https://github.com/vedantshi/tableau-bike-data-dashboard
London Bike Rides Analysis explores bike usage patterns using data visualization and machine learning. It identifies trends through a dynamic moving average, analyzes weather impact with heatmaps, and provides actionable insights via an interactive Tableau dashboard. Tools: Python, Tableau.
data-analysis data-visualization python tableau weather-data
Last synced: 16 May 2026
https://github.com/maxbiostat/diehl_ebola_cell_2016
Supplementary code and data for Diehl et al., 2016 (Cell)
data-analysis data-visualization disease-spread ebola mutation
Last synced: 11 Jul 2025
https://github.com/nivasharmaa/friskwatch
A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.
data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data
Last synced: 19 May 2026
https://github.com/purushothamadluru/atlantic-gdp-job-demand-analysis
data-analysis data-visualization powerbi
Last synced: 17 Feb 2026
https://github.com/shellynagar27/marketing-content-performance-analysis
Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.
data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing
Last synced: 26 Jun 2025
https://github.com/danhnnguyen0606/bitcoin-navigator
Bitcoin Navigator: A data-driven dashboard designed to analyze Bitcoin trends, empowering investors to refine their strategies and identify optimal investment opportunities.
bitcoin btc crypto cryptocurrency data-analysis data-analytics data-science data-visualization investment looker looker-studio
Last synced: 15 Mar 2025
https://github.com/lvmalware/lsm-module
A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/ryuzen6/kaggle-series
This is a series of Machine Learning/Deep Learning Models made for practice.
artificial-intelligence data-analysis data-science deep-learning machine-learning python3
Last synced: 20 May 2026
https://github.com/mizzy/tweetduck
Twitter Archive to DuckDB Importer - Extract and import Twitter archive data (2025 format) into DuckDB for analysis
archive cli data-analysis duckdb golang twitter
Last synced: 28 Jun 2026
https://github.com/evamaerey/ma206distributions
data-analysis data-science ggplot2 statistics
Last synced: 22 Jul 2025
https://github.com/astrojarhead/irafscripts
IRAF cl scripts
astronomy data-analysis image-processing iraf scripts
Last synced: 12 Jan 2026
https://github.com/badranalyst/restaurant-reviews-sentiment-analysis-nlp-case-study
This project analyzes restaurant reviews using Natural Language Processing (NLP) for sentiment analysis. It covers data exploration, pre-processing (NLTK text cleaning), model building, prediction, and deployment. The goal is to predict sentiment from reviews using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
data-analysis data-science eda exploratory-data-analysis matplotlib-pyplot model model-building numpy pandas pre-processing predictive-modeling python seaborn
Last synced: 13 Apr 2026
https://github.com/olympus-terminal/data-processing
Data analysis and processing tools
automation data-analysis data-processing data-science etl machine-learning pdf-extraction python r research statistics web-scraping
Last synced: 16 May 2026
https://github.com/sharoonjoseph321/samsung_stock_prediction
Predicting the future price of Samsung stock using machine learning, scikit-learn, and pandas
algorithms data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms prediction predictive-analytics predictive-modeling python stock-price-prediction supervised-learning
Last synced: 06 Apr 2025
https://github.com/sakan811/gachascope
Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.
data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves
Last synced: 04 May 2026
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 20 May 2026
https://github.com/adnanrahin/nlp-with-disaster-tweets
Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.
data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets
Last synced: 21 Jun 2025
https://github.com/amoghkori/deeplabcut-package-for-animal-pose-estimation
DeepLabCut Mouse Location Prediction: Training a deep neural network to predict the location of a mouse using annotated joint positions.
data-analysis data-annotations data-preprocessing deep-learning machine-learning model-evaluation python-programming research research-project
Last synced: 17 Mar 2025
https://github.com/kiran-kumar-k3/sales-performance-dashboard
The Sales Performance Dashboard is an interactive Python-based web application that visualizes and analyzes sales data, providing actionable insights through dynamic charts and metrics.
data-analysis python streamlit
Last synced: 20 May 2026
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 27 May 2026
https://github.com/archanakokate/bank_term_deposit_prediction
Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.
data-analysis data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Sep 2025
https://github.com/pkjjoshi/restaurants-analysis
Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.
beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn
Last synced: 21 Jun 2025
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/jiteshshelke/codsoft
A repository showcasing three machine learning projects—Titanic Survival Prediction, Movie Rating Prediction, and Iris Flower Classification—completed during CodSoft's Data Science Internship. 🚀
codsoft codsoftinternship data-analysis data-science linear-regression logistic-regression machine-learning machine-learning-algorithms python
Last synced: 20 May 2026
https://github.com/teditae/data-analysis-with-pandas
Mini data science projects focused on Pandas-powered analysis.
data-analysis data-manipulation pandas python
Last synced: 30 Apr 2026
https://github.com/patricksferraz/aqw-madrid-data-analysis
Interactive analysis and visualization of Madrid's air quality and weather data (2001-2016) using Python, Dash, and Jupyter. Features interactive maps, statistical analysis, and data visualization tools.
air-quality dash data-analysis data-engineering data-science data-visualization data-wrangling environmental-data environmental-science interactive-dashboard jupyter jupyter-notebook madrid open-data pandas plotly python statistical-analysis time-series weather-data
Last synced: 30 Jan 2026
https://github.com/atharvkadammm/suicide-prediction-system
A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.
csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-
Last synced: 01 Jul 2025
https://github.com/atharvkadammm/calmlytic
An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.
anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost
Last synced: 21 Jun 2025
https://github.com/steviecurran/prediction-plot
Code that performs machine learning (k-nearest neighbours regression) and plots the predicted versus measured values
astrophysics c data-analysis high-redshift machine-learning pgplot python statistics tensorflow visualization
Last synced: 20 May 2026
https://github.com/nikitalpopov/news
v semester project
data-analysis data-science python scikit-learn
Last synced: 20 May 2026
https://github.com/rezowanrahat/netflix_analysis
Data analysis of Netflix content using Python, Pandas, and Seaborn
data-analysis data-visualization netflix pandas python
Last synced: 07 May 2026
https://github.com/ahnaf19/clean_bankingdata
Here I practiced simple ETL tasks. I know how to perform these tasks in SQL; here I explored how to do them with pandas as well.
data-analysis data-cleaning pandas python
Last synced: 19 Apr 2026
https://github.com/lunarwhite/lake-george-viz
Lake George data analysis and visualization, ANU COMP1730/6730
Last synced: 01 Nov 2025
https://github.com/kushagrakumar04/visual-age-distribution
A bar chart or histogram that visually depicts the distribution of a categorical or continuous variable, such as the age distribution or gender composition within a population. This graphical representation provides a clear overview of the data's patterns and trends.
data-analysis data-science google-colab
Last synced: 21 Jun 2025
https://github.com/jpcadena/malware-analysis
Analysis of malware signatures and their associated Common Vulnerabilities and Exposures (CVEs)
black common-vulnerabilities-and-exposures cve-search data-analysis data-engineering data-reporting data-visualization isort malware-analysis matplotlib mypy numpy pandas plotly poetry pre-commit pydantic python ruff seaborn
Last synced: 03 Mar 2026
https://github.com/devexpress-examples/wpf-pivot-grid-define-custom-cell-template-to-performing-data-editing
This example shows how to edit a cell with the cell editing template in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 02 May 2026
https://github.com/silasberger/charts-analysis
Data set collection, preprocessing and analysis of singles- and album charts
charts data-analysis data-mining data-science dataset music
Last synced: 14 Sep 2025
https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches
a data-driven exploration of international men's and women's football (soccer) match results using Python
data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization
Last synced: 05 May 2026
https://github.com/ranxi2001/predicting-mental-health-risk
Data analysis case study: mental health prediction (data source: Kaggle)
data-analysis data-visualization eda
Last synced: 27 Jun 2025
https://github.com/samruddhi3012/rfm-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 27 Jun 2025
https://github.com/alpkanoz/ibm_data_science_professional_certificate
The repository contains projects and training materials carried out throughout the IBM data science professional course.
classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn
Last synced: 07 Mar 2026
https://github.com/jgohel9902/toronto-airbnb-snowflake
This project analyzes Airbnb listings in Toronto using Snowflake’s cloud data platform. It follows a Bronze → Silver → Gold medallion architecture and leverages Snowflake Cortex to generate AI-driven executive insights.
data-analysis python snowflake sql
Last synced: 07 Mar 2026
https://github.com/bhaveshbhakta/autistic-patients-classification-using-ann
Autistic Patients Classification
ann artificial-neural-networks autistic-patients-classification data-analysis data-visualization deep-learning
Last synced: 25 Feb 2025
https://github.com/245839/automobile-analysis
Analysis of data on imported cars to the USA performed in Python using libraries for data analysis in the Jupyter environment.
data-analysis jupyter-notebook python
Last synced: 20 May 2026
https://github.com/vlad1343/data-visualisation
Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.
anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python
Last synced: 08 Apr 2026
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 20 May 2026
https://github.com/farhad-here/tegenx
TeGenX: Multilingual Text Generation App. TeGenX is a lightweight, interactive text generation application built with Streamlit. It leverages multiple pre-trained transformer models to generate text in both English and Persian.
data-analysis data-science deep-learning happytransformer huggingface nlp python stream text-generation text-generator textgeneration transformer web-application
Last synced: 25 Jan 2026
https://github.com/gabrielramirezv/rnaseq_2025_notas
Repository for RNA-seq class from the Undergraduate Program in Genomic Sciences.
Last synced: 29 Mar 2025
https://github.com/samruddhi3012/health-care-analytics
Hi! This repo covers healthcare analytics using advanced Microsoft Excel.
dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup
Last synced: 29 Mar 2025
https://github.com/marlysson/craw
A system to show the data collected from various sources using chartjs - ⚡️
chartsjs data-analysis data-science web-scraping
Last synced: 21 Jun 2025
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/dmytrori/himalayan_expeditions
Himalayan expedition stats, 1905–2020
alpinism data-analysis data-visualization pandas-python
Last synced: 21 Jun 2025
https://github.com/deborangueira/campeonado_kaggle_2025
Development of a machine learning model to predict the success of startups. The goal is to identify which companies are most likely to become success stories in the market.
computacao data-analysis desafio kaggle modulo3 ponderada
Last synced: 16 May 2026
https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm
Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contains the data science agent of the project.
artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression
Last synced: 01 Jan 2026
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 02 Nov 2025
https://github.com/patricialjohnson/data-visualization-tableau-project
Tableau Visualization Project
business-analytics business-intelligence data-analysis data-visualization digital-marketing digital-marketing-agency kpi microsoft-excel program-management project-management python search-engine-optimization seo sql tableau
Last synced: 21 Jun 2025
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 16 May 2025
https://github.com/ksharma67/eda-on-ipl
This Python notebook analyzes IPL matches from 2008 to 2020 using Python packages such as pandas, matplotlib, and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 07 May 2026
https://github.com/jabercrombia/invoice-tracker
An invoice tracker with sample data and data visualizations, built with Next.js.
data-analysis nextjs postgres shadcn vercel
Last synced: 07 Apr 2026
https://github.com/m-biriulova/automated-sales-report
Automated sales & returns report using Python, Excel, and PDF export
automation business-intelligence data-analysis data-visualization excel financial-analysis freelance portfolio-project python report-generation sales-report
Last synced: 20 Jun 2025
https://github.com/swouf/ntds_imdb_team4
data-analysis data-visualization datascience graph-theory
Last synced: 13 May 2025
https://github.com/madrury/hot-sauce
Simulation of a Hot Sauce Spiciness Dataset
data-analysis data-science data-visualization dataset machine-learning
Last synced: 16 May 2026
https://github.com/whisplnspace/insightgenie
InsightGenie is an AI-powered data analyst that lets you upload files, ask questions, and get insights with visualizations
data-analysis data-science data-visualization deployment gemini-api huggingface nlp
Last synced: 19 Jun 2025
https://github.com/lucasfloresc/final_project
This is the final project of the Ironhack Bootcamp. In this project I applied all the methods and techniques learned in the Bootcamp, such as web scraping and API extraction, data cleaning and processing with Python, Python logic, machine learning, and data visualization. Everything is displayed in Streamlit for a more user-friendly interface.
data-analysis data-visualization machine-learning python streamlit webscraping
Last synced: 08 May 2026
https://github.com/mkk-1817/cvip-ds-exploratory_data_analysis-terrorism
This repository explores global terrorism trends by analyzing the Global Terrorism Database to uncover temporal patterns, identify top terrorist groups, examine attack types, and gain insights into geographical and success/failure dynamics.
coderscave data-analysis data-science data-visualization eda exploratory-data-analysis python terrorism-analysis
Last synced: 19 Jun 2025
https://github.com/celineboutinon/lafleche-et-associes
OpenClassrooms Data Analyst 2022-2023 - Project 7 using KNIME Analytics Platform
data-analysis data-analytics data-visualisation knime-analytics-platform no-code rgpd
Last synced: 08 Feb 2026
https://github.com/lorinczakos/sql-projects
This is a collection of SQL scripts that I wrote and that were approved during the GoIT Romania Data Analyst course.
bigquery cte data data-analysis dbeaver marketing-analytics postgresql project-repository sql vscode
Last synced: 16 May 2026
https://github.com/erseco/ugr_tratamiento_inteligente_datos
Working repository for the Intelligent Data Processing course (Tratamiento Inteligente de Datos) of the Master's in Computer Engineering at the University of Granada (UGR)
Last synced: 26 Apr 2026
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issues, first we need to introduce those issues. In order to do that, you will populate your database with a significant amount of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/kashirin-alex/thither.direct-onamove
An Android skeleton example application for using data from the Thither.Direct platform in mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/nurulashraf/linear-regression-spotify
Data Science - Spotify Linear Regression Analysis
data-analysis data-preprocessing data-visualization dataset-exploration feature-selection linear-regression machine-learning matplotlib mean-squared-error model-evaluation multiple-regression music-analytics numpy predictive-modeling python regression-analysis root-mean-squared-error scikit-learn seaborn spotify-data
Last synced: 01 May 2026
https://github.com/nferno55/mock-data-governance
Working with messy data and using data quality practices to clean it up and practice SQL/Python automation. YAML will be used for Metadata validation soon.
data-analysis database-management metadata python sql sqlite3 yaml
Last synced: 16 May 2026
https://github.com/dcs-training/introtodatabases
This repository hosts the material for a training developed by Dave Elsmore (Edina) for CDCS. See the README file.
data-analysis data-wrangling databases sql
Last synced: 10 Jun 2026
https://github.com/tabibyte/aoty-highest-rated-albums-data-analysis
Data Analysis of AOTY Highest Rated Albums
albums aoty data-analysis music
Last synced: 10 Sep 2025
https://github.com/athari22/multivariable_regression_and_valuation_model_
Multivariable regression model using Python to analyze and predict Boston housing prices based on various socioeconomic and environmental features.
data-analysis data-analysis-python housing-prices housing-prices-competition machine-learning pandas pandas-python plotly python regression-models seaborn seaborn-python sklearn
Last synced: 17 Jun 2025
https://github.com/teja-1403/forage-tata-data-visualisation-empowering-business-with-effective-insights
This repository contains solutions to the 4 different tasks that must be performed during the Data Visualisation: Empowering Business with Effective Insights virtual internship provided by TATA via Forage.
analysis-and-reporting analytics analytics-and-decision-science charts communications dashboards data-analysis data-cleanup data-interpretation data-storytelling data-visualizations graph insights power-bi visual-basic visualizations
Last synced: 18 Feb 2026
https://github.com/nafisrayan/crypto-trading-platform
This React Crypto Exchange Template is designed to provide a solid foundation for building a comprehensive cryptocurrency exchange platform. With its sleek and modern design, this template is perfect for anyone looking to create a user-friendly and intuitive trading experience.
crypto dashboard data-analysis data-visualization react template
Last synced: 16 May 2026
https://github.com/arsalan-dev-engineer/ai-repository
A repository that contains AI-related projects, notes, practice files, and documentation.
ai algorith beginner-friendly data-analysis data-preprocessing developer jupyterlab matplotlib matplotlib-pyplot natural-language-processing numpy pandas python unsupervised-learning visualization
Last synced: 12 Apr 2026
https://github.com/shafaq-aslam/data-gathering
A hands-on collection of notebooks exploring multiple techniques of data gathering, from reading CSV, Excel, JSON, and SQL files to exporting data in various formats and fetching real-time data through APIs. This repository documents my complete learning journey of data ingestion, preparation, and extraction for data analysis workflows.
api data-analysis data-export data-gathering data-import data-science jupyter-notebook machine-learning pandas python python3
Last synced: 21 May 2026
https://github.com/j-faria/bicerin
Working on the RV challenge in Torino
data-analysis gp radial-velocity rv-challenge
Last synced: 07 Apr 2026
https://github.com/anonymo2239/big-data-churn-analyzer
Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.
big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark
Last synced: 21 May 2026
https://github.com/danitilahun/exploratory-data-analysis-projects
This repository contains a collection of my personal Exploratory Data Analysis (EDA) projects. Each project involves exploring various datasets to gain insights, uncover patterns, and visualize trends.
data-analysis data-science data-visualization exploratory-data-analysis python
Last synced: 16 May 2026
https://github.com/dacrol/filterdataset
Filters a dataset based on attributes
data-analysis dataset deep-learning machine-learning python python3
Last synced: 25 Jul 2025
https://github.com/gui-sitton/bank-loans
In this project I prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children, as well as other factors, have an impact on loan default.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/jackmnob/python-tableau-eda-stockdash
Data cleaning, preparation, and manipulation (EDA) for an interactive stock market dashboard with Tableau - using pandas (Python) via JupyterLab
cleaning-data dashboard data-analysis data-preparation eda jupyter-notebook jupyterlab python tableau-public
Last synced: 14 May 2026
https://github.com/jacktheprogrammer/hypothesis-testing-using-data-analytics
Hypothesis testing using data analytics to help a yellow trip car ride service provider increase its revenue
data-analysis data-analytics data-analytics-project data-insights data-plotting data-visualization descriptive-analysis hypothesis-testing prescriptive-analysis statistical-analysis statistical-methods
Last synced: 17 Jun 2025
https://github.com/kakri787/alcoholism-and-grade-analysis
A mini project for a university data science module in which we analyzed the relationship between students' alcohol consumption and their academic performance, using exploratory data analysis and machine learning techniques to see whether we could predict students' grades.
data-analysis data-science data-vizualisation lasso-regression machine-learning neural-network
Last synced: 12 Apr 2025
https://github.com/tapas-gope/pizza-sales
This project analyzes Pizza Sales Data to provide insights into customer preferences and sales performance. Key metrics include total revenue, orders, and average order value, with a breakdown by pizza category and size. The dashboard identifies peak sales periods and top-selling items, supporting data-driven business decisions.
business-intelligence dashboard data-analysis data-visualization dax powerbi sales-analysis
Last synced: 02 Jan 2026
https://github.com/prathmesh2507/ctc-hackthon
A data-driven system designed to reduce overcrowding and optimize urban public transport using real-world geospatial data and intelligent simulation.
dashboard data-analysis data-visualization python streamlit
Last synced: 16 May 2026
https://github.com/datalopes1/desafio_delivery
Challenge from the Universidade dos Dados subscription club (Clube de Assinaturas), designed to simulate the real-world demands of a data analyst
Last synced: 06 Mar 2026
https://github.com/vatshayan/students-marks-prediction-project
Prediction of marks of students using Machine Learning algorithms.
college-project data-analysis data-science data-science-projects final-project final-year-project machine-learning machine-learning-algorithms marks minor-project semester student-project students
Last synced: 17 Jun 2025
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 19 Mar 2026