Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/wb-az/sql-for-data-analysis-udacity
This repository contains SQL queries for the SQL for Data Analysis given by Udacity. The queries include commands to define, select, manipulate, control access, aggregate, and join data and data tables.
aggregation data-analysis data-cleaning erd joints postgresql sql subqueries-and-joins window-functions-in-sql
Last synced: 23 May 2026
https://github.com/akmj1011/hill-and-valley-prediction-using-logistic-regression
Created A Prediction System Using Logistic Regression For Figuring Out The Hall And Valley From The Given Datasets
cloud-computing data-analysis data-manipulation data-preprocessing data-transformation data-visualization google-colab
Last synced: 13 May 2026
https://github.com/vladstudennikov/diabetes-prediction-app
ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior
cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs
Last synced: 08 Apr 2026
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/ronylpatil/whatsapp-group-chat-analysis
This project is totally based on data analysis where our college official Whatsapp group is used to extract useful information from the chat. Some of the useful extracted features are most active members of the group, most active day of the week, top-10 media contributors in the Group, and many more...
data-analysis data-preprocessing data-wrangling feature-engineering
Last synced: 14 Jun 2025
https://github.com/smohanta23/ev-trendanalytics-24
This Tableau project analyzes EV adoption trends using data up to May 2024. Visualizations cover growth, geography, market share, CAFV eligibility, and consumer preferences, supporting data-driven decisions with detailed drill-downs. Data is meticulously cleaned, offering stakeholders valuable insights into EV market dynamics and trends for future.
business-intelligence data-analysis data-engineering electric-vehicles feature-engineering kpianalysis predictive-analytics tableau trendanalysis
Last synced: 27 Mar 2026
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge to Hackató Saló Ocupació by Barcelona activa
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 10 Apr 2026
https://github.com/farhad-here/median-performance-comparison
Benchmarking the performance of median calculation using vanilla Python vs NumPy.
data-analysis matplotlib numpy python
Last synced: 18 Apr 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/mstovarh/analisis-de-bebidas-de-starbucks
En este repositorio se encuentran unas gráficas basadas en diversas características de las bebidas de Starbucks, usé tecnologías como la herramienta de Data Analysis de ChatGPT, Excel y PowerQuery.
chatgpt data-analysis excel powerquery
Last synced: 15 Apr 2025
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/karlyndiary/spotify-excel-dashboard
Data Analysis on the Spotify Dataset using Microsoft Excel and VBA.
charts data-analysis data-cleaning data-visualization excel excel-export excel-vba pivot-tables
Last synced: 04 Jan 2026
https://github.com/satyam4229/omnify-dataanalysis
Our assessment of Omnify focused on data-driven strategies to maximize profitability. We identified "Product X" as the most profitable product and recommended leveraging the "Wellness Solutions" keyword category for optimal keyword strategy.
data-analysis data-science data-visualization excel omnify
Last synced: 04 Jan 2026
https://github.com/aneeshmurali-n/project-ml-data-preprocessing
The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.
data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler
Last synced: 02 May 2026
https://github.com/serlo/data-pipeline-interactive-exercises
processing pipeline for exercise dashboards
Last synced: 26 Feb 2025
https://github.com/skysign/dat
데이터분석을 함께 공부하는 스터디입니다.
data data-analysis data-science
Last synced: 02 Jan 2026
https://github.com/phanchenh/youtube_analysis_rlanguage
Insights into YouTube Channel Performance - A Data-Driven Approach
business-analytics data-analysis data-driven data-visualization etl-pipeline preprocessing r-language r-programming-language
Last synced: 10 Mar 2026
https://github.com/friendotjava/air-quality-analysis
Data Analysis for Air Quality in different District in Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/ayberkyavuz/body_type_estimator
This repository is a tutorial for all levels who want to learn how to develop end to end machine learning system.
backend classification css data-analysis dataset end-to-end flask flask-application frontend html javascript machine-learning machine-learning-application material-design materializecss pandas python tutorial webapp xgboost
Last synced: 10 Apr 2026
https://github.com/ronitjariwala/prodigy_ds_02
Prodigy InfoTech Data Science Internship Task-2
Last synced: 28 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/andrii04/ga4-gcs-to-bigquery-etl
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 18 May 2026
https://github.com/syarwinaaa09/visualizing-the-history-of-nobel-prize-winners
analysis and visualization of Nobel Prize winners
data-analysis data-visualization jupyter-notebook machine-learning matplotlib nobel-prize pandas python
Last synced: 06 May 2026
https://github.com/macdon112/credit-card-fraud-detection
Comparing ML models (Random Forest, KNN, Decision Tree) for credit card fraud detection using SMOTE and stratified cross-validation.
classification data-analysis fraud-detection imbalanced-data machine-learning python scikit-learn
Last synced: 10 May 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/rahul-jha98/restauranttrends.stats-backend
Application that scrapes the Zomato Dataset and enables the user to visualise the results.
data-analysis data-extraction firebase-storage web-scraping zomato-api
Last synced: 16 Mar 2026
https://github.com/okwilkins/retailanalysis
A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.
data-analysis data-science machine-learning statistics
Last synced: 18 Oct 2025
https://github.com/harryrlk/data_analysis_showcase
This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.
data-analysis data-science data-visualization excel portfolio python r tableau
Last synced: 06 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends
Last synced: 27 Apr 2026
https://github.com/gattupalli-saketh/sentiment-analysis-on-products-
Product reviews sentiment analysis.
data-analysis machine-learning nlp review-analysis sentiment-analysis sentiment-classification
Last synced: 18 Apr 2026
https://github.com/mmfava/significados-aulas-biologia-quasiexp-2019
Repositório das análises realizadas para o paper "Construção de significados em aulas práticas de laboratório de biologia: uma avaliação por delineamento quase-experimental".
Last synced: 28 Jun 2025
https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification
This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.
classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn
Last synced: 06 May 2026
https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert
Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.
bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization
Last synced: 05 Apr 2025
https://github.com/farzeen-2001/superstore_analysis_sql
Anaylsed the superstore Data using SQl
Last synced: 15 Apr 2025
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 02 May 2026
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 08 Mar 2026
https://github.com/shreshthvashisht/xyz-ads-airing-report_analysis
Ad data analysis using Advanced Excel
ad-airing-analysis advanced-excel data-analysis data-visualization pivot-tables
Last synced: 18 Feb 2026
https://github.com/azaz9026/data_cleaning
Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.
data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 29 Jul 2025
https://github.com/abdelmajidlh/cours
Cours Data engineering et data analyse.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 06 May 2026
https://github.com/nevermendel/revolut-analysis
Python script to analyse Revolut transactions
data-analysis revolut revolut-analysis
Last synced: 12 Apr 2025
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/kishorep26/school-recommendation-system
Intelligent school recommendation system that matches students with suitable educational institutions based on preferences and performance metrics
bootstrap data-analysis decision-support edtech education education-technology flask matching-algorithm python recommendation-system school-finder school-search student-portal web-application
Last synced: 06 May 2026
https://github.com/asghar-rizvi/youtube-statistics-project
This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.
data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn
Last synced: 13 Jun 2026
https://github.com/lawwrites/uncovering_fintech_user_insights
Linear Regression and K-Means to obtain user behavior insights for a fintech company
data-analysis data-science kmeans-clustering linear-regression python unsupervised-machine-learning
Last synced: 22 May 2026
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 06 May 2026
https://github.com/cnoret/retail-data-analysis
Let's analyze historical sales data from a large retail chain and predict weekly sales using machine learning on a Streamlit web app
data-analysis data-analyst data-science data-vizualisation pandas python streamlit streamlit-webapp
Last synced: 10 Apr 2026
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
https://github.com/tushar2704/loan-limits-by-country
This project aims to leverage a diverse dataset encompassing economic indicators, demographic factors, and credit history to establish a predictive model. By establishing appropriate loan limits, financial institutions can enhance risk management, ensure responsible lending, and promote financial inclusivity.
artificial-intelligence data-analysis data-science loan project tushar2704
Last synced: 30 Oct 2025
https://github.com/josepablodmg/python--linear-regression-advertising
A linear regression analysis to predict sales based on advertising spending across TV, radio, and newspaper channels. The project includes exploratory data analysis, model training, coefficient visualization, and residual analysis.
advertising data-analysis exploratory-data-analysis linear-regression machine-learning python regression scikit-learn visualization
Last synced: 06 May 2026
https://github.com/ssoehdata/sql_for_data_science_specialization_course
Materials and Certifications from the SQL for DataScience Course
data-analysis data-science database databricks postgresql sql sqlite
Last synced: 10 Apr 2026
https://github.com/tushar2704/sql-query
Repository is designed to help you strengthen your SQL query skills by providing a collection of common and interview-based SQL queries for practice.
artificial-intelligence data-analysis data-engineering data-science database database-management database-schema relational-databases sql sql-database sql-query tushar2704
Last synced: 04 Nov 2025
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/onome-joseph/flexisaf
Generative AI & Data Science
data-analysis data-science machine-learning
Last synced: 16 Sep 2025
https://github.com/laudebugs/fec-data-analysis-2020
The project aimed to determine the total sum of contributions to the candidate committees as well as the number of contributions made by individuals.
data-analysis fec presidential-candidates
Last synced: 16 May 2026
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 15 May 2025
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/farhad-here/data-visualization-analysis-dva
This is my data analysis project. Users can use this project to clean and preprocessing the date or data visualization. Individuals can impute or ecnode ther dataset.
altair bokeh data-analysis data-analysis-python io matplotlib numpy pandas plotly python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/dimits-ts/college_analysis
A statistical study about US college admissions, featuring a full report in LaTeX.
anova data-analysis exploratory-data-analysis linear-regression statistics
Last synced: 25 Jan 2026
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/antoniszks/music-category-identifier
A 'Data-Science & Machine Learning' project where we are training a neural network to identify what kind of music we give to it. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 25 Feb 2025
https://github.com/dsrodrigovieira/favoritasales
Este repositório contém o projeto desenvolvido para o desafio do kaggle "Store Sales - Time Series Forecasting. Use machine learning to predict grocery sales"
data-analysis data-science kaggle-competition machine-learning python telegram-bot xgboost-regression
Last synced: 05 May 2026
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 24 Jan 2026
https://github.com/shivam5509/power-bi-project
Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.
advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server
Last synced: 11 Apr 2026
https://github.com/vimlesh-gupta/blinkit_data_analytics_project
End-to-end Blinkit data analytics project using Python, SQL Server & Power BI
blinkit data-analysis eda pandas powerbi python sql-server
Last synced: 06 May 2026
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 04 Jun 2026
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 28 Jun 2025
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 11 Jun 2026
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations like scatter plots, bar charts, and identifies top and bottom performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 06 May 2026
https://github.com/kath92/my_data_projects
My data projects.
data-analysis data-vizualisation nlp-machine-learning poewrbi python tableau
Last synced: 23 Mar 2025
https://github.com/lotfiferaga/google-play-store-sentiment-analysis
Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.
data-analysis data-visualization googleplayservices python reviewsanalysis-nlp
Last synced: 26 Feb 2025
https://github.com/valna/mercado-play
Mercado Play (streaming service of @mercadolibre) redesign with Astro.
astro data-analysis data-manipulation data-science data-visualization husky javascript justd mercado-play mit-license prettier react rocketseat typescript
Last synced: 08 Apr 2026
https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression
This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.
data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn
Last synced: 08 Apr 2026
https://github.com/grlyntng/rpims
Django Code and documentation for the Retail Pharmacy Inventory Management System (best final year project award)
data-analysis django erp forecasting-models lstm-neural-networks reporting
Last synced: 26 May 2026
https://github.com/zborovskaanna/dou-salary-analysis
Python data analysis project focused on improving data manipulation skills using Pandas
Last synced: 26 Feb 2025
https://github.com/l337x911/simulations
data analysis via in silico simulations
data-analysis machine-learning python3
Last synced: 06 Apr 2025
https://github.com/s1m0n38/cr-analysis
An exercise in data collection/analysis
clash-royale data-analysis data-collection data-science
Last synced: 08 Jul 2025
https://github.com/stribedi-94/comcast-telecom-consumer-complaints-data-analysis
Comcast Telecom Consumer Complaints Analysis
comcast-telcom-complaints data-analysis data-science data-visualization python
Last synced: 26 May 2026
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Projeto em que percorro as etapas que compõem o ciclo de vida da ciência de dados a fim de prever o atrito de clientes do serviço de cartões de crédito de um banco.
data-analysis data-science data-visualization machine-learning python
Last synced: 06 May 2026
https://github.com/macnianios/salifort-motors_retention
Google Advanced Data Analytics Capstone: Analyzing customer retention at Salifort Motors.
data-analysis machine-learning pandas python seaborn sklearn
Last synced: 08 Apr 2026
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 08 Apr 2026
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/kaushik-puttaswamy/airline-passenger-referral-prediction-using-machine-learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 22 Mar 2025
https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization
This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by excel,interactive Tableau dashboards and SQL analysis.
data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards
Last synced: 23 Mar 2025