Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/frankelavsky/security-dash-challenge
I had two 8-hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on a line graph. Tab two: a data partitioning method keeps in-memory usage low. Tab three: "Failed" vs. "Successful" attempts shown as positive/negative bar charts over time. I used d3.js, require, the MVC pattern, and vanilla JS.
client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization
Last synced: 14 Apr 2026
https://github.com/nullmaster7/btk-pythontensorflow-ozet
data data-analysis python tensorflow-examples
Last synced: 19 Jan 2026
https://github.com/krzysikd/uber_fare_prediction
Predicting Uber fares using advanced machine learning models and feature engineering techniques
data-analysis data-processing eda hyperparameter-tuning jupyter machine-learning regression-models
Last synced: 02 Apr 2025
https://github.com/shibbir24/a-data-driven-approach-to-food-security-and-supermarket-accessibility
A Data-Driven Approach to Food Security and Supermarket Accessibility
data-analysis matplotlib numpy pandas python3 seaborn
Last synced: 13 Apr 2026
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for the Dog Breed Identification competition on Kaggle
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 09 May 2026
https://github.com/chiragkumargohil/co2-emissions-data-analysis
A Python programme that analyses CO2 emission data from 1997 to 2010. It prints data, provides a brief summary of a given year, displays and compares Year vs. Emission graphs for chosen countries, and generates a separate data file for chosen countries. It was a self-paced project provided by Guru 99.
co2-emission data-analysis matplotlib python
Last synced: 28 Aug 2025
https://github.com/jiwookseo/natural_language_analysis
API sample for Google Natural Language and ECOS (Bank of Korea Economic Statistics System)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025
https://github.com/pyrypp/koivunen-vastaanottoanalyysi
An analysis on warehouse goods receiving
business-intelligence data-analysis interactive-visualizations
Last synced: 11 Oct 2025
https://github.com/vineet416/eda-hr-analytics
EDA on HR-Analytics by PW Skills Data Analytics course
data-analysis data-analysis-python data-analytics data-preprocessing data-processing data-visualization exploratory-data-analysis jupyter-notebook matplotlib-pyplot numpy pandas python seaborn statistical-analysis
Last synced: 14 Apr 2026
https://github.com/europanite/gundam-forest
Random forest analysis of the killed-in-action rate for personnel in the Gundam world, in the style of the Titanic dataset.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores DAX Filter Context, a crucial concept in DAX calculations. The project focuses on Bank Loan Analysis, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/silvermete0r/sdu_hackathon_uss_db_analysis
Smart Data Ukimet Hackathon - "Data Modeling" case Solution - Topic: Store Analysis based on Unified Star Schema
data-analysis data-modeling postgresql python sql unified-star-schema
Last synced: 14 Apr 2026
https://github.com/navp7/pizzasales_powerbi
This project involves creating a comprehensive sales performance dashboard using Power BI to visualize and analyze the sales data of an Italian pizza company.
data-analysis ms-sql-server ms-word powerbi visualization
Last synced: 13 Mar 2026
https://github.com/ndiplacide7/r-project
Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.
data-analysis data-visualization machine-learning-algorithms r
Last synced: 25 Mar 2025
https://github.com/pratanup/bank-customer-churn
Prediction models based on ML and DL, with their performance compared, for identifying churned customers
adaboost-classifier ann churn-prediction data-analysis data-visualization decision-tree-classifier deep-learning deep-learning-algorithms gaussian-naive-bayes-classification gradient-boosting-classifier k-nearest-neighbours logistic-regression machine-learning machine-learning-algorithms random-forest-classifier svc svm-classifier xgboost-classifier
Last synced: 10 Mar 2026
https://github.com/soyuid/bakery-data-analyst
This Bakery Data Analysis project was created to help bakery owners understand their sales patterns. With in-depth data analysis, it is expected to provide useful insights to improve sales and operational strategies.
bakery data-analysis python sales visualization
Last synced: 24 Mar 2025
https://github.com/bhaveshbhakta/flight-price-prediction-using-ml
Flight Price Prediction
data-analysis data-visualization flight-price-prediction machne-learning random-forest
Last synced: 12 Oct 2025
https://github.com/jeffbrennan/analysis-templates
Templates of commonly used graphics/functions/settings to help focus on the bigger picture
Last synced: 12 Oct 2025
https://github.com/akash1070/project--uber-data-analysis
Analyzing Uber data from a dataset using Python
data-analysis data-science python
Last synced: 09 May 2026
https://github.com/mrprajapati18/100-days-of-code-data-science
100 Days of Code Challenge to learn Data Science from scratch! 📊🔍
anaconda-navigator data-analysis data-science data-visualization machine-learning-algorithms pyhton-library python-3
Last synced: 18 Apr 2026
https://github.com/alanjamlu34/bike-dataset
This is the final project for the Dicoding class "Menjadi Data Analist" (Becoming a Data Analyst)
data-analysis streamlit-dashboard
Last synced: 19 Oct 2025
https://github.com/agb2k/twitter-analyzer
Project to extract tweets based on searches, analyze their data, and autocorrect potentially incorrect words
data-analysis python tweepy twitter
Last synced: 13 Oct 2025
https://github.com/madhursinghbhadoriya/atliq_salesdata_analysis-powerbi
AtliqH_SalesData Analysis - Power
dashboard data-analysis powerbi
Last synced: 21 Jan 2026
https://github.com/shellynagar27/good-cabs-data-analysis-project
This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.
critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling
Last synced: 25 Jan 2026
https://github.com/hari7261/playwithdata-python
This is one of the repositories where I have put a lot of data science and machine learning questions and their solutions. I hope you will find something better here than on some other platforms. Thank you, and happy exploring!
data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn
Last synced: 13 Apr 2026
https://github.com/khushi-sabarad/8-week-sql-challenge
Case studies' solutions for the #8WeekSQLChallenge by Danny Ma
8weeksqlchallenge case-study data-analysis mysql sql
Last synced: 06 Sep 2025
https://github.com/jsimell/sleepanalysis
A Python data analysis project examining the factors that affect sleep quality and the temporal patterns in the sleep data of a single subject.
data-analysis matplotlib numpy pandas python scikit-learn seaborn
Last synced: 14 Apr 2026
https://github.com/mateib20/proiect-achizi-ia-i-prelucrarea-datelor
Signal processing, data analysis, and spectral analysis for an audio signal
c c-language c-programming c-programming-language data-analysis data-engineering data-science data-visualization datascience python python-lambda python-library signal-analysis signal-processing
Last synced: 11 Jun 2025
https://github.com/chaganti-reddy/weather-prediction-australia
Creating a fully-automated system that can use today's weather data for a given location to predict whether it will rain at the location tomorrow.
data-analysis logistic-regression machine-learning prediction-model python3
Last synced: 13 Apr 2026
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 Target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/sumit9000/submission-of-web-server-log-analysis-assessment
This project analyzes one year of real-world HTTP access logs from the University of Calgary’s computer science server. Using Python, pandas, and regular expressions, we clean and parse the data to extract meaningful insights and answer 10 analytical questions.
data-analysis data-cleaning eda jupyter-notebook log-parsing pandas python realworld-data regex web-log-analysis
Last synced: 14 Apr 2026
https://github.com/vedanty3/supermarket-sales-data-analysis
This project uses data visualization techniques (with pandas and matplotlib) to explore different aspects of three months of supermarket sales data.
data-analysis data-science jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/manojrathod0777/loan-prediction
Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.
classification-models data-analysis data-science financial-analytics jupyter-notebook loan-prediction machine-learning predictive-modeling python streamlit-app
Last synced: 13 Apr 2026
https://github.com/kathkoeh/pimaindian-kk
Logistic regression analysis of diabetes risk using the Pima Indians dataset. Includes prevalence analysis, modeling, ROC/AUC evaluation, and patient testing in Python.
data-analysis diabetes epidemiology logistic-regression machine-learning public-health python
Last synced: 28 Apr 2026
https://github.com/samkazan/business-analysis-tableau
Business Analysis on Global/Superstore data using Tableau.
analysis data-analysis tableau visualization
Last synced: 08 Feb 2026
https://github.com/ayorick23/python-data-science-cheat-sheet
A quick, practical guide to essential Python syntax, commands, and functions for data science. Perfect for remembering how to use the most common libraries, such as NumPy, Pandas, Matplotlib, and Scikit-learn, in your daily analyses.
cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow
Last synced: 07 Apr 2026
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 14 Oct 2025
https://github.com/marvinmarnold/oipm_stop_search
OIPM's analysis on Stop & Search (frisk) activity by the New Orleans Police Department.
data-analysis frisk new-orleans oipm police search stop
Last synced: 22 Jul 2025
https://github.com/supernyv/data_science_projects
Personal Data Science Projects
data-analysis data-science data-visualization exploratory-data-analysis machine-learning
Last synced: 14 Oct 2025
https://github.com/saisurajmatta/healthcare-data-analytics
Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.
data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery
Last synced: 22 Jan 2026
https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data
Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters
data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data
Last synced: 05 Jul 2025
https://github.com/rohanrony19/movie-recommendation-system
A Python project that uses the Pandas library to compute correlations and give the best movie recommendations.
data-analysis deep-learning knn-algorithm numpy pandas python recommendation-system
Last synced: 14 Apr 2026
https://github.com/pedrosfaria2/analisetitulosnetflix
Study of the popularity of Netflix movies on IMDb.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 14 Apr 2026
https://github.com/leosimoes/datascienceacademy-python
Activities from the DataScienceAcademy course "Fundamentos de Linguagem Python Para Análise de Dados e Data Science (Com ChatGPT)" (Python Language Fundamentals for Data Analysis and Data Science, with ChatGPT).
chatgpt data-analysis data-science python
Last synced: 02 May 2026
https://github.com/prady2309/car-price-prediction
Multiple Linear Regression Project
data-analysis data-science machine-learning python
Last synced: 20 May 2026
https://github.com/sngr0x0/ranklytics-kr
OP.GG Scraping
data-analysis league-of-legends matplotlib opgg playwright-python scraping visualization
Last synced: 16 Oct 2025
https://github.com/anthonytlei/graphsql
Lightweight SQL-to-GraphQL connector for querying GraphQL endpoints using SQL syntax.
connector data-analysis dbapi graphql graphsql python sql sqlalchemy superset
Last synced: 09 Apr 2026
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, data on NBA players from 1950 to 2021 were examined. After the players' seasons, heights, performance, scoring averages, teams, and positions were obtained from CSV files, key tables and graphs were created using data cleaning and data visualization techniques.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/kittonn/data-analysis-freecodecamp
freecodecamp - data analysis projects.
Last synced: 05 Apr 2025
https://github.com/nmelgar/lego_my_data
Data visualization project to sell LEGO bulks.
csv data-analysis data-visualization data-viz google-sheets tableau
Last synced: 08 Jan 2026
https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees
This project demonstrates the use of machine learning models based on decision trees and random forests to classify the Iris dataset. It includes data loading, model training, performance evaluation, and visualization of results. It is intended for learning the basics of machine learning and data analysis.
classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn
Last synced: 17 Oct 2025
https://github.com/katiebuntic/research_methods
Data Science Research Methods
analysis data-analysis data-science python research-project
Last synced: 23 Jun 2026
https://github.com/khulnasoft/data-science-materials
data-analysis data-engineering data-science data-visualization
Last synced: 17 Oct 2025
https://github.com/whoprashant7/querying-a-large-relational-database-using-ms-sql
Analysing data using MS SQL Server
data-analysis ms-sql-server sql
Last synced: 05 Jul 2025
https://github.com/tyriek-cloud/nyc-mobility-survey-analysis
An end-to-end data engineering project in which five NYC DOT datasets were modified in an ETL process and analyzed for insights.
aws aws-athena aws-glue aws-glue-crawler aws-quicksight aws-s3 data-analysis data-engineering etl-pipeline json python
Last synced: 09 May 2026
https://github.com/abhijeet107/task-4
Design an interactive dashboard for business stakeholders.
data-analysis excel-csv tableau-dashboards tableau-public
Last synced: 22 Jan 2026
https://github.com/singhrdeep/croppilot
CropPilot is a lightweight, Python-based command-line tool designed to help small-scale farmers, gardeners, and students manage crop data, track profits, and explore sustainable practices. Built for usability and extensibility.
agriculture data-analysis farm-management open-source python
Last synced: 25 Apr 2025
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/lijesh010/covid-19_global_analytics_power_bi_project
This repository is a data visualization project that offers an in-depth analysis of the Covid-19 pandemic using Microsoft Power BI. This interactive dashboard provides valuable insights into key metrics related to Covid-19 cases, deaths, recoveries, and more, helping users understand the global impact of the pandemic.
dashboard data-analysis data-visualization powerbi report
Last synced: 08 Jan 2026
https://github.com/Kaushik-Puttaswamy/Airline-Passenger-Referral-Prediction-Using-Machine-Learning
This project uses a machine learning model to predict if passengers referred by existing customers will book a flight, helping airlines target likely customers. Key factors like service ratings and value for money drive predictions, achieving over 90% accuracy.
airline-marketing customer-referral-prediction customer-satisfaction data-analysis feature-engineering hyperparameter-tuning machine-learning model-evaluation predictive-analytics
Last synced: 20 Oct 2025
https://github.com/iqbalmind/learn-python-data-scientist
IqbalMind Playground for python data scientist
data data-analysis data-visualization datascience datascientist datascientisttraining python python-playground
Last synced: 16 Mar 2025
https://github.com/mothraa/etl-marketanalysis-webscraping-poo
OC project 2 refactoring (POO version not yet completed)
data-analysis etl poo python web-scraping
Last synced: 20 Oct 2025
https://github.com/bala-1409/tableau-visualization-viz.-project
This repository contains visualization projects built with Tableau. The visualizations yield insights and strategies that help a business grow its profit margins and, in some cases, provide social value by estimating the damage and intensity of calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public visualization
Last synced: 04 Feb 2026
https://github.com/pinedah/sleep-data-analysis-exercise
Analysis of a medical dataset on sleep, exploring duration, quality, and related factors. Includes data cleaning, EDA, and visualizations with Python (pandas, numpy, matplotlib, seaborn, scipy).
data-analysis data-science escom numpy pandas python school-project scipy
Last synced: 13 Apr 2026
https://github.com/mr-chang95/udacity_movie_project
Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook movie python
Last synced: 13 Apr 2026
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 11 Apr 2026
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 23 Jan 2026
https://github.com/scbirlab/hts-tools
🏮 Parsing and analysing platereader absorbance and fluorescence data.
assay-analysis data-analysis fluorescence high-throughput high-throughput-screening platereader
Last synced: 23 Jan 2026
https://github.com/mrfoxak/movie-recommender-system-project
This is a Machine Learning Recommendation System Project
data-analysis machine-learning python recommender-system regression tokenization
Last synced: 13 Apr 2026
https://github.com/dcs-training/spatial_dynamics
Use of QGIS and R to analyse first- and second-order geospatial effects. Go to the Readme file.
data-analysis geographical-data gis qgis r statistics
Last synced: 23 Oct 2025
https://github.com/monish-nallagondalla/algerian_forest_fires
This project predicts forest fires in Algeria using machine learning models. The dataset includes various meteorological and environmental features such as temperature, humidity, and wind speed. The app cleans the data and builds models to predict the likelihood of forest fires based on historical data and environmental conditions.
data-analysis data-science datacleaning flask forest-fire-prediction machine-learning meteorological-data python regression-models ridge-regression
Last synced: 09 May 2026
https://github.com/cezlul/analyse-ventes-immobilier
ML solution for Paris real estate analysis: automatic classification of apartments vs. commercial properties (K-means, 91%) and price prediction (regression, R²=0.98) on 26K transactions. Values a €169M portfolio with data-driven strategic recommendations.
data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn
Last synced: 13 Apr 2026
https://github.com/phomint/udacity_free_datawragling_with_mongodb
Udacity Free course to use MongoDB
data-analysis mongodb udacity-course
Last synced: 11 Jun 2025
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/karlyndiary/coffee-shop-sales-analysis
Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/badranalyst/exploratory-data-analysis-on-salaries-dataset
Performing EDA on a dataset related to salaries, exploring relationships between factors like job titles, industries, and locations. Insights are visualized with plots to identify trends and disparities in salary data.
data-analysis dataset eda exploratory-data-analysis pandas python
Last synced: 07 May 2026
https://github.com/whis99/data_analysis_journey
A repository of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-
This project analyzes airplane crash data (1908 - 2008)✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.
data-analysis data-visualization exploratory-data-analysis
Last synced: 10 Jun 2026
https://github.com/devexpress-examples/winforms-pivot-change-the-field-value-header-appearance-backcolor
This example handles the CustomDrawFieldValue event to fill the header's color.
data-analysis dotnet pivot-grid-for-winforms winforms xtrapivotgrid-suite
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/syarwinaaa09/modeling-car-insurance-claim-outcomes
A data analysis project on car insurance trends using Python and Jupyter Notebook
car-insurance classic-cars data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python seaborn visualization
Last synced: 07 May 2026
https://github.com/joseph-pabian/life-expectancy-
Statistical analysis of life expectancy in developed vs developing countries using SQL and Python
data-analysis duckdb public-health python sql statistics
Last synced: 07 May 2026
https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021
Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.
advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python
Last synced: 07 May 2026
https://github.com/jpgiant/gujaratrainfallanalysis_2021
Analysis of the rainfall that occurred in the districts of Gujarat state in 2021
data-analysis exploratory-data-analysis exploratory-data-visualizations matplotlib numpy pandas-python python
Last synced: 07 May 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation
GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim
content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream
Last synced: 07 May 2026
https://github.com/satyam4229/identify-employee-attrition
This model predicts employee attrition at a company from employee records. The dataset includes features such as salary, environment, age, gender, and experience.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 08 May 2026
https://github.com/riborings/python_projects
Python projects and other programming experiences
data-analysis machine-learning project python regression-analysis
Last synced: 08 May 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/blladerunner/customer-churn-dashboard
Customer Churn Dashboard — SQL + Python analytics project exploring customer retention patterns, churn rate by demographics and services, and key insights for telecom business strategy.
business-intelligence churn-analysis customer-retention dashboard data-analysis data-analytics data-science pandas powerbi python sql sqlite telecom
Last synced: 08 May 2026
https://github.com/fahamidur/cuisine-analysis
This project analyzes recipes from AllRecipes.com to reveal global cooking patterns, nutritional trends, and cultural food differences, offering data-driven insights for food enthusiasts and researchers.
beautifulsoup data-analysis datavisualization pandas selenium tableau-public webscraping
Last synced: 08 May 2026
https://github.com/prakashjha1/stock-investment-analysis
The Stock Investment Analysis project can help investors select better-performing stocks.
data-analysis data-science numpy pandas pandas-datareader parallel-programming python
Last synced: 08 May 2026
https://github.com/danmadeira/algoritmos-estatistica-python
Demonstration of statistics algorithms in Python
algorithms data-analysis data-science python statistics
Last synced: 08 May 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026