Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
- JSON Representation
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/hetuvpatel/research-chatgpt
Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖
data-analysis matplotlib pandas python seaborn
Last synced: 01 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/tawfikhammad/data-analysis-projects
Data visualization and analysis
data-analysis data-science data-visualization matplotlib plotly seaborn
Last synced: 14 May 2026
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 19 Mar 2026
https://github.com/leosimoes/datascienceacademy-powerbi-3.0
Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.
business-intelligence dashboards data-analysis data-visualization microsoft-power-bi
Last synced: 19 Mar 2026
https://github.com/averma205/national-power-outages-severity-analysis
DSC 80 final project at UCSD
data-analysis data-science geospatial-data pandas predictive-modeling python sklearn
Last synced: 09 Feb 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/shadan100/sales-prediction-analysis
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction
Last synced: 01 Mar 2026
https://github.com/nmsby/pca-machine-learning-lab
Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.
data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization
Last synced: 01 May 2026
https://github.com/pedestriandynamics/cloudfast-dl4pude
A Cloud-based Deep Learning System for Improving Crowd Safety at Event Entrances
anomaly-detection artificial-intelligence cloud-environment computer-vision convolutional-neural-network crowd-behavior-analysis data-analysis data-visualisation deep-learning live-camera machine-learning
Last synced: 01 May 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2
Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.
data data-analysis deloitte excel forensic-analysis
Last synced: 06 Feb 2026
https://github.com/sunnybibyan/random_data_generation
A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.
data-analysis data-visualization python random-data-generation statistics streamlit-webapp
Last synced: 13 Jun 2026
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 13 Jun 2026
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 01 Feb 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/adagio/ivoox_episodes
iVoox Episodes: Scraping & Analysis
beautifulsoup4 data-analysis ivoox pandas python python3 scraping
Last synced: 20 Apr 2026
https://github.com/soufianboukir/ecom-analytics-platform
End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.
data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries
Last synced: 14 Jun 2026
https://github.com/lisa-ho/breadit
Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.
data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping
Last synced: 01 May 2026
https://github.com/antononcube/wl-datareshapers-paclet
Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.
contingency-table cross-tabulation data-analysis data-transformation long-form wide-form
Last synced: 20 Mar 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/dogan-the-analyst/model_car_warehouse_analysis
This is a SQL project.
Last synced: 15 Jun 2026
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studys in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/antononcube/wl-quantileregression-paclet
Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.
data-analysis machine-learning quantile-regression time-series time-series-analysis
Last synced: 20 Mar 2026
https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis
📊 Financial Analysis & Forecasting Processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.
data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards
Last synced: 15 Jun 2026
https://github.com/jrbourbeau/cr-composition
IceCube cosmic-ray composition analysis
cosmic-rays data-analysis machine-learning physics python
Last synced: 20 Apr 2026
https://github.com/rakumar99/power-bi-projects
This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.
dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports
Last synced: 04 Jun 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/kaushik0911/jubilant-guide
A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.
data-analysis data-visualization geospatial-analysis python streamlit
Last synced: 16 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/nakshjainsonigara/vba-canteenmanagementsystem
The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.
canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word
Last synced: 30 Jan 2026
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 25 Feb 2026
https://github.com/com-480-data-visualization/project-2023-the-vizards
Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).
buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl
Last synced: 01 May 2026
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/lijesh010/globalsuperstoresalesanalysis
The Global Superstore Sales Analysis repository showcases a comprehensive Power BI dashboard that provides valuable insights into sales performance. This project is designed to present key information and trends to stakeholders, enabling informed decision-making.
dashboard data-analysis data-visualization msexcel power-bi sales-analysis
Last synced: 19 Mar 2026
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 18 Jun 2026
https://github.com/tnleite/projeto_king_lift
Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.
data-analysis data-science data-visualization excel
Last synced: 19 Mar 2026
https://github.com/harshmule1/store-sales-analysis
Sales Analysis Using Power Bi
Last synced: 19 Mar 2026
https://github.com/duoan/machine-learning-notebook
A notebook repository for tracking learning machine learning notebook.
data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost
Last synced: 18 Jun 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/film2549/data-analysis-of-a-simulated-marketing-business-case-using-python-sql-and-power-bi
Data Analysis of a Simulated Marketing Business Case Using Python, SQL and Power BI
chulalongkorn computer-engineering computer-science data-analysis data-visualization database marketing nltk-library pandas powerbi pyodbc python simulation sql sqlserver
Last synced: 01 May 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/archie-cm/credit_risk_model_vix_id-x_partners
The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments
credit-risk data-analysis data-visualization machine-learning scorecard
Last synced: 01 May 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/denizkarya1999/investor_data
Analyzing investor data (CIS 422 Term Project)
academic-project data-analysis database-management investments money research young-investors
Last synced: 19 Mar 2026
https://github.com/lebrancconvas/data-playground
Data Science and Analysis Playground.
data-analysis data-science jupyter-notebook numpy pandas python python3 seaborn statistics
Last synced: 16 Apr 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public api specifically designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, this API offers a seamless and efficient method to process chat data and returns the processed result data in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 20 Jun 2026
https://github.com/nafisalawalidris/data-analysis-with-python
This repo features Jupyter Notebook labs for learning data analysis with Python. Explore data acquisition, wrangling, visualization, modeling, and evaluation. Enhance your skills in Python data analysis.
data-acquisition data-analysis data-science data-wrangling exploratory-data-analysis feature-engineering machine-learning model-development model-evaluation-and-refinement pandas
Last synced: 02 May 2026
https://github.com/markmusic27/data-statistics-calculator
💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.
data-analysis standard-deviation statistics statistics-calculator
Last synced: 20 Jun 2026
https://github.com/vhawk19/ambaan
just wants the average analyst to be happi
data-analysis duckdb-wasm sql vue
Last synced: 01 Mar 2026
https://github.com/rakumar99/jp-morgan-chase-virtual-internship
This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel
conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup
Last synced: 02 Mar 2026
https://github.com/wrighang/shipping-data-analysis
Independent Project: Transit time trends analysis following a major shipping process change.
data-analysis matplotlib numpy pandas python
Last synced: 18 Apr 2026
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 02 May 2026
https://github.com/sunnybibyan/marketing_campaign_analysis_power_bi_dashboard
Campaign Performance Analysis This project analyzes the performance of Spring, Summer, and Fall marketing campaigns, revealing key insights and actionable recommendations.
data-analysis data-visualization dax marketing-campaign powerbi
Last synced: 19 Mar 2026
https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice
A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.
climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather
Last synced: 02 May 2026
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 22 Apr 2026
https://github.com/titanscouting/tra-analysis
Titan Robotics 2022 Strategy Team Analysis Repository
data-analysis frc frc-scouting hacktoberfest python
Last synced: 29 Jan 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/heiderjeffer/evaluating-rule-offsetting-schemes-for-sustainable-policy-growth-in-modern-democracies
Python Java. Research Proposal RP
artificial-intelligence data-analysis data-collection data-merging python qualitative-data-analysis quantitative-analysis statistical-analysis
Last synced: 09 Jun 2026
https://github.com/melogabriel/nubank-expenses-analysis
This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.
data-analysis data-visualization googlesheets lookerstudio pandas python
Last synced: 02 May 2026
https://github.com/nicholaskross/yt-pscore-analysis
Analysis of the Oct 2019 p-score dataset
analytics data-analysis data-cleaning social-media-analysis youtube youtube-channel
Last synced: 27 Feb 2026
https://github.com/kalebers/economic_analysis_data_science
Data Analysis Python project using economy data base to predict percentage of good and bad payers
data-analysis data-science machine-learning pandas python scipy sklearn-library
Last synced: 18 Apr 2026
https://github.com/nafisalawalidris/international-breweries
This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.
data-analysis insights international-breweries-dataset queries sql
Last synced: 31 Jan 2026
https://github.com/samruddhi3012/customer-behavior-analysis
Hello there! This repo contains python project based on E-Commerce Customer Behavior analysis.
customer-segmentation customerbehavior data-analysis ecommerce python
Last synced: 02 May 2026
https://github.com/gaurav-van/house_price_predictor_streamlit_web_app
Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project
data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit
Last synced: 02 May 2026
https://github.com/denisecase/nlp-03-text-exploration
Exploratory analysis of text corpora using tokenization, frequency, co-occurrence, and bigrams to reveal structure in text.
bigrams co-occurence corpus-analysis data-analysis nlp python text-analysis text-exploration tokenization
Last synced: 02 Jun 2026
https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration
Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.
data-analysis data-visualization dataframe mysql pandas plotly python streamlit
Last synced: 02 May 2026
https://github.com/seankwarren/water-quality-analysis
An examination of water quality in the Atlanta watershed with a focus on identifying neglected areas and potential strategies for improving water quality monitoring
analytics data-analysis jupyter-notebook python
Last synced: 03 May 2026
https://github.com/nurfakhri/e-commerce-data-analyst
E-commerce data analysis supported by data wrangling, EDA, and web dashboard
dashboard data-analysis e-commerce flask-application python
Last synced: 10 Feb 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/ganeshkumartk/ncov-2019
[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019
corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus
Last synced: 05 Jun 2026
https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel
Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.
business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database
Last synced: 05 Jun 2026
https://github.com/nfaltir/youtube-channel-analysis
Youtube API channel Analysis using pandas
data-analysis data-science data-visualization google webscraping youtube youtube-api
Last synced: 02 May 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/xuri/excelize-cs
Excelize is a C# port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
agent ai chart csharp data-analysis data-science data-visualization excel excelize formula microsft office ooxml parser spreadsheet xlsm xlsx
Last synced: 03 Mar 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/datavil/framex
A light-weight, dataset obtaining library for fast prototyping, tutorial creation, and experimenting.
data-analysis data-fetching data-science dataframe datasets visualization
Last synced: 06 Jun 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/chandansoren/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 06 Jun 2026
https://github.com/fdtomasi/regain-applications
Containers for notebooks and data where REGAIN has been used.
algorithms data-analysis latent-variable-models machine-learning minimization network-inference regain sklearn time-series
Last synced: 16 Apr 2026
https://github.com/mr-chang95/loan_data_visualization
Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree
Last synced: 24 Apr 2026
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 24 Apr 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/cs-joy/pandasv2.0.3
learn data analysis with pandas
data-analysis pandas pandas-learning
Last synced: 03 May 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/athityakumar/btp
btech btp daru data-analysis networkx nlp project python ruby
Last synced: 24 Apr 2026