Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/szapp/magistdataanalysisvisualization
Case study: Data analysis and visualization to evaluate and recommend a business partnership. Team project for data-driven business with SQL and Tableau
dashboard data-analysis data-science data-visualization data-viz sql tableau-public
Last synced: 19 Mar 2026
https://github.com/pradeepchegur/seamantic_web_design
We designed a semantic web for Instagram in Wix platform.
data-analysis framework instagram semantic-web website-design wix
Last synced: 19 Mar 2026
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 25 Feb 2026
https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis
The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.
codebasics data-analysis data-science data-visualization mathematics python python3 statistics
Last synced: 30 Apr 2026
https://github.com/denisecase/nlp-03-text-exploration
Exploratory analysis of text corpora using tokenization, frequency, co-occurrence, and bigrams to reveal structure in text.
bigrams co-occurence corpus-analysis data-analysis nlp python text-analysis text-exploration tokenization
Last synced: 02 Jun 2026
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/nafisalawalidris/hici-african-foods
HiCi African Foods: Excel dashboard & pivot table analysis of EU food rejection data to identify risks & recommend focus areas for market expansion.
data-analysis data-cleaning data-visualization eu-food-rejection excel-dashboard hici-african-foods market-expansion pivot-tables
Last synced: 19 Mar 2026
https://github.com/karthikudyawar/passwordometer
To predict the strength of the password
cybersecurity data-analysis data-visualization dataset docker exploratory-data-analysis-eda fastapi jupyter-notebook mongodb password-security password-strength-meter
Last synced: 30 Apr 2026
https://github.com/alcestide/scianalytics
Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/tynoee/record_company-database
A record company database with multiple query commands using SQL
Last synced: 31 Jan 2026
https://github.com/ganeshkumartk/ncov-2019
[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019
corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus
Last synced: 05 Jun 2026
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/edisedis777/duckdb-analyzer
A powerful tool for analyzing large CSV datasets using DuckDB.
csv data-analysis database duckdb
Last synced: 16 Apr 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/mkk-1817/hr-attrition
This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python
Last synced: 03 May 2026
https://github.com/affec-ds/dashboard-ventas-vinilos
Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.
business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records
Last synced: 30 Apr 2026
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 19 Mar 2026
https://github.com/leosimoes/datascienceacademy-powerbi-3.0
Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.
business-intelligence dashboards data-analysis data-visualization microsoft-power-bi
Last synced: 19 Mar 2026
https://github.com/seekinginfiniteloop/fedcal
A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.
data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python
Last synced: 15 Apr 2026
https://github.com/mauriceling/sipy
Python-Based Statistical Graphical User Interface for Python
data-analysis julia julia-language jupyter jupyter-kernels pandas pandas-python python python3 r r-packages r-project r-stats scikit-learn scipy scipy-stats statistical-analysis statistical-tests statistics
Last synced: 15 Apr 2026
https://github.com/shadan100/sales-prediction-analysis
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction
Last synced: 01 Mar 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/kinshuk-code-1729/data-visualisation-using-python
This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics
data-analysis data-science data-visualization matplotlib visualization
Last synced: 02 Jun 2026
https://github.com/shubham14p3/python-word-cloud
Simple python application to create word cloud.
data data-analysis data-science data-visualization nbextension python-3 upload-file
Last synced: 01 May 2026
https://github.com/stastnypremysl/lsql-csv
lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.
csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell
Last synced: 25 Feb 2026
https://github.com/jo-phil/hopkins-statistic
A Python package for computing the Hopkins statistic to assess clustering tendency.
cluster-analysis cluster-tendency clustering data-analysis data-science hopkins-statistic machine-learning python statistical-analysis statistics
Last synced: 13 May 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/hetuvpatel/research-chatgpt
Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖
data-analysis matplotlib pandas python seaborn
Last synced: 01 May 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 24 Apr 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
best dataset for data analysis. Pizzahut data analysis done by Shubham Gupta in MySql. This dataset is provided by friend of mine intern at pizzahut. In pizzahut, they used this dataset to train and ask question. This data does not reveal anything about the pizzahut. It is safe to share. data
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/antononcube/wl-datareshapers-paclet
Wolfram Language (aka Mathematica) paclet for data reshaping functions, like, long- and wide form, cross tabulation, etc.
contingency-table cross-tabulation data-analysis data-transformation long-form wide-form
Last synced: 20 Mar 2026
https://github.com/antononcube/wl-quantileregression-paclet
Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.
data-analysis machine-learning quantile-regression time-series time-series-analysis
Last synced: 20 Mar 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/tawfikhammad/data-analysis-projects
Data visualization and analysis
data-analysis data-science data-visualization matplotlib plotly seaborn
Last synced: 14 May 2026
https://github.com/mr-chang95/loan_data_visualization
Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree
Last synced: 24 Apr 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/ac12644/fractz-ai-data-analyst
Analyze data and gain insights instantly with FRACTZ's AI Data Analyst. Flexible, fast analytics tailored to your needs.
ai data-analysis data-visualization
Last synced: 01 Feb 2026
https://github.com/nmsby/pca-machine-learning-lab
Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.
data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization
Last synced: 01 May 2026
https://github.com/pedestriandynamics/cloudfast-dl4pude
A Cloud-based Deep Learning System for Improving Crowd Safety at Event Entrances
anomaly-detection artificial-intelligence cloud-environment computer-vision convolutional-neural-network crowd-behavior-analysis data-analysis data-visualisation deep-learning live-camera machine-learning
Last synced: 01 May 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/heiderjeffer/evaluating-rule-offsetting-schemes-for-sustainable-policy-growth-in-modern-democracies
Python Java. Research Proposal RP
artificial-intelligence data-analysis data-collection data-merging python qualitative-data-analysis quantitative-analysis statistical-analysis
Last synced: 09 Jun 2026
https://github.com/adagio/ivoox_episodes
iVoox Episodes: Scraping & Analysis
beautifulsoup4 data-analysis ivoox pandas python python3 scraping
Last synced: 20 Apr 2026
https://github.com/lijesh010/globalsuperstoresalesanalysis
The Global Superstore Sales Analysis repository showcases a comprehensive Power BI dashboard that provides valuable insights into sales performance. This project is designed to present key information and trends to stakeholders, enabling informed decision-making.
dashboard data-analysis data-visualization msexcel power-bi sales-analysis
Last synced: 19 Mar 2026
https://github.com/tnleite/projeto_king_lift
Este projeto apresenta uma análise detalhada dos dados financeiros da King Lift, uma empresa de locação de empilhadeiras. Utilizando Microsoft Excel, Power Query e Power Pivot, desenvolvi um dashboard interativo, também em Excel, que ajuda a empresa a obter insights valiosos para melhorar a eficiência operacional e aumentar o faturamento.
data-analysis data-science data-visualization excel
Last synced: 19 Mar 2026
https://github.com/harshmule1/store-sales-analysis
Sales Analysis Using Power Bi
Last synced: 19 Mar 2026
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/jrbourbeau/cr-composition
IceCube cosmic-ray composition analysis
cosmic-rays data-analysis machine-learning physics python
Last synced: 20 Apr 2026
https://github.com/sunnybibyan/random_data_generation
A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.
data-analysis data-visualization python random-data-generation statistics streamlit-webapp
Last synced: 13 Jun 2026
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 13 Jun 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/chandansoren/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 06 Jun 2026
https://github.com/rakumar99/power-bi-projects
This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.
dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports
Last synced: 04 Jun 2026
https://github.com/lisa-ho/breadit
Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.
data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping
Last synced: 01 May 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/datavil/framex
A light-weight, dataset obtaining library for fast prototyping, tutorial creation, and experimenting.
data-analysis data-fetching data-science dataframe datasets visualization
Last synced: 06 Jun 2026
https://github.com/soufianboukir/ecom-analytics-platform
End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.
data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries
Last synced: 14 Jun 2026
https://github.com/denizkarya1999/investor_data
Analyzing investor data (CIS 422 Term Project)
academic-project data-analysis database-management investments money research young-investors
Last synced: 19 Mar 2026
https://github.com/nfaltir/youtube-channel-analysis
Youtube API channel Analysis using pandas
data-analysis data-science data-visualization google webscraping youtube youtube-api
Last synced: 02 May 2026
https://github.com/lebrancconvas/data-playground
Data Science and Analysis Playground.
data-analysis data-science jupyter-notebook numpy pandas python python3 seaborn statistics
Last synced: 16 Apr 2026
https://github.com/dogan-the-analyst/model_car_warehouse_analysis
This is a SQL project.
Last synced: 15 Jun 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/nicholaskross/yt-pscore-analysis
Analysis of the Oct 2019 p-score dataset
analytics data-analysis data-cleaning social-media-analysis youtube youtube-channel
Last synced: 27 Feb 2026
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studys in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis
📊 Financial Analysis & Forecasting Processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.
data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards
Last synced: 15 Jun 2026
https://github.com/com-480-data-visualization/project-2023-the-vizards
Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).
buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl
Last synced: 01 May 2026
https://github.com/samruddhi3012/customer-behavior-analysis
Hello there! This repo contains python project based on E-Commerce Customer Behavior analysis.
customer-segmentation customerbehavior data-analysis ecommerce python
Last synced: 02 May 2026
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/kaushik0911/jubilant-guide
A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.
data-analysis data-visualization geospatial-analysis python streamlit
Last synced: 16 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/rakumar99/jp-morgan-chase-virtual-internship
This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel
conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup
Last synced: 02 Mar 2026
https://github.com/swarnim1812/crime_project
AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.
analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting
Last synced: 31 Jan 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/wrighang/shipping-data-analysis
Independent Project: Transit time trends analysis following a major shipping process change.
data-analysis matplotlib numpy pandas python
Last synced: 18 Apr 2026
https://github.com/fdtomasi/regain-applications
Containers for notebooks and data where REGAIN has been used.
algorithms data-analysis latent-variable-models machine-learning minimization network-inference regain sklearn time-series
Last synced: 16 Apr 2026
https://github.com/snehilk1312/data_science
This Repository contains the Data Science things I have done in recent times along with visualization , cleaning , models, statistics, Courses, Datasets. :=)
data-analysis data-science glove natural-language-processing nlp nltk statistics word2vec
Last synced: 02 Apr 2026
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 18 Jun 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/film2549/data-analysis-of-a-simulated-marketing-business-case-using-python-sql-and-power-bi
Data Analysis of a Simulated Marketing Business Case Using Python, SQL and Power BI
chulalongkorn computer-engineering computer-science data-analysis data-visualization database marketing nltk-library pandas powerbi pyodbc python simulation sql sqlserver
Last synced: 01 May 2026
https://github.com/archie-cm/credit_risk_model_vix_id-x_partners
The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments
credit-risk data-analysis data-visualization machine-learning scorecard
Last synced: 01 May 2026
https://github.com/duoan/machine-learning-notebook
A notebook repository for tracking learning machine learning notebook.
data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost
Last synced: 18 Jun 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/melogabriel/nubank-expenses-analysis
This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.
data-analysis data-visualization googlesheets lookerstudio pandas python
Last synced: 02 May 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/nurfakhri/e-commerce-data-analyst
E-commerce data analysis supported by data wrangling, EDA, and web dashboard
dashboard data-analysis e-commerce flask-application python
Last synced: 10 Feb 2026
https://github.com/lebrancconvas/how-much-love-in-thai-song
How much Love song among the Thai Songs?
data-analysis side-project web-scraping
Last synced: 19 Jun 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/seankwarren/water-quality-analysis
An examination of water quality in the Atlanta watershed with a focus on identifying neglected areas and potential strategies for improving water quality monitoring
analytics data-analysis jupyter-notebook python
Last synced: 03 May 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026