Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
https://github.com/kambleakash0/mubi_eda
Mini Project #1 for EAS503 course at SUNY Buffalo
data-analysis data-visualization eda
Last synced: 16 Apr 2026
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
【Sprinkle some star dust on this repo! ⭐️ It's good karma!】A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Apr 2026
https://github.com/suchi25sathavara/r-projects
R projects in real-world scenarios for data analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python
In this project, earthquakes in Turkey from 1915 to 2021 were analyzed. The data set, which you can find in the repo, was first cleaned using data cleaning algorithms. The cleaned data were then rendered as graphics and animations using data visualization algorithms.
data-analysis data-cleaning data-visualization jupyter-notebook
Last synced: 23 Mar 2025
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/mikkelrask/henryrollins-scraper
FANATIC! A dataset of Henry Rollins' listens on his KCRW radio show, with data dating back to 2017 - 496 episodes of weird and rare finds, fast-paced punk and frog sounds. Includes a scraper that keeps the data up-to-date with henryrollins.com
archive data-analysis data-visualization music
Last synced: 29 Jun 2026
https://github.com/manel15279/datamining-project
A university project that aims to explore various data mining techniques like Data Exploration, Association Rule Mining, Supervised and Unsupervised Learning, applied to real-world datasets, focusing on soil fertility analysis and COVID-19 cases evolution over time.
covid-19 data-analysis data-mining data-visualization datascience gradio machine-learning python soil-properties
Last synced: 10 Jun 2025
https://github.com/achique-luisdan/tops-songs-db
Database of the weekly top most-played songs 🎵 on Spotify 🎶. SQL exercises focused on data analysis.
Last synced: 15 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/hemangsharma/job-tracker
A comprehensive Streamlit application for tracking and analyzing job applications.
data-analysis python streamlit-dashboard streamlit-webapp
Last synced: 15 Mar 2025
https://github.com/siddhant2105s/bring-your-own-device-boyd-system
This repository contains the design and implementation of the Bring Your Own Device (BYOD) System for managing personal devices at Life Insurance Company. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.
data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models my-sql relational-databases relational-model sql-queries
Last synced: 15 Feb 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 03 Sep 2025
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql sql-query
Last synced: 04 Mar 2026
https://github.com/k-bloch/car-theft-analysis
A dashboard created to inform the public about car theft, providing insights extracted from real-world police stats.
data-analysis maven-analytics tableau
Last synced: 19 Mar 2026
https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-
This study is based on confirmed COVID-19 cases and deaths collected from Pakistan. We apply AI-based forecasting models such as ARIMA, LSTM, Prophet, and VAR. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight their superior performance compared to the LSTM.
arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression
Last synced: 19 Jun 2026
https://github.com/tejas-130704/dataanalysis-hr-manager
Presence Insights of Employees: this project provides insightful data analysis on employee attendance and presence, including work-from-home (WFH) data, sick leave records, and presence excluding holidays. The analysis spans a three-month period and is visualized using Power BI to help HR managers understand trends and optimize workflow.
dashboard data-analysis data-visualization hr-manager power-bi
Last synced: 01 Mar 2026
https://github.com/arnoudbuzing/iowa-caucus
Data Analysis on 2020 Iowa Caucus results
caucus data-analysis iowa iowa-caucus mathematica primaries primary-election wolfram-language
Last synced: 01 Mar 2026
https://github.com/mahapeth/invest-track
Implementation of a tool for monitoring user activity in the "Invest" information system, for a graduation thesis (VKR) in the program 01.03.02 Applied Mathematics and Informatics
analitycs app data-analysis data-visualization jupyter-notebook python sites
Last synced: 20 Jun 2026
https://github.com/rachit901109/simppl_task
Social Media Analytics Dashboard
dashboard-application data-analysis data-visualization network-graphs social-network-analysis
Last synced: 16 Apr 2026
https://github.com/rizkipragustono/data_analysis_spark
Exploration: Data Analysis using Spark
apache-spark data-analysis pyspark python spark-sql sql
Last synced: 09 May 2026
https://github.com/m4tice/qm_project
Bicycle project crowd evaluation.
data-analysis data-engineering data-visualization
Last synced: 16 Mar 2025
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026
https://github.com/aniruddha-biswas/jpmorgan-chase-excel-internship
JPMorgan Chase & Co.'s Excel Skills on Forage Virtual Internship
conditional-formatting data-analysis data-cleaning data-visualization excel excel-dashboard macos pivot-tables power-query shortcuts storytelling vba-excel
Last synced: 01 Apr 2025
https://github.com/aleks-andrs/bigdataanalytics
Public repository for CM3111: Big Data Analytics Coursework (Meteorite landings analysis)
data-analysis data-science machine-learning
Last synced: 02 Mar 2026
https://github.com/lucaso21/euro-2021-player-stats-analysis
A short project analyzing stats for players at the Euro 2021 tournament.
data-analysis data-science r rvest tidyverse
Last synced: 16 Mar 2025
https://github.com/yash22222/pwc-power-bi-virtual-case-experience
The Power BI PwC Virtual Case Experience is an exciting and educational program designed to provide participants with hands-on exposure to Power BI, a prominent business intelligence and data visualization tool, within the context of consulting at PwC.
business-analyst business-analytics business-intelligence dashboard data-analysis data-analyst data-analytics dax microsoft-power-bi powerbi powerbi-dashboards powerbi-visuals pwc
Last synced: 02 Mar 2026
https://github.com/mayankyadav23/amazon-sales-data-analysis
Diving into Amazon sales data to uncover hidden gems! 📈 Analyzing iNeuron's dataset to optimize sales strategies and boost performance 💡 Driving business growth with data-driven decisions! 💻
amazon data-analysis data-visualization ineuron-ai internship-project
Last synced: 02 Mar 2026
https://github.com/mbarbetti/bachelor-thesis-public
📖 My bachelor thesis at the University of Firenze
bachelor-degree bachelor-degree-thesis bachelor-thesis data-analysis lhcb-experiment particle-physics thesis
Last synced: 02 Mar 2026
https://github.com/gaboelc/analysis-of-the-employment-situation-in-costa-rica-2018-2022
This is an analysis with data extracted from the INEC in order to identify the changes that occurred in the Costa Rican labor market before, during and after the COVID-19 pandemic.
costa-rica data-analysis empleo employment
Last synced: 24 Mar 2025
https://github.com/eubrunoo/beer-consumption-predictor
An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.
data-analysis data-science data-visualization machine-learning r statistical-analysis statistics
Last synced: 02 Apr 2025
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 02 Apr 2025
https://github.com/madusales/powerbi-etl-elt
Through DIO's Bootcamp on Data Analytics & Power BI, I have been studying the use of SQL to build BI solutions. This repository records what I have learned so far about what BI is, types of analysis, ETL, and ELT.
big-data business-intelligence data-analysis powerbi
Last synced: 19 Mar 2026
https://github.com/chaitanyaprasad60/sql-queries
This is a list of complex SQL Queries I have practiced.
data-analysis sql window-functions
Last synced: 03 Mar 2026
https://github.com/urmesthamondal/data_analysis_projects
Portfolio of data analysis projects built using Excel, Python, and SQL, with Power BI used for visualization.
data-analysis pivot-tables powerbi python sql sql-server visualisation
Last synced: 09 May 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If it's useful to you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/guptaachin/airline-sentiment-analysis-from-twitter-feeds
Sentiment analysis of airline service providers from Twitter feeds
classification data-analysis data-science jupyter-notebook machine-learning natural-language-processing pandas pca python sklearn-library tf-idf visualization
Last synced: 09 May 2026
https://github.com/dina-hosny/investigate-a-dataset-project
Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.
csv data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 16 Apr 2026
https://github.com/mugambi645/exploring-ebay-car-sales-data
Exploring the eBay car sales dataset
car-sales data-analysis numpy pandas
Last synced: 16 Apr 2026
https://github.com/ibrahimceyisakar/hotel-finder-streamlit-dashboard
Streamlit dashboard of hotel-finder
data-analysis data-science data-visualization pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/soumya-kushwaha/uber-analysis
data-analysis data-science data-visualization uber-analysis
Last synced: 16 Apr 2026
https://github.com/asghar-rizvi/eda_student_dataset
This repository contains the results of data analysis and exploratory data analysis (EDA) conducted on the Student_Dataset. The analysis focuses on understanding various factors affecting student grades and visualizing these relationships using Matplotlib and Seaborn.
data-analysis data-analysis-python data-science jupyter-notebook python3
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/comsavvy/water-analysis-project
Project on water analysis
data-analysis data-visualization predictive-analytics python water-analysis
Last synced: 21 May 2026
https://github.com/grindelfp/logistic-regression-study
Example of a logistic regression data analysis, with an exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 02 Apr 2026
https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds07
Exam question guide for Data Analytics & Visualization, Job Connector Data Science batch 7
data-analysis data-science data-visualisation exam
Last synced: 04 Mar 2026
https://github.com/adrianlardies/feelms_predict_by_emotion
Feelms is a mood-based movie recommendation app that uses collaborative filtering and machine learning to suggest films based on your emotions. Built with Streamlit and powered by AWS, Feelms personalizes each user's experience through simulated interactions and tailored predictions.
aws-ec2 aws-rds data-analysis data-science machine-learning python streamlit
Last synced: 16 Apr 2026
https://github.com/edanur-y/bank-customer-churn-prediction-with-classification-models
Comparing the performance of multi-layer perceptron, decision tree, random forest, gradient boosting, and extreme gradient boosting classifiers on customer data to predict whether customers will leave the bank.
data-analysis data-transformation hyperparameter-tuning python
Last synced: 16 Apr 2026
https://github.com/bishopce16/school_district_analysis
The school board requested an analysis of the various performance metrics for the school district.
data-analysis jupyter-notebook numpy pandas python visual-studio-code
Last synced: 16 Apr 2026
https://github.com/samuelson777/titanic-dataset-analysis
Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.
data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 16 Apr 2026
https://github.com/santiago-giordano/ahora12project
Excel, SQL, and Python processing of Excel files
data-analysis excel jupyter-notebook microsoft-sql-server pandas sql sqlalchemy sqlserver
Last synced: 16 Apr 2026
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by designing an avant-garde data-tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 04 Mar 2026
https://github.com/malucor/analise_dados_eleicao
Python program for performing an exploratory data analysis of the 2022 election.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis ebac election election-analysis election-data elections eleicoes eleicoes2022 exploratory-data-analysis ipynb jupyter-notebook python
Last synced: 04 Mar 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/danpoynor/omdb-api-data-analysis
Gathers data for Oscar-winning movies using their IMDb IDs, saves the information to a CSV file, and answers a few data analysis questions about the movies using JupyterLab.
analytics csv data-analysis jupyter-notebook matplotlib omdb-api pandas-dataframe python-dotenv python3 seaborn-plots
Last synced: 16 Apr 2026
https://github.com/lotfiferaga/energeiahub
data-analysis data-visualization energy-consumption python streamlit
Last synced: 09 May 2026
https://github.com/e1washere/weather-spark-pipeline
Scalable pipeline using Apache Spark to process and analyze weather data.
apache-spark batch-processing big-data data-analysis data-engineering data-pipeline data-processing etl python spark-sql weather-data
Last synced: 17 Apr 2026
https://github.com/agricolamz/2018_fe_r_statistics
Further Education R course
data-analysis r rstats static teaching teaching-materials
Last synced: 24 Mar 2025
https://github.com/hadeel-13/new_home
New Home is a website for buying and selling real estate based on user preferences. It is my graduation project, which received a grade of 93%.
bootstrap5 chartjs css css3 data-analysis data-mining google-maps html html5 javascript jquery
Last synced: 12 Apr 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into a single notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/shashwat9kumar/us-accidents-data-analysis
Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle
accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/hilalguleryuz/excel_supermarket_data_analysis_project
Supermarket Data Analysis with Excel
dashboard data-analysis data-visualization excel microsoft-excel supermarket supermarket-data-analysis supermarket-dataset
Last synced: 06 Jan 2026
https://github.com/nathadriele/ifood-data-governance-pipeline
This project demonstrates a complete Data Governance solution focused on quality, traceability, security, and LGPD compliance. It uses modern technologies such as Streamlit, Airflow, dbt, and Pydantic to implement a functional, interactive ecosystem with a data governance dashboard.
airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/ruajean/netflixmoviescraper
🎬 A powerful tool for gathering movie data and user reviews from FilmAffinity's Netflix category. This script scrapes movie details and iterates through user reviews, saving structured information to a CSV file for analysis. Ideal for insights into user sentiments and movie popularity on FilmAffinity.
data-analysis data-visualization dataset jupyter-notebook python scraping
Last synced: 17 Apr 2026
https://github.com/mateib20/proiect-achizi-ia-i-prelucrarea-datelor
Signal processing, data analysis, and spectral analysis for an audio signal
c c-language c-programming c-programming-language data-analysis data-engineering data-science data-visualization datascience python python-lambda python-library signal-analysis signal-processing
Last synced: 11 Jun 2025
https://github.com/kathkoeh/pimaindian-kk
Logistic regression analysis of diabetes risk using the Pima Indians dataset. Includes prevalence analysis, modeling, ROC/AUC evaluation, and patient testing in Python.
data-analysis diabetes epidemiology logistic-regression machine-learning public-health python
Last synced: 28 Apr 2026
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 17 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/leosimoes/datascienceacademy-python
Activities from the DataScienceAcademy course Fundamentos de Linguagem Python Para Análise de Dados e Data Science (Com ChatGPT) (Python Language Fundamentals for Data Analysis and Data Science, with ChatGPT).
chatgpt data-analysis data-science python
Last synced: 02 May 2026
https://github.com/eliasdehondt/learn-r
Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.
data-analysis data-visualization education machine-learning programming r statistics tutorials
Last synced: 03 Apr 2026
https://github.com/nicodupont/kaggle
All my data analysis work and competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Apr 2026
https://github.com/zxjahid/matplotlib
A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊
candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx
Last synced: 09 May 2026
https://github.com/mahmoudwal27/manufacturing_downtime
This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data-driven decisions for enhanced productivity.
data-analysis data-analysis-python data-visualization google-colab powerbi python sql
Last synced: 17 Apr 2026
https://github.com/dcs-training/intro-to-statistics
Intro to Statistics workshop. This repo contains the code and files used in the practical part of the workshop, together with the ppt associated with this training. Go to the readme file
data-analysis data-visualisation data-wrangling r statistics
Last synced: 20 Jun 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Code for freeCodeCamp's Data Analysis with Python projects
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/nathaliacosim/migration-patrim
Automation for extracting, converting, and migrating asset data to Betha Sistemas' Patrimônio Cloud system. The project ensures a structured, secure flow of information transfer, using C# (.NET Framework), PostgreSQL, and API integration.
conversion-tool data-analysis data-conversion data-transformation dotnet dotnet-code dotnet-console-app migration-tool
Last synced: 17 Apr 2026
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 17 Apr 2026
https://github.com/victoorv/maladie_cardiaque
Predict whether or not an individual has heart disease.
classification data-analysis data-science data-visualization exploratory-data-analysis heart-disease heart-disease-analysis heart-disease-classification heart-disease-prediction hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-analysis statistical-tests statistics
Last synced: 17 Apr 2026
https://github.com/royungar/sql_chicago_data_analysis_project
SQL-based data analysis project using SQLite, pandas, and Jupyter SQL magic commands. Analyzes crime, school, and census data from Chicago to explore socioeconomic patterns using filtering, joins, aggregation, and subqueries.
aggregation census-data chicago crime-data data-analysis data-engineering education-data ibm jupyter-notebook pandas sql sqlite subqueries
Last synced: 04 Jun 2026
https://github.com/tszon/data-science-projects
All the noteworthy data science projects from my learning journey with DataCamp.
data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning
Last synced: 15 Mar 2025
https://github.com/phomint/udacity_free_datawragling_with_mongodb
Free Udacity course on using MongoDB
data-analysis mongodb udacity-course
Last synced: 11 Jun 2025
https://github.com/royungar/automotive_sales_insights_dashboard
Data visualization project analyzing automotive sales, recalls, and customer sentiment using IBM Cognos Analytics. Features KPIs, treemaps, heatmaps, and advanced visual storytelling techniques.
automotive-industry business-intelligence cognos-analytics csv customer-sentiment dashboard data-analysis data-engineering data-visualization eda excel heatmap ibm kpi recall-analysis sales-data treemap
Last synced: 04 Jun 2026
https://github.com/phanchenh/businessdashboard_powerbiproject_pizzadataset
Pizza Business Performance Analysis and Growth Strategies 2015
business-analytics business-intelligence dashboard data-analysis data-visualization insights powerbi python
Last synced: 17 Apr 2026
https://github.com/vitornegromonte/eda_stroke
Exploratory data analysis of the stroke prediction dataset
data-analysis data-science exploratory-data-analysis kaggle-dataset visualization
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 17 Apr 2026
https://github.com/jatin-s16/digital-marketing
This repository contains raw data for Marketing analysis along with key business questions. I performed data cleaning using Python and its libraries and extracted meaningful insights. The results were then visualised using Tableau to enhance business understanding.
data-analysis data-science python3 tableau
Last synced: 16 Mar 2025
https://github.com/q-viper/blog-notebooks
This repo stores most of my blog posts from dataqoil.com and q-viper.github.io.
data-analysis data-science machine-learning-algorithms timeseries
Last synced: 04 Apr 2026
https://github.com/hugo-hattori/watercraft_values_ai_prediction
Data Science Project.
ai-model artificial-intelligence artificial-intelligence-algorithms data-analysis data-analytics data-science jupyter jupyter-notebook machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas pandas-dataframe pandas-python python seaborn sklearn sklearn-library sklearn-metrics
Last synced: 23 Aug 2025
https://github.com/2013xile/sheethub
Organize, import, export, and concatenate sheet files in a web application.
data-analysis data-wrangler excel sheets
Last synced: 08 Apr 2025
https://github.com/nicovandenhooff/kaggle-competitions
A repository that contains my Kaggle projects.
data-analysis data-visualization deep-learning exploratory-data-analysis kaggle machine-learning matplotlib modeling neural-network numpy pandas seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/pawlo77/airline-performance-data-analysis
Preprocessing of structured data - part of IAD study program, Faculty of Mathematics and Information Science, Warsaw University of Technology
data-analysis data-science visualization
Last synced: 10 May 2026
https://github.com/shrunga92/restaurant_order_analysis_sql
This project is a structured SQL-based analysis of restaurant orders, aimed at deriving key insights from transactional data.
Last synced: 03 Jul 2025
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded via the Kaggle API, followed by data cleaning and analysis.
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026