Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
https://github.com/huseyincenik/looker_studio
Looker Studio
dashboard data-analysis data-visualization looker-studio lookerstudio
Last synced: 03 Mar 2026
https://github.com/steno-aarhus/mediation-analysis-course
Modern mediation analysis for basic, clinical and epidemiological research in diabetes and endocrinology
data-analysis data-analysis-in-r diabetes diabetes-epidemiology mediation-analysis open-educational-resource
Last synced: 03 Mar 2026
https://github.com/grindelfp/logistic-regression-study
Example of logistic regression data analysis, with an accompanying exercise.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 02 Apr 2026
https://github.com/jofaval/melbourne-housing
Data Analysis of the Housing Market in Melbourne, Australia in 2016-2017
data-analysis data-science data-visualization deep-learning google-colab kaggle machine-learning melbourne python xgboost
Last synced: 16 Apr 2026
https://github.com/edanur-y/bank-customer-churn-prediction-with-classification-models
Comparing the performances of multi-layer perceptron, decision tree, random forest, gradient boosting and extreme gradient boosting classifications on customer data to predict their status of exiting the bank.
data-analysis data-transformation hyperparameter-tuning python
Last synced: 16 Apr 2026
https://github.com/abhipatel35/gym-performance-analysis
Analyzing gym performance and user engagement in Arizona using Spark SQL, PySpark, and visualization techniques on the Yelp dataset.
apache-spark asu business-insights data-analysis data-processing-at-scale data-visualization dps gym-analysis rating-patterns sql trend-analysis user-insights yelp-dataset
Last synced: 16 Apr 2026
https://github.com/dpb24/netflix-global-top-10-performance
Using Machine Learning to predict Netflix Global Top 10 viewership trends (Python & R)
data-analysis data-science data-visualization decision-tree-regression gradient-boosting-regressor machine-learning media netflix predictive-analytics predictive-modeling python r random-forest random-forest-regression regression-models sklearn streaming-video xgboost-regression
Last synced: 16 Apr 2026
https://github.com/samuelson777/titanic-dataset-analysis
Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.
data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 16 Apr 2026
https://github.com/ronaessi-28/sales-data-analysis-visualization-project
A comprehensive data analysis and visualization project using Python, Pandas, Matplotlib, Seaborn, and Streamlit. The project explores Superstore sales data to uncover trends, region-wise performance, product category insights, and builds an interactive dashboard.
data-analysis data-visualization eda matplotlib pandas plotly python-project sales-dashboard seaborn streamlit
Last synced: 16 Apr 2026
https://github.com/akash-srm/user-engagement-analysis
Analyzed user engagement and feedback data to derive actionable insights for an online learning platform.
analytics-projects data-analysis data-cleaning eda jupyter-notebook pandas python seaborn student-engagement
Last synced: 16 Apr 2026
https://github.com/marben06/rent-in-germany
Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.
charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte
Last synced: 27 Apr 2026
https://github.com/danpoynor/omdb-api-data-analysis
Gathers data for Oscar-winning movies using their IMDB ids, saves the information to a CSV file, and answers a few data analysis questions about the movies using JupyterLab.
analytics csv data-analysis jupyter-notebook matplotlib omdb-api pandas-dataframe python-dotenv python3 seaborn-plots
Last synced: 16 Apr 2026
https://github.com/yasumorishima/yasumorishima
Manufacturing Engineer & Data Analyst. 17 years exp in MFG. Python, VBA, Automation Specialist. (盛島康徳 / Yasunori Morishima)
automation data-analysis manufacturing portfolio python vba
Last synced: 05 Mar 2026
https://github.com/e1washere/weather-spark-pipeline
Scalable pipeline using Apache Spark to process and analyze weather data.
apache-spark batch-processing big-data data-analysis data-engineering data-pipeline data-processing etl python spark-sql weather-data
Last synced: 17 Apr 2026
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/pizofreude/divvybikes-share-success
Developing a data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).
airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3
Last synced: 17 Apr 2026
https://github.com/dina-hosny/analyze-and-model-airline-system
Analyzing Airline System and Building Data Warehouse Model to Store the Data and Answer Some Business Questions
data-analysis data-modeling data-warehouse datawarehousing dwh plsql sql
Last synced: 05 Mar 2026
https://github.com/shashwat9kumar/us-accidents-data-analysis
Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle
accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python
Last synced: 17 Apr 2026
https://github.com/vaishnavis03/finlatics_ml_program
This repository contains the .ipynb files for 3 datasets, along with a PPT for each. The datasets included are Facebook Marketplace Data, Sales Prediction Data, and Wine Quality data.
correlation data-analysis data-science data-visualization knn linear-regression machine-learning matplotlib numpy pandas random-forest-classifier scikit-learn
Last synced: 17 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/nathadriele/ifood-data-governance-pipeline
This project demonstrates a complete Data Governance solution focused on quality, traceability, security, and LGPD compliance. It uses modern technologies such as Streamlit, Airflow, dbt, and Pydantic to implement a functional, interactive ecosystem with a data governance dashboard.
airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/ruajean/netflixmoviescraper
🎬 A powerful tool for gathering movie data and user reviews from FilmAffinity's Netflix category. This script scrapes movie details and iterates through user reviews, saving structured information to a CSV file for analysis. Ideal for insights into user sentiments and movie popularity on FilmAffinity.
data-analysis data-visualization dataset jupyter-notebook python scraping
Last synced: 17 Apr 2026
https://github.com/jabercrombia/video-game-data
This project integrates FastAPI as the backend and Next.js as the frontend to create a full-stack web application. It processes and displays video game sales data, enabling seamless API communication while maintaining a scalable and efficient architecture.
data-analysis nextjs nintendo playstation python typescript video-game
Last synced: 02 Apr 2026
https://github.com/humayun-raza-030/restaurant-recommendation-system
This project is a Restaurant Recommendation System that helps users find restaurants in Lahore based on their location, customer reviews, and ratings. The system scrapes restaurant data from Google Maps, analyzes user reviews for sentiment, and provides a visualization dashboard using Tableau.
data-analysis data-science data-visualization python
Last synced: 17 Apr 2026
https://github.com/jhrcook/checkplease
Analysis of an immune checkpoint-blockade screen.
bayesian-statistics data-analysis pymc3 python python3 r
Last synced: 17 Apr 2026
https://github.com/nicodupont/kaggle
All my Data analysis and Competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Apr 2026
https://github.com/atlassandx90/cryptocurrency-volatility-prediction
Cryptocurrency volatility prediction ML pipeline
cryptocurrency data-analysis data-science data-visualization machine-learning
Last synced: 17 Apr 2026
https://github.com/mahmoudwal27/manufacturing_downtime
This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data-driven decisions for enhanced productivity.
data-analysis data-analysis-python data-visualization google-colab powerbi python sql
Last synced: 17 Apr 2026
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 04 Apr 2026
https://github.com/ridemountainpig/education-level-data-analysis
An analysis of the relationship between education levels, unemployment rates, and credit card spending in Taiwan's six major cities.
data-analysis matplotlib pandas-python
Last synced: 17 Apr 2026
https://github.com/kgotsosm/fcc-data-analysis
Notebooks created for the Data Analysis Course on freeCodeCamp
data-analysis data-visualization matplotlib pandas seaborn
Last synced: 17 Apr 2026
https://github.com/victoorv/maladie_cardiaque
Predict whether or not an individual has heart disease.
classification data-analysis data-science data-visualization exploratory-data-analysis heart-disease heart-disease-analysis heart-disease-classification heart-disease-prediction hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-analysis statistical-tests statistics
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Predict whether or not an individual is positive for COVID-19.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
An analysis of crime as a function of socio-economic variables, including the selection and comparison of multiple regression models as well as hypothesis tests on the coefficients and on the significance of the models.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/royungar/automotive_sales_insights_dashboard
Data visualization project analyzing automotive sales, recalls, and customer sentiment using IBM Cognos Analytics. Features KPIs, treemaps, heatmaps, and advanced visual storytelling techniques.
automotive-industry business-intelligence cognos-analytics csv customer-sentiment dashboard data-analysis data-engineering data-visualization eda excel heatmap ibm kpi recall-analysis sales-data treemap
Last synced: 04 Jun 2026
https://github.com/vitornegromonte/eda_stroke
Exploratory data analysis in the stroke prediction dataset
data-analysis data-science exploratory-data-analysis kaggle-dataset visualization
Last synced: 17 Apr 2026
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 17 Apr 2026
https://github.com/q-viper/blog-notebooks
This is the repo to store most of my blogs in dataqoil.com and q-viper.github.io.
data-analysis data-science machine-learning-algorithms timeseries
Last synced: 04 Apr 2026
https://github.com/sanam2405/ahs
This contains an analysis of the results of the AHS Madhyamik Examination 2022.
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/yuvrajsaraogi/sales-prediction-using-python
Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.
data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql
Last synced: 19 Apr 2026
https://github.com/nicovandenhooff/kaggle-competitions
A repository that contains my Kaggle projects.
data-analysis data-visualization deep-learning exploratory-data-analysis kaggle machine-learning matplotlib modeling neural-network numpy pandas seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
data-analysis data-visualization matplotlib open-source pandas python3 seaborn
Last synced: 18 Apr 2026
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded via the Kaggle API, followed by data cleaning and analysis.
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/vvhacker007/technocolabs
This repo contains the projects that were assigned to me during the internship.
data-analysis data-science flask heroku-deployment internship machine-learning project streamlit website
Last synced: 18 Apr 2026
https://github.com/akhundmuzzammil/energyconsumptionprediction
This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.
data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization
Last synced: 18 Apr 2026
https://github.com/wang-q/tva
tva: Tab-separated Values Assistant
cli command-line-tool csv data-analysis data-processing etl high-performance rust streaming tabular-data tsv unix-philosophy
Last synced: 05 Apr 2026
https://github.com/manalisbhavsar/mall-customers-clustering
Applies K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score to identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
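The K-Means segmentation described in the entry above can be sketched in a few lines. This is a minimal illustration of Lloyd's algorithm in plain Python, not the repository's code (its tags suggest it uses scikit-learn). The customer values, the starting centroids, and the function name `kmeans` are all made up for the example.

```python
import math

# Hypothetical (annual income in k$, spending score) pairs; not real data.
customers = [
    (15, 80), (18, 85), (20, 78),     # lower income, high spending
    (60, 50), (62, 48), (65, 52),     # mid income, mid spending
    (110, 15), (115, 20), (120, 12),  # higher income, low spending
]

def kmeans(points, centroids, iterations=20):
    """Lloyd's algorithm on 2-D points, starting from the given centroids."""
    centroids = list(centroids)
    labels = []
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c, old in enumerate(centroids):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centroids.append(old)  # leave an empty cluster's centroid in place
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return labels, centroids

# Assumed starting centroids: one point picked from each visible group.
labels, centroids = kmeans(customers, [(15, 80), (60, 50), (110, 15)])
```

In practice the two features would usually be scaled before clustering, since income and spending score are measured on different ranges.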
https://github.com/mtimma001/clinical-trial-data-tool
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 05 Apr 2026
https://github.com/satti-hari-krishna-reddy/data-whisperer
Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. It combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.
ai data-analysis data-visualization llm python3 streamlit
Last synced: 18 Apr 2026
https://github.com/al-ghaly/prosper-loans-analysis
A statistical analysis project that analyzes a finance company’s loan data using Python packages (pandas, NumPy, seaborn, matplotlib).
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 18 Apr 2026
https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis
This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.
business-analytics data-analysis jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/kwokhing/visualizing-datasets-with-facets
Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative
anaconda data-analysis data-visualization facets jupyter-notebook missing-data open-source python skewness unbalanced-data visualisation visualization
Last synced: 18 Apr 2026
https://github.com/bolshovaelizaveta/covid19_spark_analysis
Course project for the discipline 'Databases for Computer Vision'. Development of an analytical platform for epidemiological monitoring of COVID-19 using Apache Hadoop and Spark.
apache-hadoop apache-spark covid-19 data-analysis jupyter-notebook machine-learning medical-imaging pyspark sql
Last synced: 18 Apr 2026
https://github.com/shubh-bharadwaj/income-dataset-analysis
data-analysis data-science pandas python
Last synced: 18 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity
AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.
data-analysis data-visualization deep-learning flask neural-network
Last synced: 19 Apr 2026
https://github.com/rodriguesl1/analise-ibovespa-fiap
Forecasting model for the IBOVESPA index using time-series techniques. The project includes exploratory analysis, seasonal decomposition, stationarity tests, and modeling with Prophet, AutoARIMA, and other statistical models to support investment decisions.
autoarima b3 brasil data-analysis economia finance forecasting ibovespa pandas prophet python statsmodels time-series
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/robertochiosa/automatic-powerpoint-report-rmd
Automatically generate good-looking PowerPoint presentations from a CSV dataset
data-analysis data-science medium medium-article python r
Last synced: 19 Apr 2026
https://github.com/decepticon-ts/cap-ai-studio
A modern, powerful web application for advanced image analysis and batch processing, featuring real-time AI-powered image captioning, comprehensive reporting, and an intuitive user interface. Built with Streamlit and Google's Gemini API.
artificial-intelligence batch-processing computer-vision data-analysis gemini-api image-processing image-processing-python python streamlit streamlit-webapp threading
Last synced: 19 Apr 2026
https://github.com/vyjayanthipolapragada/data_analytics_medical_appointments
Analyzing a data set of medical appointments to draw insights about patient no-show scenarios
data-analysis data-analytics data-cleaning data-visualization data-wrangling jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/diegoglezsu/bulletin-fetcher
bulletin-py is a Python package to easily fetch bulletins and legal acts from a wide variety of European Union sources.
data-analysis european-union legal-documents python sparql
Last synced: 19 Apr 2026
https://github.com/mlucifer27/bilateral-visualization
Streamlit app visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.
d3blocks data-analysis data-visualization plotly-python python streamlit
Last synced: 04 Jun 2026
https://github.com/akash-v7/telecom_customer_churn_prediction
A machine learning project to predict customer churn in the telecom industry using data analysis and classification models. The project includes data preprocessing, exploratory data analysis (EDA), model building, and insights to help telecom companies improve customer retention strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning predictive-modeling python
Last synced: 20 Apr 2026
https://github.com/leftcoastnerdgirl/introduction_to_python
This project provides an introduction to data analysis using Python.
data-analysis data-analysis-python data-analytics data-comparison data-import for-loop jupyter-notebook min-max python
Last synced: 20 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
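One way to test whether a correlation is significant, as the entry above describes, is a permutation test. The sketch below is an assumption about the general approach, not the repository's method: it computes Pearson's r and then checks how often shuffled data produces a correlation at least as strong. The numbers are synthetic placeholders with no real-world meaning, and `pearson_r` and `permutation_p_value` are illustrative names.

```python
import random
from statistics import mean

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5

def permutation_p_value(xs, ys, n_permutations=5000, seed=0):
    """Two-sided p-value: share of shuffles with |r| at least the observed |r|."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = abs(pearson_r(xs, ys))
    shuffled = list(ys)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if abs(pearson_r(xs, shuffled)) >= observed:
            hits += 1
    # The +1 terms count the observed arrangement itself, so p is never zero.
    return (hits + 1) / (n_permutations + 1)

# Synthetic placeholder data, not real measurements.
x = [1, 2, 3, 4, 5, 6, 7, 8]
y = [2.1, 3.9, 6.2, 8.1, 9.8, 12.2, 13.9, 16.1]

r = pearson_r(x, y)
p_value = permutation_p_value(x, y)
```

A small p-value says that a correlation this strong rarely arises when the pairing between the two variables is random. It says nothing about causation.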
https://github.com/dthung1602/goodread-bestbook-prediction
Data analysis - trying to predict the result of the Goodreads Choice Awards
data-analysis goodreads pca python r xgboost
Last synced: 20 Apr 2026
https://github.com/natnaelhhaile/text-similarity-analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 20 Apr 2026
https://github.com/jbalooshie/school_district_analysis
Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.
data-analysis data-science dataframes jupyter-notebook numpy pandas python
Last synced: 20 Apr 2026
https://github.com/anjaliwork20/moodify
Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists and playlists using Machine learning
artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs
Last synced: 20 Apr 2026
https://github.com/hugo-hattori/customer_profile_analysis
Data Analysis Project.
data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python plotly plotly-express plotly-io python
Last synced: 20 Apr 2026
https://github.com/sarthakmishraa/bike_rental_predictor
Bike Sharing Dataset: this dataset contains the hourly and daily counts of rental bikes between 2011 and 2012 in the Capital Bikeshare system, with the corresponding weather and seasonal information.
data-analysis machine-learning python xgboost
Last synced: 20 Apr 2026
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project creates a classification model that predicts the chances of a business facing bankruptcy based on key features such as Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, and Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 20 Apr 2026
https://github.com/ak-pydev/python_practice
Documenting my learning journey from python -> ML -> DL -> LLM/GenAI -> Agents exercises solved daily from Udemy/Kaggle/YouTube.
data-analysis data-science feature-engineering llms machine-learning mlflow mlops-workflow modeling python3 streamlit uvicorn
Last synced: 20 Apr 2026
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 20 Apr 2026
https://github.com/william-franco/fuzzy-logic
data-analysis data-science rust rust-application rust-lang terminal-app
Last synced: 04 Jun 2026
https://github.com/profasem/logistics-performance-analysis
Power BI dashboard analyzing logistics performance, delivery delays, carrier efficiency, and regional risk.
business-intelligence dashboard data-analysis logistics powerbi python supply-chain
Last synced: 21 Apr 2026
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 21 Apr 2026
https://github.com/rachel-xmr/data-analysis-in-health-set-csc3062
CSC3062 Data Analysis and visualization
classification-algorithm data-analysis data-visualization model-evaluation nmf pca python svm t-sne visualization
Last synced: 05 Jun 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/martinkalema/power-distribution-modelling
Power Distribution Modelling for cea and cel algorithms
data-analysis python synthetic-dataset
Last synced: 21 Apr 2026
https://github.com/rahulpatel0615/sales-analysis-project
Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.
data data-analysis data-visualization matplotlib pandas portfolio python
Last synced: 21 Apr 2026
https://github.com/anushkundu/churn-prediction
Telecom Customer Churn Prediction Using Machine Learning!
accuracy-score classification-algorithm classification-report data-analysis data-science deep-learning gradient-boosting-classifier keras-tensorflow logistic-regression machine-learning random-forest-classifier recall-precision roc-auc-score smote-sampling svm-classifier
Last synced: 21 Apr 2026
https://github.com/nikhilfuke1/a-b-testing-and-regression-analysis-python
Python Statistical Project involves data analysis, visualization, A/B testing, and regression analysis to determine the best-performing platform.
ab-testing data-analysis hypothesis-testing libraries python regression-analysis statistics visualization
Last synced: 21 Apr 2026
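The A/B testing mentioned in the entry above is often done with a two-proportion z-test. The entry does not say which test the project uses, so the following is a minimal sketch under that assumption, using only the Python standard library. The conversion counts and the function name `two_proportion_z_test` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Two-sided z-test for a difference between two conversion rates."""
    p_a, p_b = conv_a / n_a, conv_b / n_b
    # Pooled rate under the null hypothesis that both variants convert equally.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Hypothetical counts: variant A converts 200 of 2000 visitors, variant B 260 of 2000.
z, p_value = two_proportion_z_test(200, 2000, 260, 2000)
```

The normal approximation behind this test is reasonable when each variant has a fairly large number of conversions and non-conversions; for small samples an exact test is the safer choice.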
https://github.com/mhuwaimel/data-analysis-of-students-results-in-qiyas
Analysis of student performance data from Qiyas (قياس), the Saudi Arabian National Center for Assessment
data-analysis jupyter-notebook python
Last synced: 22 Apr 2026
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/robinmillford/optimizing-treatment-plans-through-data-analysis
The primary focus was on understanding customer health, treatment, and associated charges over multiple years.
data-analysis data-visualization healthcare mysql powerbi sql
Last synced: 22 Apr 2026
https://github.com/kgelli/apple-data-analysis---apache-spark
Modular ETL pipeline for analyzing Apple product purchase patterns using Apache Spark on Databricks with factory design patterns.
apache-spark data-analysis databricks delta-lake etl-pipeline factory-pattern pyspark
Last synced: 22 Apr 2026
https://github.com/thinogueiras/jornada-python
Jornada Python - Hashtag Programação.
data-analysis data-science inteligencia-artificial python rpa
Last synced: 22 Apr 2026
https://github.com/rajesh9943/sentiment-analysis-of-consumer-opinions-on-amazon-products
Developed a comprehensive Sentiment Analysis System aimed at classifying Amazon product reviews into positive, neutral, and negative sentiments. The project leveraged advanced Natural Language Processing (NLP) techniques alongside machine learning algorithms to deliver accurate and actionable insights from customer feedback.
amazon data-analysis data-manipulation data-preprocessing data-presentation data-visualization machine-learning nlp nlp-library nltk product-reviews-analysis sentiment-analysis sklearn-library word-cloud-generator-in-python-3
Last synced: 05 Jun 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-export-additional-captions-header-or-footer
This example illustrates how to add a custom header to the document exported to PDF in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 22 Apr 2026
https://github.com/ayushi-gajendra/buenos-aires-subway-statistics
A comprehensive data analysis of the Buenos Aires subway system ridership using Python and Pandas. This project identifies peak-hour congestion patterns, explores hourly passenger distributions, and utilizes the 95th percentile to isolate extreme traffic conditions for urban mobility insights.
95th-percentile buenos-aires data-analysis data-science-portfolio data-visualization matplotlib pandas python statistical-analysis subway-ridership transit-data urban-mobility
Last synced: 05 Jun 2026
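The 95th-percentile filter mentioned in the entry above can be illustrated as follows. This is a sketch with the standard library rather than the project's Pandas code, and the hourly counts are invented for the example; `extreme_hours` is an illustrative name.

```python
from statistics import quantiles

# Hypothetical passenger counts for one day (list index = hour of day).
hourly_counts = [
    120, 80, 60, 50, 90, 300, 900, 2100, 3400, 2600, 1500, 1300,
    1400, 1350, 1450, 1700, 2400, 3100, 3600, 2500, 1600, 1000, 600, 300,
]

def extreme_hours(counts, percentile=95):
    """Return the hours whose count is at or above the given percentile."""
    # quantiles(..., n=100) returns 99 cut points; cut point k-1 is the k-th percentile.
    cut_points = quantiles(counts, n=100, method="inclusive")
    threshold = cut_points[percentile - 1]
    return [hour for hour, count in enumerate(counts) if count >= threshold]

peak = extreme_hours(hourly_counts)
```

Using a percentile threshold instead of a fixed passenger count keeps the definition of "extreme" relative to the observed distribution, so the same code applies to quiet and busy lines alike.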
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/tranngoca5039/bigquery-a5y
📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.
analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql
Last synced: 05 Jun 2026