Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/andrii04/ga4-gcs-to-bigquery-etl
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 18 May 2026
https://github.com/okwilkins/retailanalysis
A comprehensive exploratory analysis and implementation of k-means and hierarchical clustering on online retail data.
data-analysis data-science machine-learning statistics
Last synced: 18 Oct 2025
https://github.com/gattupalli-saketh/sentiment-analysis-on-products-
Product reviews sentiment analysis.
data-analysis machine-learning nlp review-analysis sentiment-analysis sentiment-classification
Last synced: 18 Apr 2026
https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification
This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.
classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn
Last synced: 06 May 2026
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 02 May 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/nevermendel/revolut-analysis
Python script to analyse Revolut transactions
data-analysis revolut revolut-analysis
Last synced: 12 Apr 2025
https://github.com/asghar-rizvi/youtube-statistics-project
This project analyzes a dataset of global YouTube statistics to uncover insights about YouTube channels, their ranks, and other attributes. The dataset used for this analysis was obtained from Kaggle.
data-analysis data-analysis-python data-science data-science-projects matplotlib numpy pandas pycharm-ide python seaborn
Last synced: 13 Jun 2026
https://github.com/laudebugs/fec-data-analysis-2020
The project aimed to determine the total sum of contributions to the candidate committees as well as the number of contributions made by individuals.
data-analysis fec presidential-candidates
Last synced: 16 May 2026
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/itrauco/data-dirtying-tool
A simple command-line tool to generate dirty data and perform common data tasks in Google Cloud.
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/dsrodrigovieira/favoritasales
This repository contains the project developed for the Kaggle challenge "Store Sales - Time Series Forecasting. Use machine learning to predict grocery sales".
data-analysis data-science kaggle-competition machine-learning python telegram-bot xgboost-regression
Last synced: 05 May 2026
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modeling of the Conquest game mode from Battlefield.
agent-based-simulation data-analysis jade java sajas swing
Last synced: 24 Jan 2026
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations such as scatter plots and bar charts, and identifies the top- and bottom-performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 06 May 2026
https://github.com/kath92/my_data_projects
My data projects.
data-analysis data-vizualisation nlp-machine-learning poewrbi python tableau
Last synced: 23 Mar 2025
https://github.com/grlyntng/rpims
Django Code and documentation for the Retail Pharmacy Inventory Management System (best final year project award)
data-analysis django erp forecasting-models lstm-neural-networks reporting
Last synced: 26 May 2026
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
A project in which I walk through the stages of the data science life cycle in order to predict customer churn for a bank's credit card service.
data-analysis data-science data-visualization machine-learning python
Last synced: 06 May 2026
https://github.com/elakkiya-u/digital-marketing-campaign
A machine learning project to predict whether a customer will convert based on digital marketing campaign data.
campaigns data-analysis deployment digital-marketing machine-learning predictive-modeling python
Last synced: 30 Jun 2025
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
An Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance.
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization
This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by Excel, interactive Tableau dashboards, and SQL analysis.
data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards
Last synced: 23 Mar 2025
https://github.com/shimaa83/eda_v2
Automatic EDA library
data-analysis data-science python
Last synced: 20 Apr 2026
https://github.com/misszeferino/erp-data-analysis
Data Analysis - ERP Data (merge and outliers)
data-analysis data-visualization matplotlib merge numpy outlier-detection python scipy
Last synced: 11 Apr 2026
https://github.com/dug22/jjournal
Jupyter-like notebook software for Java.
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/bala-1409/power-bi-visualization-project
This repository contains visualization projects built with Power BI. The visualizations yield insights and strategies that can help a business grow toward higher profit margins and reduce damage from accidents and calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization
Last synced: 04 Jan 2026
https://github.com/samanhur/data_visualization_pcc
First experiences in data visualization with Python.
data-analysis data-science data-visualization python3
Last synced: 23 Mar 2025
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/abhay-sinha-0/carpricepredictionproject
A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.
artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library
Last synced: 15 May 2025
https://github.com/danpoynor/python-number-guessing-game-with-stats
A number guessing game written in Python 3 that presents median, mode, and mean statistics
console-game data-analysis number-guessing-game python3 statistics
Last synced: 26 May 2026
https://github.com/regmibijay/opencarp-analyzer
Reads Trace Files created by OpenCARP Models and exports data for easy plotting with inbuilt plotter script.
bioinformatics data-analysis opencarp
Last synced: 16 Jan 2026
https://github.com/rodrigojunqueiradev/data-exploration-and-cleaning
Credit Analysis Data: Foundations for Cleaning and Exploration
data-analysis data-engineering data-science data-visualization datascience matplotlib matplotlib-pyplot numpy pandas python python-3 python3
Last synced: 13 Apr 2026
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of multivariate data analysis, first year of the master's program, FDT ITMO.
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 26 May 2026
https://github.com/amanyadav-07/customer-churn-prediction
Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.
accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/shreyaamenon/data-analysis-aiml-mini-projects
Mini projects to help me grow my skills in data analysis, artificial intelligence, and machine learning.
ai data-analysis jupyter-notebook machine-learning python
Last synced: 11 Apr 2026
https://github.com/mudassir-a/vendor-performance-analysis
Vendor performance data analysis project using SQL, Python, and Power BI.
data-analysis powerbi python sql
Last synced: 18 May 2026
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Uses the PySpark module in Python within a Google Colab environment to run queries against the dataset. The dataset consists of two CSV files, listening.csv and genre.csv. Query results are visualized with Matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil
This exploratory data analysis (EDA) of a student performance dataset aims to identify and understand the factors that influence students' academic performance. Through data cleaning, transformation, and visualization, it seeks to uncover meaningful patterns and relationships.
data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn
Last synced: 30 Mar 2025
https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino
Contains a detailed analysis of wine quality using a classification model based on decision trees. It includes data exploration, outlier detection and handling, univariate and bivariate analysis, and the building and evaluation of a predictive model. The main goal is to predict wine quality.
data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision
Last synced: 20 May 2026
https://github.com/gaaniruddha/mphil
This repository contains a copy of my final MPhil presentation and panel report.
data-analysis gpu-imager radio-astronomy
Last synced: 03 Mar 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/jedrzej-wydra/data-analysis-associate
Associate Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/benami171/ml_knn_decision-trees
A machine learning implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/mahmoudwal27/powerbi-projects-for-data-analysis
This project leverages Power BI for data visualization, DAX for custom calculations, and integrates SQL and Excel for data preprocessing, analysis, and reporting, enabling dynamic and interactive insights.
data-analysis data-analysis-project data-analytics-project project
Last synced: 07 Mar 2026
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/andrii-zapukhlyi/otomoto_visualization
Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl
data-analysis machine-learning r scraping statistics visualization
Last synced: 26 Jul 2025
https://github.com/busradeveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 15 Apr 2025
https://github.com/sun-lab-nbb/sl-shared-assets
A Python library that stores assets shared between multiple Sun (NeuroAI) lab data acquisition and processing repositories.
data-analysis data-collection data-processing experiment sunlab
Last synced: 10 Mar 2026
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict cryptocurrency's impact using web scraping, data exploration, EDA, and data visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 15 Apr 2025
https://github.com/chokzb/covid19_vaccination_analysis
An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination
Last synced: 07 May 2026
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 02 Jan 2026
https://github.com/lvmalware/lsm-module
A simple statistics module that provides four basic types of regression using the Least Squares Method (LSM).
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches
A data-driven exploration of international men's and women's football (soccer) match results using Python.
data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization
Last synced: 05 May 2026
https://github.com/arsalan-dev-engineer/ai-repository
A repository that contains AI-related projects, notes, practice files, and documentation.
ai algorith beginner-friendly data-analysis data-preprocessing developer jupyterlab matplotlib matplotlib-pyplot natural-language-processing numpy pandas python unsupervised-learning visualization
Last synced: 12 Apr 2026
https://github.com/masum184e/exploratory_data_analysis_projects
This is a space to showcase my journey in exploring various datasets, uncovering patterns, and extracting meaningful insights. Each project highlights different aspects of EDA, demonstrating techniques and tools that are essential for making sense of data.
data-analysis data-analysis-projects data-science data-science-projects eda eda-projects exploratory-data-analysis exploratory-data-analysis-projects
Last synced: 31 Mar 2025
https://github.com/kislerdm/github-repo-details
Application and library to fetch details of open source libraries from GitHub to perform due diligence.
data-analysis golang opensource
Last synced: 01 Jul 2025
https://github.com/matheusafonseca/c111
This repository is dedicated to storing and organizing the code developed in the course C111 - Análise de Dados (Data Analysis), offered by the Instituto Nacional de Telecomunicações (INATEL).
data-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/totonga/ods-exd-api-box
Helper package to build ASAM ODS EXD API gRPC plugins.
asam data-analysis grpc grpc-server ods plugin python
Last synced: 03 Feb 2026
https://github.com/jiyanshgarg/delhivery-logistics-data-analysis
This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.
business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment
Last synced: 12 Jun 2026
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/rodrigojunqueiradev/2025-python-data-analysis-and-visualization-masterclass
2024 Python Data Analysis & Visualization Masterclass
data-analysis data-science data-structures data-visualization pandas python python-3 python3 seaborn
Last synced: 10 May 2026
https://github.com/mehassanhmood/storyboard
data-analysis framer-motion javascript react scrollytelling storyboard tailwind-css vite
Last synced: 21 Feb 2026
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone project by skill Academy: exploratory analysis, visualization, and prediction of used car prices. Deploys the highest-scoring model with a Streamlit web app.
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/thenorthkun/movies-dataset-analysis
Analysis and categorization of movies based on actors, genres, gross, etc. 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 23 Mar 2025
https://github.com/iwasakiyuuki/data-analysis-platform-etl
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 01 Jul 2025
https://github.com/anurag-kumar-molankala/sales-performance-dashboard
A Power BI dashboard that analyzes sales trends, product performance, customer segmentation, and payment distribution. It uses DAX, time intelligence, and interactive visuals for data-driven insights. The model includes Sales, Product, and Customer tables for in-depth analysis.
dashboards data-analysis data-visualization dax dax-functions dax-measures dax-query etl-process powerbi powerbi-visuals powerquery sql-query sql-server
Last synced: 03 Apr 2025
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A predictive analysis project to predict the success rate of on-time premium deposits by policyholders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 16 May 2026
https://github.com/nimomach/skateboarding-in-olympics
Skateboarding made its Olympic debut at the 2020 Summer Olympics. This dashboard, focused on "Skateboarding in the Olympics", presents a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.
data-analysis data-visualization olympics paris skateboarding tokyo
Last synced: 10 Mar 2026
https://github.com/shru924/ecommerce_customer_behavior_analysis
A machine learning project that analyzes and segments e-commerce customers based on behavior patterns using Python, Random Forest, and data visualization.
customer-segmentation data-analysis jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/rosanafss/r-journey
Diving into the wonderful sea of DATA
Last synced: 19 Nov 2025
https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office
My Project to learn the Basics of Analysis & Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python
Last synced: 11 Apr 2026
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/anderson-andre-p/exploratory-data-analysis.roller-coaster
This repository contains an exploratory data analysis (EDA) project focused on roller coasters. The project involved organizing, cleaning, and visualizing the data to gain insights into roller coasters' characteristics and performance.
data-analysis eda exploratory-data-analysis exploratory-data-visualizations notebook
Last synced: 15 Mar 2025
https://github.com/aya-jafar/python
Practice files & exercises during the journey of Python learning 🐍
Last synced: 16 May 2025
https://github.com/jimohola/azure-flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 09 Mar 2025
https://github.com/mylena13s/electric-vehicle-data-analysis
data-analysis data-science programming pyspark-notebook
Last synced: 15 Mar 2025
https://github.com/hfzdzakii/dicoding-solvinghrproblem
This repo is the master submission for my Dicoding Final Project. The Employee Attrition & Performance Dataset was used for the submission. Feel free to explore; I hope my work gives you some insight!
data-analysis data-visualization
Last synced: 16 May 2025
https://github.com/aicorsair/python-case-study-365-data-science-subscription-purchase-prediction
This repository contains a comprehensive case study on predicting 365 Data Science customer subscriptions using real-world student engagement data.
data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization decision-tree feature-engineering feature-selection hyperparameter-optimization hyperparameter-tuning k-nearest-neighbors logistic-regression machine-learning purchase-prediction python random-forest scikit-learn statsmodels svc
Last synced: 08 May 2026
https://github.com/petrosdemetrakopoulos/caraccidentsvrilissia
A data project for car accidents in my neighbourhood
article data-analysis data-science data-visualization machine-learning predictive-modeling
Last synced: 23 Mar 2025
https://github.com/praveingk/lipidanalysis
data-analysis data-visualisation
Last synced: 17 Mar 2025
https://github.com/iliyasalve/tiktok_claim_classification_model
Develop a predictive model for classifying videos with claims to reduce the backlog of user reports and optimize the content moderation process.
data-analysis machine-learning python regression-models tiktok
Last synced: 21 May 2026
https://github.com/firdevstorlak/maritime-signature-lab
Prototype of a maritime signature database (acoustic, magnetic, RCS, IR) built with Python, SQLite, and simple computer vision.
acoustic-signatures cli-tool computer-vision data-analysis demo-project engineering-prototype infrared-imaging maritime opencv python radar rcs relational-database scientific-computing signal-processing sqlite synthetic-data
Last synced: 07 May 2026
https://github.com/filip-kustura/statistics-olympics-analysis
A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.
correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project
Last synced: 05 Jan 2026
https://github.com/jesuserro/ab-testing-ui-redesign-vanguard
A/B testing analysis to evaluate the impact of a user interface redesign at Vanguard.
a-b-testing data-analysis eda exploratory-data-analysis testing ui-design ux-design
Last synced: 08 Jul 2025
https://github.com/hemangsharma/dataanalysis
This repo contains analyses of NASDAQ data, including a dashboard and a time series forecast.
analysis data data-analysis data-visualization python
Last synced: 10 Mar 2026
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 10 Mar 2026