Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/odeyiany2/flit-apprenticeship-data-science-projects
This repo contains all my projects for my FLiT Apprenticeship
data-analysis data-science data-visualization machine-learning sql
Last synced: 17 May 2026
https://github.com/mgobeaalcoba/matplotlib_y_seaborn
Here I will post visualization work done with both Python libraries.
data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/ryanfranklin237/data-visualization-python
A tool that lets you visualize data from a CSV or Excel file as graphs or charts
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/parmeetbhamrah/air-quality-india-analysis
Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.
air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/happybono/sonatasmooth
Provides three different noise reduction algorithms for smoothing out data: Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from a list and displays the results in another list.
algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms
Last synced: 30 Oct 2025
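As a rough illustration of two of the smoothing techniques named in the entry above, the sketch below shows a rectangular (equal-weight) moving average and a binomial-weighted moving average. It is not taken from the sonatasmooth repository; the function names, the `radius` parameter, and the edge handling (truncated windows, renormalized weights) are assumptions made for this example. Binomial Median Filtering is not covered.

```python
from math import comb


def rectangular_average(values, radius):
    """Equal-weight moving average over a window of 2*radius + 1 points.

    Assumption of this sketch: windows are truncated at the edges.
    """
    n = len(values)
    out = []
    for i in range(n):
        lo = max(0, i - radius)
        hi = min(n, i + radius + 1)
        window = values[lo:hi]
        out.append(sum(window) / len(window))
    return out


def binomial_average(values, radius):
    """Moving average weighted by binomial coefficients (e.g. 1, 2, 1).

    Assumption of this sketch: weights that fall outside the data are
    dropped and the remaining weights are renormalized.
    """
    size = 2 * radius + 1
    weights = [comb(size - 1, k) for k in range(size)]
    n = len(values)
    out = []
    for i in range(n):
        total = 0.0
        weight_sum = 0
        for k, w in enumerate(weights):
            j = i - radius + k
            if 0 <= j < n:
                total += w * values[j]
                weight_sum += w
        out.append(total / weight_sum)
    return out


if __name__ == "__main__":
    noisy = [0, 0, 4, 0, 0]
    print(rectangular_average(noisy, 1))
    print(binomial_average(noisy, 1))
```

The binomial weights give the centre point more influence than its neighbours, so a spike is flattened less abruptly than with equal weights.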
https://github.com/ajimaulana123/e-commerce-data-analis
Analysis of an e-commerce dataset to determine which products are purchased most often by customers
Last synced: 28 Apr 2026
https://github.com/rafiulgits/data-analysis
Data analysis with the Python programming language
classification data-analysis data-mining data-visualization machine-learning mglearn regression regression-models sklearn
Last synced: 08 May 2026
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/derrickbaruga7/mapping-median-age-europe
An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.
data-analysis data-science data-visualization datascience european-union mapping r
Last synced: 25 Mar 2025
https://github.com/prernarohra/mental-health-prediction
This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.
data-analysis data-science data-visualization machine-learning mental-health python
Last synced: 20 May 2026
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 21 May 2026
https://github.com/gustavo-zamai/product_return_data_analysis
Analysis of product returns across different stores
data-analysis excel pandas plotly-express python3 pywin32
Last synced: 13 May 2026
https://github.com/gustavo-zamai/shop_data_analisys
Analysis of sales across different shopping malls
data-analysis openpyxl pandas python3 pywin32
Last synced: 01 Mar 2025
https://github.com/ivanildobarauna-dev/api-to-dataframe
Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 06 Mar 2025
https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis
End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.
data-analysis data-visualization powerbi sql sql-server
Last synced: 16 Mar 2025
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
A responsive Streamlit COVID-19 dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/anshmnsoni/pizza-market-analysis
data-analysis powerbi-visuals powerbidashboard
Last synced: 05 Feb 2026
https://github.com/jatin-mehra119/bike-rentals-dataset
This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques. Leveraging publicly available data from the UCI Machine Learning Repository, it includes scripts for data preprocessing, model training, and visualization, along with detailed observations and results.
data-analysis data-science ensemble-model pandas scikitlearn-machine-learning
Last synced: 15 Apr 2026
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 30 Oct 2025
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Surveying a physical space produces a point cloud. This point cloud describes selected points in the space, for example on the walls and the ceiling. When these points are captured in two separate measurements, possibly even with different devices, the question afterwards is how closely the two point clouds agree. There are two fundamentally different methods for this, and they are compared here.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 11 Apr 2026
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 10 Apr 2026
https://github.com/nouman6093/advanced-statistical-models
In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possible
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/vineet416/airbnb_data_analysis
Ineuron.ai Internship Project
data-analysis exploratory-data-analysis matplotlib numpy pandas-python python seaborn statistical-analysis statistics
Last synced: 10 Apr 2026
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/devexpress-examples/aspnet-pivot-grid-custom-aggregates
This example shows how to aggregate data by the field's first value.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 06 Jul 2025
https://github.com/greed2411/ndl
Numbers Don't Lie, attempt on Data Analysis using pandas and matplotlib.
cities data-analysis data-science data-visualization india kaggle
Last synced: 19 Apr 2026
https://github.com/manvendra747/customer-segmentation
Customer segmentation using Python and PowerBI
customer-segmentation dashboard data-analysis data-science data-visualization powerbi python rfm-analysis
Last synced: 28 Apr 2025
https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation
Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.
clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning
Last synced: 16 Mar 2025
https://github.com/nafisalawalidris/tools-for-data-science
It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.
arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development
Last synced: 11 Apr 2026
https://github.com/gf712/abpytools-qt
Qt interface of AbPyTools
antibody-numbering antibody-sequences cpp11 data-analysis python3 qt5
Last synced: 10 Apr 2026
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/kishlayjeet/zomato-data-exploration
In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.
data-analysis eda matplotlib numpy pandas zomato-data-exploration
Last synced: 10 Apr 2026
https://github.com/matthewgrosman/messenger-analytics
Project that ingests Facebook Messenger conversations and generates analytics.
analytics data-analysis excel facebook facebook-messenger java mongodb
Last synced: 15 Apr 2025
https://github.com/ssreeramj/youtube_channels_analysis
This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.
data-analysis heroku pandas python streamlit youtube
Last synced: 29 Apr 2026
https://github.com/5ekastanx/data-analysis
Extracting data by parsing (in a hacking-like fashion, for example) with Python, using a variety of functions and methods
Last synced: 14 Mar 2025
https://github.com/sathyasris27/statistical-analysis-on-rehoming-time-for-different-dog-breeds-in-animal-shelter
Given a collection of records documenting stray, unwanted, or neglected dogs sent to animal shelters to be rehomed, this project analyses their rehoming patterns by breed.
data-analysis r statistical-analysis statistical-inference statistical-models
Last synced: 05 Jun 2026
https://github.com/chigwell/rankextractplus
rankextractplus extracts and structures ranked info from text, organizing data for easier comparison and analysis.
consistent-output data-analysis data-comparison data-organization data-summarization formatted-output llm-processing llmatch-messages pattern-matching ranked-information retries structured-data system-prompt text-extraction unstructured-text
Last synced: 14 Jan 2026
https://github.com/chigwell/partnershipparser
partnershipparser extracts and structures key info from tech partnership news for easy analysis of companies, focus areas, and impacts
business-analysts collaborating-companies competitive-advantages consistency data-analysis data-integration focus-area goals investors market-opportunities potential-impact researchers standardized-output strategic-alliances strategic-partnerships structured-data tech-industry technology-sector text-extraction trends
Last synced: 14 Jan 2026
https://github.com/heiderjeffer/enhancing-digital-maturity-and-analytical-capabilities-of-smes
Research Proposals RP
analytics data-analysis data-driven digital framework jupyter modeling-and-simulation pyrhon quantative smes statistical-analysis stochastic-processes
Last synced: 02 Apr 2025
https://github.com/paezha/isdas
Companion package for An Introduction to Spatial Data Analysis and Statistics with R
data-analysis gis rstats spatial-analysis spatial-statistics
Last synced: 04 Jan 2026
https://github.com/jubinjacob03/heartdiseaseclassify-ml
Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.
data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine
Last synced: 18 Jan 2026
https://github.com/md-emon-hasan/1-simple-stock-price-ml-app
A simple machine learning application for stock prices, demonstrating data preprocessing, model training, and deployment using scikit-learn.
data-analysis data-science eda ml-app streamlit-webapp time-series time-series-analysis webapp
Last synced: 31 May 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/robinmillford/analytics_for_fashion_supply_management
This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors
dashboard data-analysis data-visualization streamlit supply-chain-management
Last synced: 07 Sep 2025
https://github.com/satvikvirmani/engineering-graduate-salary-analysis
A Data Science/Machine Learning project to analyse and study salary patterns of engineering graduates in India.
data-analysis data-science jupyter-notebook machine-learning prediction python regression
Last synced: 19 Jun 2026
https://github.com/albertomorini/policesviolence
Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.
data-analysis data-science data-visualization r
Last synced: 31 May 2026
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/programmer-rd-ai/dimensionality-reduction
DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.
analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost
Last synced: 01 Mar 2025
https://github.com/as16082023/restaurant-order-analysis
Analyzing order data to identify the most and least popular menu items and types of cuisine
data-analysis maven-analytics mysql restaurant-order sql
Last synced: 10 Apr 2025
https://github.com/as16082023/music-store-analysis
This project involves analyzing music store data using SQL queries in MySQL workbench to enhance decision-making, identify trends, and understand customer behavior
data-analysis music-store-analysis mysql sql
Last synced: 06 Jul 2025
https://github.com/as16082023/hotel-booking-analysis-eda-
Exploratory Data Analysis on hotel booking data using Python
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/sarathchandranpm/restaurant_order_analysis
This project entails an in-depth analysis of a restaurant's order and menu data. The focus is on exploring customer ordering behaviors, menu item attributes, and order specifics. By investigating the connections between order details, menu items, and order dates, the project seeks to generate valuable insights into the restaurant's operations.
Last synced: 10 Apr 2025
https://github.com/sauravbhattacharya001/biobots
A simple tool for checking 3D printer run statistics.
3d-bioprinting aspnet-web-api biobot bioinformatics bioprinting csharp data-analysis dotnet rest-api tissue-engineering
Last synced: 19 May 2026
https://github.com/nafisalawalidris/hotel-reservation-analysis
This project analyses hotel reservation data from Resort Hotels and City Hotels to uncover booking trends and insights. Utilising Microsoft Excel for initial data cleaning, PostgreSQL for data analysis and Tableau for creating visualisations, the project aims to deliver a comprehensive dashboard that highlights key metrics such as booking status.
data-analysis data-cleaning data-visualisation hotel-reservations microsoft-excel postgresql sql tableau tableau-dashboards tableau-desktop tableau-public
Last synced: 06 Jul 2025
https://github.com/joannescode/data-series-with-kaggle
Repository of hands-on notebooks on dataset cleaning and analysis
data-analysis matplotlib pandas python
Last synced: 13 Mar 2025
https://github.com/dataideaorg/dataidea-science-backup
The Complete Programming For Data Science Course
data-analysis data-science data-visualization deep-learning machine-learning nbdev programming python
Last synced: 10 Jan 2026
https://github.com/sijuswamy/data-analytics-using-r
Course Repository for Data Analysis using R- Add-on course
Last synced: 12 Apr 2025
https://github.com/jerela/mola
A Python library for matrix algebra
data-analysis linear-algebra-library matrix-algebra python
Last synced: 14 Jan 2026
https://github.com/gesiscss/wikipedia-language-olga-master
Measuring Gender Inequalities of German Professions on Wikipedia
bias crowdflower data-analysis data-science gender images python statistics wikipedia
Last synced: 10 Apr 2026
https://github.com/jcaperella29/stock_evaluation_python
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
ai-in-finance artificial-intelligence classification csv-processing data-analysis expert-system finance financial-analysis financial-analysis-tools piotroski-f-score python quantitative-analysis rule-based-classifier stock-analysis stock-valuation
Last synced: 07 Sep 2025
https://github.com/hifza-khalid/book-management-system-sql
A Book Management System SQL project 📚 featuring tables for Authors ✍️, Books 📖, Customers 👤, and Orders 🛒. Includes sample queries for tracking book sales 💰, pricing by genre 🎭, and customer order history 📅.
book-management data-analysis database-management sql sql-queries
Last synced: 03 Feb 2026
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 13 Apr 2026
https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app
Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.
aws-ec2 data-analysis plotly preprocessing streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/rara-ch/data-analysis-portfolio
This repository stores my data analytics projects, showcasing my skills in SQL and Python.
data-analysis mathematics matplotlib numpy pandas portfolio probability python seaborn sql statistics
Last synced: 12 Mar 2025
https://github.com/tks18/pyquery
PyQuery is a local-first data operating system built on lazy execution that processes 100GB+ files while you doomscroll. No cap. 🧢
data-analysis data-science etl hdfs parquet pipeline polars python
Last synced: 14 Jan 2026
https://github.com/celineboutinon/bottleneck
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 5
data-analysis data-analytics data-visualisation dataframes market-intelligence marketing-analytics matplotlib-pyplot missingno numpy pandas python seaborn
Last synced: 07 Sep 2025
https://github.com/antonijn/polyfit
Fits a polygon to a given data input
c data-analysis linear-algebra toy
Last synced: 16 Jul 2025
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/erickkhosasi/thelook-data_analysis
Final project for my SQL mini bootcamp. This project explores an e-commerce dataset to uncover key business insights. Data insights were queried in Google BigQuery and visualized with Google Sheets.
bigquery data-analysis e-commerce sql
Last synced: 05 Oct 2025
https://github.com/komailmk/instagram-reach-forecasting
This repository provides a Python-based solution for forecasting Instagram reach using historical data and SARIMA modeling techniques.
data-analysis data-visualizations machine-learning
Last synced: 05 Oct 2025
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains visualization projects built with Power BI. The visualizations yield insights and strategies that help grow a business and raise profit margins, and that help reduce damage from accidents and calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 21 Jan 2026
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 18 Mar 2025
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 06 Oct 2025
https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection
The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.
banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier
Last synced: 06 Oct 2025
https://github.com/an1mch1k-theone/project_2_hh_analyze
Analysis of job vacancies from HeadHunter
data-analysis data-analysis-project postgresql python sql
Last synced: 14 Apr 2026
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/siddharthbadal/sql-case-studies-data-analysis
Data analysis case studies on various databases using SQL, demonstrating proficiency in solving diverse business problems. Projects cover sales, orders, products, finance, healthcare and other sectors, and highlight my ability to analyze complex datasets through SQL queries, data manipulation, and visualization techniques.
data-analysis sql sql-query sql-server sqlserver
Last synced: 08 Jan 2026
https://github.com/raccoon-hero/gender-equality-tracker
A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.
chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api
Last synced: 07 May 2026
https://github.com/aritrakar/statpy
A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.
data-analysis python statistics
Last synced: 24 Oct 2025
https://github.com/ibnaleem/cyberchef-discord
A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server
compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam
Last synced: 28 Jan 2026
https://github.com/ahammadshawki8/playing-with-pandas
🐼 Pandas is one of my favourite libraries in Python. It is well known for "analyzing" data. Learn the basics of Pandas and beyond from this repository. 🤍🖤
beginner-friendly data-analysis favourite-library pandas python
Last synced: 17 Apr 2026
https://github.com/carolinedotxyz/dp_sgd_classification
A hands-on educational walkthrough of training a CelebA (Eyeglasses) image classifier with Differentially Private SGD using PyTorch and Opacus. The focus of this repo is on clarity and reproducibility through balanced subsets, deterministic preprocessing, and side-by-side baseline vs. DP training, while acknowledging real trade-offs.
celeba-dataset classification data-analysis dp-sgd machine-learning opacus python pytorch
Last synced: 16 May 2026
https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel
This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.
capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql
Last synced: 20 Jun 2025
https://github.com/sunnybibyan/exploratory-data-analysis-eda
Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.
data-analysis data-visualization jupyter-notebook python titanic-dataset
Last synced: 18 Jan 2026
https://github.com/targetta/ankaflow
YAML-based data pipeline framework that runs both locally and fully in-browser designed for data engineers, ML teams, and SaaS developers who need flexible, SQL-powered pipelines.
bigquery clickhouse data-analysis dataops deltalake duckdb elt-pipeline etl etl-automation motherduck parquet python sql
Last synced: 09 Oct 2025
https://github.com/jabhij/eda_experiments
In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.
ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis
Last synced: 14 Apr 2026
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 09 Oct 2025
https://github.com/madhuresh2011/career-aspiration-of-gen-z-project-using-excel
Career aspirations of Gen Z: exploring industries, roles, and pathways using Excel.
dashboards data-analysis data-visualization datacleaning dataset designing excel functional-dashboard gen-z kpi pivot-charts pivot-tables project-using-excel
Last synced: 27 Jan 2026
https://github.com/gustavo-zamai/shop-sell-data-analisys
Analysis of sales
data-analysis pandas plotly python3
Last synced: 16 May 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
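The two overlap measures named in the entry above (exact-IP overlap and shared /24 prefixes) can be sketched with Python's standard `ipaddress` module. This is only an illustration under stated assumptions, not the repository's method: the function names and the sample addresses (drawn from the reserved documentation ranges) are hypothetical, and only IPv4 is considered.

```python
import ipaddress


def prefixes_24(ips):
    """Map each IPv4 address to its enclosing /24 network."""
    return {ipaddress.ip_network(f"{ip}/24", strict=False) for ip in ips}


def provider_overlap(a_ips, b_ips):
    """Return (exact shared IPs, shared /24 prefixes) as sorted string lists."""
    a, b = set(a_ips), set(b_ips)
    exact = sorted(a & b)
    shared = sorted(str(net) for net in prefixes_24(a) & prefixes_24(b))
    return exact, shared


if __name__ == "__main__":
    # Hypothetical sample data using documentation-only address ranges.
    provider_a = ["192.0.2.10", "198.51.100.7"]
    provider_b = ["192.0.2.10", "192.0.2.200", "203.0.113.5"]
    print(provider_overlap(provider_a, provider_b))
```

Two providers can share a /24 prefix without sharing any single address, which is why the two measures are reported separately.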
https://github.com/datasciencelovers/ai-financial-market-data-analysis
Analyse Financial Market Data of AI companies with Python
ai artificial-intelligence big-data-analytics chatgpt data-analysis data-analytics data-science data-visualization financial-analysis gemini google llama machine-learning market-data-analysis matplotlib-python meta openai pandas-python python
Last synced: 05 May 2026
https://github.com/lostvikx/fintech-pg
Files related to my FinTech course
analytics data-analysis data-science data-visualization finance fundamental-analysis numpy pandas python technical-analysis
Last synced: 14 Apr 2026
https://github.com/shuklayash02/data_analysis_using_r
COVID-19 data cleaning and analysis, covering age at death and deaths by gender
analysis cleaning-data data-analysis data-visualization rprogramming
Last synced: 09 Oct 2025
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/elkronos/stat_py
Statistics functions for Python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 10 Oct 2025