Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
- JSON Representation
https://github.com/rita94105/ethereum-fraud-detection
This project focuses on detecting fraudulent transactions in the Ethereum network using both traditional machine learning models and deep learning techniques. By analyzing transaction attributes and interaction patterns, we aim to develop an effective fraud detection model.
data-analysis deep-learning ethereum fraud-detection machine-learning
Last synced: 01 May 2026
https://github.com/jaseel342/pizza_sales_report
This Pizza Sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, top-selling, and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/blakeziegler/binary-classification-competition
Binary Classification of Insurance Crosselling Kaggle Competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 17 Nov 2025
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 03 Mar 2025
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/rohithsaji97/toll_gate
This is a electronic toll collection system.
data-analysis digital-image-processing ocr-text-reader opencv python3 trained-models
Last synced: 29 Apr 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/jedrzej-wydra/data-analysis-pro
Professional Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/jedrzej-wydra/data-analysis-associate
Associate Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/javedali99/machine-learning-hw-solution-notebooks
Machine Learning Homework Solution Notebooks (UCF CAP5610)
data-analysis data-preprocessing data-science decision-trees machine-learning python random-forest recommender-system supervised-learning support-vector-machines titanic-kaggle unsupervised-learning
Last synced: 05 Jan 2026
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/yash1882/music-store-data-analysis
A project focuses on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 28 Jan 2026
https://github.com/javedali99/geospatial-and-earth-science-data
A comprehensive collection of global earth science and geospatial datasets 🌍
data-analysis dataset earth-observations earth-science earth-sciences earthscience geography geospatial geospatial-analysis geospatial-analytics geospatial-data open-datasets satellite-data
Last synced: 05 Jan 2026
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 02 May 2026
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/khushi-sabarad/adinsights_dashboard
AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights
ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application
Last synced: 01 May 2026
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/apfirebolt/numpy-and-pandas-examples
Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python
data-analysis jupyter-notebook numpy pandas python
Last synced: 17 Apr 2026
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/kingflow-23/ai-related-article-detector
Create a simple system that determines whether an article is related to AI or not using web scraping, text representation, and a classifier.
data-analysis data-engineering data-science logistic-regression pca-analysis scraping selenium umap
Last synced: 04 May 2026
https://github.com/gaaniruddha/mphil
This repository contains a copy of my final MPhil presentation and panel report.
data-analysis gpu-imager radio-astronomy
Last synced: 03 Mar 2026
https://github.com/benami171/ml_knn_decision-trees
A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 16 May 2025
https://github.com/mahmoudwal27/powerbi-projects-for-data-analysis
This project leverages Power BI for data visualization, DAX for custom calculations, and integrates SQL and Excel for data preprocessing, analysis, and reporting, enabling dynamic and interactive insights.
data-analysis data-analysis-project data-analytics-project project
Last synced: 07 Mar 2026
https://github.com/ayushsiloiya619/spotify-song-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 08 May 2026
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/dpbm/diabetes-analysis
simple diabete analysis with python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/andrii-zapukhlyi/otomoto_visualization
Scraping, data visualization, and building a price prediction model with data from the car classifieds website otomoto.pl
data-analysis machine-learning r scraping statistics visualization
Last synced: 26 Jul 2025
https://github.com/busradeveci/kaggle-iris_data_analysis
Exploratory data analysis and visualization of the Iris dataset using Python.
data-analysis iris-dataset kaggle pandas python seaborn visualization
Last synced: 15 Apr 2025
https://github.com/sun-lab-nbb/sl-shared-assets
A Python library that stores assets shared between multiple Sun (NeuroAI) lab data acquisition and processing repositories.
data-analysis data-collection data-processing experiment sunlab
Last synced: 10 Mar 2026
https://github.com/malucor/livros
Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 16 May 2026
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 15 Apr 2025
https://github.com/chitranjan806/greyatom_learning_repo
A Collection of Projects, Tasks and Challenges as part of Data Science Masters - Transition Program at GreyAtom.
data-analysis data-science greyatom python3
Last synced: 29 Aug 2025
https://github.com/andersoncrs/arboles_de_decision_calidad_del_vino
Contiene un análisis detallado de la calidad del vino utilizando un modelo de clasificación basado en árboles de decisión. Incluye la exploración de datos, detección y manejo de valores atípicos, análisis Univariado y Bivariado, y la creación y evaluación de un modelo predictivo. El objetivo principal es predecir la calidad del vino.
data-analysis data-science data-visualization machine-learning matplotlib seaborn sklearn tree-decision
Last synced: 20 May 2026
https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil
Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.
data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn
Last synced: 30 Mar 2025
https://github.com/yaser-123/energy-consumption-dashboard
A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.
dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi
Last synced: 05 Jan 2026
https://github.com/prathmesh2507/global-stock-intelligence-dashboard
Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.
dashboard data-analysis data-visualization python stock-analysis streamlit
Last synced: 15 Jun 2026
https://github.com/chokzb/covid19_vaccination_analysis
An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination
Last synced: 07 May 2026
https://github.com/akansharajput280799/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 19 May 2026
https://github.com/27ahmad/heart-disease-diagnostic-eda
This project conducts Exploratory Data Analysis on a dataset related to heart diagnostic disease, aiming to derive valuable insights from the analysis.
data-analysis data-visualization pandas python
Last synced: 06 May 2026
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 02 Jan 2026
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 15 May 2025
https://github.com/danhnnguyen0606/bitcoin-navigator
Bitcoin Navigator: A data-driven dashboard designed to analyze Bitcoin trends, empowering investors to refine their strategies and identify optimal investment opportunities.
bitcoin btc crypto cryptocurrency data-analysis data-analytics data-science data-visualization investment looker looker-studio
Last synced: 15 Mar 2025
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/lvmalware/lsm-module
A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/mizzy/tweetduck
Twitter Archive to DuckDB Importer - Extract and import Twitter archive data (2025 format) into DuckDB for analysis
archive cli data-analysis duckdb golang twitter
Last synced: 02 Sep 2025
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 27 May 2026
https://github.com/ianfelps/jornada_python
Projetos realizados durante a Jornada Python da Hashtag Treinamentos em maio de 2024.
artificial-intelligence automation data-analysis python
Last synced: 28 Apr 2026
https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches
a data-driven exploration of international men's and women's football (soccer) match results using Python
data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization
Last synced: 05 May 2026
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Working with pyspark module in python and using google colab environment in order to apply some queries to the dataset. The dataset consist of two csv files listening.csv and genre.csv. Also, visualizing query results using matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/arsalan-dev-engineer/ai-repository
A repository that contains AI related projects, notes, practice files and documentations.
ai algorith beginner-friendly data-analysis data-preprocessing developer jupyterlab matplotlib matplotlib-pyplot natural-language-processing numpy pandas python unsupervised-learning visualization
Last synced: 12 Apr 2026
https://github.com/dacrol/filterdataset
Filters a dataset based on attributes
data-analysis dataset deep-learning machine-learning python python3
Last synced: 25 Jul 2025
https://github.com/badranalyst/student-tests-data-analysis-application
Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.
data-analysis data-visualization dataset matplotlib numpy pandas python sklearn
Last synced: 05 May 2026
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/masum184e/exploratory_data_analysis_projects
This space to showcase my journey in exploring various datasets, uncovering patterns, and extracting meaningful insights. Each project highlights different aspects of EDA, demonstrating techniques and tools that are essential for making sense of data.
data-analysis data-analysis-projects data-science data-science-projects eda eda-projects exploratory-data-analysis exploratory-data-analysis-projects
Last synced: 31 Mar 2025
https://github.com/sahilmb/social-media-data-analysis
A social media platform chat analysis system built using python for root level analysis of huge blocks of text
analytics data-analysis python3 streamlit
Last synced: 31 Mar 2025
https://github.com/kislerdm/github-repo-details
Application and library to fetch open source libraries details from github to perform due diligence
data-analysis golang opensource
Last synced: 01 Jul 2025
https://github.com/docuvesta/youtube-api-fragrance-channel-analytics
Engagement metrics analysis of perfume Youtube channel using Youtube API 🎀
analysis beauty-products comments data-analysis data-analysis-python engagement-metrics insights jupyter-notebook likes-count marketing marketing-analytics perfume python views-count youtube youtube-api youtube-api-v3
Last synced: 03 May 2026
https://github.com/mudassir-a/vendor-performance-analysis
vendor performance data analysis project using sql, python and power bi
data-analysis powerbi python sql
Last synced: 18 May 2026
https://github.com/shreyaamenon/data-analysis-aiml-mini-projects
mini projects to help me grow skills in data analysis, artificial intelligence and machine learning.
ai data-analysis jupyter-notebook machine-learning python
Last synced: 11 Apr 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/jfaccioli/citi-bike-tableau
A data analysis of Citi Bike users in Jersey City using Tableau
data-analysis data-visualization tableau tableau-public
Last synced: 26 Jan 2026
https://github.com/amanyadav-07/customer-churn-prediction
Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.
accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/matheusafonseca/c111
Este repositório é dedicado ao armazenamento e organização dos códigos desenvolvidos na disciplina C111 - Análise de Dados, oferecida pelo Instituto Nacional de Telecomunicações (INATEL).
data-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/totonga/ods-exd-api-box
Helper package to build ASAM ODS EXD API grpc plugins.
asam data-analysis grpc grpc-server ods plugin python
Last synced: 03 Feb 2026
https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making
Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.
artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification
Last synced: 20 Jun 2026
https://github.com/jiyanshgarg/delhivery-logistics-data-analysis
This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.
business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment
Last synced: 12 Jun 2026
https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi
Atividades do curso Análise de Dados com Microsoft Power BI e Clínica de BI da Data Science Academy.
dashboards data-analysis data-visualization microsoft-power-bi power-bi
Last synced: 05 Jan 2026
https://github.com/richardwarepam16/learning-statistics-using-python-for-data-analysis
Data Analyst: Learning Statistics using Python (Descriptive Stats, Inferential Stats, ML)
data-analysis data-analysis-python data-science descriptive-statistics inferential-statistics statistical-learning statistics
Last synced: 15 Mar 2025
https://github.com/farhannirzhor/vrinda_store_excel_project
This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/rodrigojunqueiradev/2025-python-data-analysis-and-visualization-masterclass
2024 Python Data Analysis & Visualization Masterclass
data-analysis data-science data-structures data-visualization pandas python python-3 python3 seaborn
Last synced: 10 May 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 26 May 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the nanodegree udacity program. it's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/vyjayanthipolapragada/marketing_statistical_analysis
Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns
customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/manumoolimani/data-analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 21 Feb 2026
https://github.com/mehassanhmood/storyboard
data-analysis framer-motion javascript react scrollytelling storyboard tailwind-css vite
Last synced: 21 Feb 2026
https://github.com/tolumie/exploratory-data-analytics-projects
Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.
data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/vanshuchaudhary/retail-sale
project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/jofaval/game-of-thrones
Data Analysis and Predictions of the Game of Thrones' character's survivance from 2016
classification data-analysis data-science data-visualization deep-learning game-of-thrones google-colab kaggle keras machine-learning matplotlib python scikit-learn seaborn tensorflow xgboost
Last synced: 11 Apr 2026
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/delabrov/jwstoolkit
A python package for handling JWST observations
astronomy-astrophysics data-analysis data-cube data-visualization imagery jwst python3 spectroscopic-data
Last synced: 26 May 2026
https://github.com/thenorthkun/movies-dataset-analysis
Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 23 Mar 2025
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of mutivatiate data analysis, 1st year of masters, FDT ITMO
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/27ahmad/netflix_sql_project
The Netflix SQL Project analyzes the Netflix dataset using SQL queries to gain insights into its content, identify trends, and address business problems related to movies and TV shows.
data-analysis postgresql-database sql
Last synced: 03 Feb 2026
https://github.com/iwasakiyuuki/data-analysis-platform-etl
A collection of Airflow DAGs for automating data collection into our on-premises data analysis platform.
airflow airflow-dags data-analysis data-collection
Last synced: 01 Jul 2025
https://github.com/rock12231/weather-analysis-backend
Weather analysis, visualization & Data science
data-analysis data-science data-visualisation django-rest-framework jyputer-notebook prediction python
Last synced: 15 Mar 2025
https://github.com/ilovenooodles/probstat-water-potability
Tugas Besar Probabilitas dan Statistika 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/anurag-kumar-molankala/sales-performance-dashboard
A Power BI dashboard that analyzes sales trends, product performance, customer segmentation, and payment distribution. It uses DAX, time intelligence, and interactive visuals for data-driven insights. The model includes Sales, Product, and Customer tables for in-depth analysis.
dashboards data-analysis data-visualization dax dax-functions dax-measures dax-query etl-process powerbi powerbi-visuals powerquery sql-query sql-server
Last synced: 03 Apr 2025