Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/devexpress-examples/winforms-create-a-custom-exporter-for-pivotgridcontrol-with-xtrareport
This example illustrates how to dynamically create a custom report based on PivotGridControl content in WinForms.
data-analysis dotnet pivot-grid pivot-grid-for-winforms winforms
Last synced: 26 Apr 2026
https://github.com/devexpress-examples/wpf-pivotgrid-customize-the-cell-template
This example demonstrates how to customize the cell appearance in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 26 Apr 2026
https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform
This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.
clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation
Last synced: 26 Apr 2026
https://github.com/prady2309/unemployement-analysis
Data Science Project
colab-notebook data-analysis data-science data-visualization eda jupyter-notebook machine-learning python3
Last synced: 26 Apr 2026
https://github.com/akashvarma26/data-analysis-on-imbd-using-sqlite3
Data Analysis on IMDb dataset using sqlite3 and Pandas in Jupyter notebook.
data-analysis jupyter-notebook pandas-dataframe sqlite
Last synced: 27 Apr 2026
https://github.com/arush-codes/paris-olympic-de
data engineering project on paris olympics 2024
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 27 Apr 2026
https://github.com/odinleepro/airbnbnewyorkcityanalysis
AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough level distribution, and insights.
airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis
Last synced: 27 Apr 2026
https://github.com/as16082023/project-portfolio
A guide to all my projects
dashboard data-analysis data-cleaning data-visualization excel mysql power-bi python sql tableau
Last synced: 27 Apr 2026
https://github.com/busesimsek/sql-projects
A collection of my SQL projects with insights into real-world datasets.
data-analysis data-analytics mysql sql
Last synced: 07 Jun 2026
https://github.com/manasashetty01/regulatory-affairs-of-road-accidents
Regulatory Affairs of Road Accidents in Million-Plus Cities (India, 2020)
data-analysis data-science data-visualization exploratory-data-visualizations jupyter-notebook numpy pandas python
Last synced: 27 Apr 2026
https://github.com/elakkiya-u/digital-marketing-campaign-conversion-prediction
A Predictive Modelling whether a customer will convert based on digital marketing campaign data.
campaign-analytics churn-prediction data-analysis deployment digital-marketing-analytics machine-learning power-bi predictive-modelling presentation-slides python
Last synced: 27 Apr 2026
https://github.com/hfzdzakii/dicoding-airqualityanalysisdata
This repo is a master submission for my Dicoding Final Project. Air Quality Dataset is being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
data-analysis data-visualization streamlit
Last synced: 27 Apr 2026
https://github.com/caesaredia/food-app-user-behavior-analysis
Analyze user behavior and optimize app experience in a food-tech startup through funnel analysis and A/A/B testing. Includes data prep, visualization, and statistical testing in Python.
a-b-testing chi-square data-analysis data-visualization funnel-analysis python statistical-testing user-behavior
Last synced: 27 Apr 2026
https://github.com/airdac/ml-palmerpenguins
Classification and analysis of the palmerpenguins dataset in Python. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science machine-learning palmer-penguin python upc
Last synced: 07 Jun 2026
https://github.com/lotfiferaga/hotel-reviews-sentiment-analysis
Efficient Python-driven sentiment analysis for hotel reviews, providing insightful evaluations.
data-analysis data-visualization nlp python
Last synced: 07 Jun 2026
https://github.com/josedanielchg/1990s-netflix-movie-insight
Small exploratory analysis of Netflix movie data from the 1990s. This project is part of the DataCamp Associate Data Scientist in Python program and focuses on filtering, visualizing, and extracting insights from a dataset using Python. Analyze trends in movie durations and count short action films to practice key data science skills!
Last synced: 27 Apr 2026
https://github.com/bheemisme/employee-attrition-analysis
A Dashboard on employee-attrition-analysis
dashboard data-analysis data-science plotly plotly-dash python
Last synced: 28 Apr 2026
https://github.com/l2nce/datamining-study
Introduction to data mining
data-analysis data-mining matplotlib numpy panda
Last synced: 28 Apr 2026
https://github.com/sferez/simple_linear_regression
Simple Linear Regression using Python
data-analysis data-science linear-regression python regression
Last synced: 28 Apr 2026
https://github.com/sujata-adhikari/data-analysis
Data analysis of Market sales data using PowerBi, created dashboard to show analysis.
data-analysis excel pandas powerbi
Last synced: 12 Jun 2026
https://github.com/stefagnone/movies-dataset-analysis-project
Comprehensive analysis of the Movies dataset, exploring genre trends, comparisons, and qualitative insights using Python, Pandas, and visualizations. Designed to uncover actionable findings for stakeholders.
data-analysis data-visualization exploratory-data-analysis matplotlib movies-analysis pandas python seaborn storytelling-with-data
Last synced: 28 Apr 2026
https://github.com/datalopes1/warehouse_rfv
Neste projeto será realizada uma análise do tipo RFV (Recência, Frequência e Valor) com dados que encontrei neste video no Youtube do canal Jie Jenn.
analise-rfv data-analysis data-science kmeans python rfm-analysis
Last synced: 28 Apr 2026
https://github.com/emmanuelletocs/steam-game-recommender
A powerful recommendation system for Steam games, combining Content-Based and Collaborative Filtering techniques. Built with Python, Scikit-learn, and Streamlit to deliver accurate, real-time game recommendations. Perfect for gamers and data scientists interested in building intelligent recommendation engines.
als-algorithm data-analysis gaming-industry knn machine-learning mds mysql ncf neural-network pyspark recommendation-engine recommendation-system scikit-learn spark
Last synced: 28 Apr 2026
https://github.com/rajivaleaakash/customer-churn-prediction
A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.
churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn
Last synced: 28 Apr 2026
https://github.com/abdeldjalilchafai/us-flight-delay-eda
Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.
data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/george-njuguna/spotify-etl-pipeline
This is an ETL pipeline that uses Spotify API , Docker and Airflow
apache-airflow data-analysis docker pipelines python
Last synced: 28 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/rosanafss/sql-journey
SQL, practicing for Udacity Data Track.
data-analysis database datascience jupyter-notebook python queries relational-databases sql sql-server
Last synced: 28 Apr 2026
https://github.com/matheusafonseca/python-data-visualization-matplotlib-seaborn-masterclass-udemy
This repository is dedicated to storing the code developed during the "Python Data Visualization: Matplotlib & Seaborn Masterclass" course on Udemy.
charts data-analysis data-analysis-python data-science data-visualization database graphics graphics-programming jupyter-notebook matplotlib matplotlib-plots python python3 seaborn seaborn-plots
Last synced: 28 Apr 2026
https://github.com/rorrell/coviddeaths
A Jupyter Notebook where I create several visualizations based on data about COVID-19 deaths from 2020 to 2024
data-analysis data-visualization jupyter-notebook python3
Last synced: 28 Apr 2026
https://github.com/abhi227070/car-price-prediction
This project implements a machine learning model to predict the price of cars based on various features such as mileage, manufacturing date, fuel type, and more. Users can input car information, and the model will estimate the price of the car based on the provided data. This tool can be useful for both car buyers and sellers to estimate car price.
data-analysis machine-learning machine-learning-algorithms machinelearning python3 regression regression-models scikit-learn scikitlearn-machine-learning
Last synced: 28 Apr 2026
https://github.com/szapp/candyanalysis
Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics
data-analysis data-visualization feature-selection interaction-terms
Last synced: 28 Apr 2026
https://github.com/wei-rongrong2/openfoodfactclustering
A project that explores clustering food products based on nutritional attributes using K-Means, Fuzzy C-Means, and DBSCAN algorithms, with a Streamlit dashboard for visualizing results.
clustering dashboard data-analysis dbscan food-products fuzzy-cmeans k-means machine-learning nutrition nutrition-clustering open-food-facts streamlit
Last synced: 28 Apr 2026
https://github.com/tanzeelgcuf/medical-information-rule-based-prediction-model-with-api
a rule based system, that learns to make new rules, for medical information that will take the first 11 text fields and predict the last 2 text fields - the diagnosis and disposition. I needs to show the key words used to make the predicted diagnosis and disposition.
data-analysis django machine-learning-algorithms openpyxl python python3 rest-api
Last synced: 28 Apr 2026
https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach
Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.
data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost
Last synced: 28 Apr 2026
https://github.com/josedanielchg/efficient-data-storage-for-predictive-modeling
DataCamp project from the Associate Data Scientist track, focusing on optimizing dataset storage by transforming data types and filtering. Prepares data for efficient machine learning workflows
cleaning-dataset data-analysis jupyter-notebook python
Last synced: 28 Apr 2026
https://github.com/leosimoes/alura-7daysofcode-dados
Desafios das Trilhas de Dados - Ciência de Dados, Machine Learning e Python Pandas.
data-analysis data-science jupyter-notebook machine-learning python
Last synced: 28 Apr 2026
https://github.com/kisaa-fatima/data-visualization-with-tableauleu
Conducted Exploratory Data Analysis (EDA) on the Berkeley Earth Dataset (large scale dataset), which features high-resolution land and ocean time series data. Created interactive dashboards using Tableau to effectively visualize and highlight trends and patterns within the data.
data-analysis data-science exploratory-data-analysis insights python tableau visualizations
Last synced: 29 Apr 2026
https://github.com/prady2309/sales-prediction-using-python
Implemented using Multiple Linear Regression
data-analysis data-science machine-learning python
Last synced: 29 Apr 2026
https://github.com/emircanakyuzz/veri_gorsellestirilmesi_ve_analizi-analysis_and_visualization_of_dataset
Bu çalışmada numpy, pandas, seaborn ve matplotlib gibi veri biliminde çokca bilinen modülleri kullanarak analiz ve görselleştirme işlemleri gerçekleştirdim.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 29 Apr 2026
https://github.com/marcinz20/anomaly-detection-in-credo-dataset
University project, which goal is to build a system, that detects anomalies in CREDO dataset
credo data-analysis data-science encoder-decoder-model jupiter-notebook pca-analysis python3
Last synced: 29 Apr 2026
https://github.com/devexpress-examples/winforms-visualize-pivot-grid-data-in-chart
The following example shows how to integrate the Pivot Grid with the Chart control.
charting data-analysis dotnet pivot-grid-for-winforms winforms
Last synced: 29 Apr 2026
https://github.com/nivasharmaa/spiderverse
A comprehensive Java program for analyzing and managing events and data points within a fictional spiderverse. Features event handling, anomaly detection, cluster management, and robust file I/O operations.
advanced-algorithms anomaly-detection clustering data-analysis file-io object-oriented-programming
Last synced: 29 Apr 2026
https://github.com/vanshuchaudhary/zomato
This Jupyter Notebook contains an exploratory data analysis (EDA) of Zomato restaurant data. It includes data cleaning, visualization, and insights into restaurant ratings, pricing, cuisine distribution, and location-based trends.
business-analytics data-analysis data-mining data-science data-visualization datascience matplotlib pandas-dataframe pandas-python python python-3 python-library
Last synced: 29 Apr 2026
https://github.com/kasraskari/learn-r-codes
A learning repository for R programming, covering data manipulation, visualization, and statistical analysis. (Work in progress!) 🚧
data-analysis data-analysis-r data-visualization r r-examples r-graphics r-statistics statistics
Last synced: 08 Jun 2026
https://github.com/jakebrehm/ezpz-plotting
📈 Easily visualize and manipulate plots from multiple data files.
data-analysis data-visualization engineering matplotlib matplotlib-pyplot pandas plotting python python-3 software software-engineering tkinter tkinter-gui
Last synced: 29 Apr 2026
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 29 Apr 2026
https://github.com/mdaffailhami/king_county_home_sales_analysis
This repository contains code and analysis for exploring home sales data in King County, featuring geospatial mapping to visualize trends and factors influencing housing prices, including location, size, and various property features, using Python and popular data analysis libraries.
data-analysis data-science folium-maps geospatial python
Last synced: 29 Apr 2026
https://github.com/anilyigitsel/istanbul-rental-apartments-analysis
This project analyzes the Istanbul Rental Apartments Dataset (2025), which includes rental apartment listings from Istanbul, Turkey.
data-analysis data-visualization jupyter-notebook matplotlib pandas python rental-housing
Last synced: 29 Apr 2026
https://github.com/dcs-training/network-analyisis-python
Course material for introducing data visualization with Altair and network analysis with NetworkX (in Python). Go to the readme file
data-analysis data-visualisation network-analysis python text-analysis
Last synced: 29 Apr 2026
https://github.com/mrjxtr/ossph_2025_survey_analysis
OSSPH_2025_Survey_Analysis
data-analysis data-visualization matplotlib nltk pandas python sentiment-analysis
Last synced: 29 Apr 2026
https://github.com/george-gca/ai_papers_analysis
Do some analysis based on main AI conferences
conferences data-analysis fasttext fasttext-embeddings fasttext-python python scikit-learn top2vec
Last synced: 29 Apr 2026
https://github.com/rafgpereira/obmep-analise
Código que analisa a retrospectiva das premiações da Obmep em determinada localidade e escola
data-analysis excel pandas python
Last synced: 29 Apr 2026
https://github.com/findmyway/dataframe-in-julia
A quick introduction of DataFrame in Julia for users from Python
data-analysis dataframe julia jupyter-notebook
Last synced: 29 Apr 2026
https://github.com/lankesathwik7/sql-query-assistant
Natural language to SQL query converter using Groq LLM. Ask questions in plain English and get SQL queries, visualized results, and natural language explanations. Built with Streamlit and PostgreSQL.
data-analysis database groq llm natural-language-processing python sql
Last synced: 29 Apr 2026
https://github.com/taljindergill78/yelp-arizona-analysis
This project analyzes the Yelp dataset for the state of Arizona to extract insights about restaurant businesses and user behavior. Using Apache Spark and PySpark for distributed data processing, the project demonstrates how big data tools can be used to uncover patterns in customer reviews, business performance, and user engagement.
big-data data-analysis data-engineering distributed-computing pyspark spark sql yelp-dataset
Last synced: 29 Apr 2026
https://github.com/fatihilhan42/starbucks_analysis_turkey_and_world_with_python
In this project, firstly the brands for coffee in the world and then these brands in Turkey were examined. The data from the dataset, which you can find in the repo, was first organized using data cleaning algorithms. These cleaned data were then graphically extracted using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/carlos-edulira/mbabigdata-projeto
Entrega do projeto MBA Unipe Big Data BI
data-analysis delta minio python spark
Last synced: 29 Apr 2026
https://github.com/mr-dhan/eda-sales-customer-transactions
Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.
dashboard data data-analysis data-analysis-python data-science data-visualization eda python
Last synced: 29 Apr 2026
https://github.com/farhad-here/textprepx
A Multilingual Text Preprocessing Tool for English and Persian.
cleantext contractions data-analysis deep-learning emoji nlp nltk opp parsivar regex streamlit text-preprocessing textblob
Last synced: 29 Apr 2026
https://github.com/hardikk-7/election-analysis-project
A data analytics project exploring the 2024 Indian General Election results using Python. Includes party-wise, state-wise, and vote share analysis with visualizations.
data-analysis data-science election-analysis jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/srinibas-masanta/yelp-business-reviews-analysis
This project analyzes Yelp business reviews using Python, Snowflake, and SQL, focusing on efficient data ingestion, transformation, and analysis. We preprocess JSON data, optimize ingestion via Amazon S3, classify sentiments with Python UDFs, and extract insights using SQL queries—showcasing a streamlined end-to-end workflow.
amazon-s3 data-analysis json python snowflake sql
Last synced: 29 Apr 2026
https://github.com/valikmorinko/ecommerce-sales-analysis
Анализ продаж e-commerce: данные, визуализации, аналитические выводы.
data-analysis e-commerce jupyter matplotlib pandas python seaborn
Last synced: 29 Apr 2026
https://github.com/sdley/cas_pratique-del_annuel
Del-Annuel est logiciel de deliberation annuelle des ecoles superieures ou universités
data-analysis pandas python tkinter-gui
Last synced: 29 Apr 2026
https://github.com/alunera-data/sql-use-cases
Practical SQL use cases for Business Intelligence and IT Service Management (BI & ITSM)
business-intelligence dashboards data-analysis data-quality eda itsm kpis postgresql process-monitoring query reporting sql sqlserver
Last synced: 29 Apr 2026
https://github.com/meinhere/ta-pendat
Proyek Akhir Mata Kuliah Penambangan Data - Klasifikasi Trauma Pasien Menggunakan Metode Naive Bayes
data-analysis data-mining naive-bayes-classifier python trauma
Last synced: 29 Apr 2026
https://github.com/varshan1123/sql-tableau-project
We analyze key indicators for our pizza sales data to gain insights into our business performance - A Data Analysis Project performed on Tableau & SQL.
analysis data-analysis data-science data-visualization excel mysql powerbi sql sql-server tableau tableau-dashboards
Last synced: 29 Apr 2026
https://github.com/brevex/code-complexity-data-analisis
Data collection that shows different complexity scores in an algorithmic dataframe.
code-analysis data-analysis data-science python
Last synced: 29 Apr 2026
https://github.com/shimaa83/eda-repo
Exploratory data analysis for Police and retail dataset in kaggle
Last synced: 29 Apr 2026
https://github.com/prithviraj-2003/cognifyz-data-science-internship
🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF
data-analysis data-science data-visualization matplotlib numpy pandas python3
Last synced: 29 Apr 2026
https://github.com/yimethan/basics-of-data-analysis
2023-2 Basics of Data Analysis
data-analysis numpy pandas python
Last synced: 29 Apr 2026
https://github.com/psyplot/psy-transect
Visualize and explore transects with psyplot
data-analysis data-exploration data-science exploratory-data-analysis psyplot transects
Last synced: 29 Apr 2026
https://github.com/nishumehta/supermart-grocery-sales-retails-analytics
Tableau Dashboard Link :
data-analysis data-cleaning data-visualization jupyter-notebook matplotlib-pyplot numpy pandas python3 seaborn
Last synced: 29 Apr 2026
https://github.com/theoplayz2/eda-explorer
Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
analysis battery-life cqrs csharp data-analysis eeg-analysis exploratorydataanalysis json-visualization matplotlib messaging profile-report python verilog visualization
Last synced: 29 Apr 2026
https://github.com/farhad-here/student_performance_analyzer
Student Performance Analyzer with python, it is on of my data analysis course project. I teach you about filter(),lambda,map() in python
data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit
Last synced: 29 Apr 2026
https://github.com/alam025/algo-trading-bot
Backtested 20+ strategies achieving 18% annualised returns on historical S&P 500 data
api ccxt data-analysis finance fintech pandas postgresql python
Last synced: 08 Jun 2026
https://github.com/alam025/invoice-generator
Processed 500+ invoices with automated payment reminders and multi-currency PDF generation
api data-analysis finance fintech nextjs pdfkit prisma python stripe
Last synced: 08 Jun 2026
https://github.com/andryadsm/asset-analyzer
📈 Project Asset Analyzer (Python)
commodities data-analysis data-visualization economics financial-markets investing matplotlib numpy pandas python seaborn stock-market strategy trading
Last synced: 29 Apr 2026
https://github.com/jelhamm/model-ensembles-bagging-in-machine-learning
"This repository contains implementations of Boosting method, aimed at improving predictive performance by combining multiple models. by using titanic database."
bagging bagging-ensemble data-analysis database-analysis datamining datamining-algorithms ensemble-machine-learning ensemble-model jupyter-notebook machine-learning-models machine-learning-projects matplotlib module numpy-library pandas-library python sklearn-library
Last synced: 30 Apr 2026
https://github.com/angchekar28/air-quality-index-analysis
This project analyzes Air Quality Index (AQI) data to identify pollution trends, seasonal variations, and the impact of different pollutants. It includes data visualization, correlation analysis, and insights into air quality variations over time.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 30 Apr 2026
https://github.com/shruti-h/sales_data_analysis
Sales Data Analysis | Pandas & Matplotlib
data-analysis data-science data-vi matplotlib pandas-library python
Last synced: 30 Apr 2026
https://github.com/yuvrajs2003/formula-1-performance-analysis
Analysis of F1 races and their drivers
data-analysis data-science data-visualization hyperparameter-tuning pandas python
Last synced: 09 Apr 2026
https://github.com/jotstolu/netflix-sql-data-analysis-project
This project explores the Netflix dataset using SQL queries to uncover trends, patterns, and business insights that could help stakeholders understand content distribution, viewer preferences, and platform optimization
data-analysis sql sql-server tsql
Last synced: 02 Aug 2025
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project
In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.
data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql
Last synced: 09 Apr 2026
https://github.com/syed-amjad-ali/restaurant-sales-sql-project
This was a simple SQL project where I analyzed restaurant sales data, showcasing skills in data creation and querying. The project explores menu performance, order trends, and customer insights.
aggregations business-intelligence data-analysis guided-project joins maven-analytics querying restaurant-sales sales-data sql subqueries
Last synced: 03 Jan 2026
https://github.com/kuuhaku86/datmingemastik19
data-analysis data-mining data-science data-visualization
Last synced: 02 Aug 2025
https://github.com/quesocosteno03/data-analysis-projects
This repository serves as a collection of all my projects.
data-analysis jupyter-notebook powerbi
Last synced: 02 Aug 2025
https://github.com/yamslam/contentsunderpressure_processing
A repository for data processing and analysis for Contents Under Pressure.
data-analysis data-processing data-visualization game-based-learning judgments process-safety
Last synced: 07 Sep 2025
https://github.com/waghraj1699/car-price-prediction
Implementation of ML algorithm to predict the car price
artificial-intelligence data-analysis data-science data-visualization feature-engineering linear-regression machine-learning machine-learning-algorithms regression-models
Last synced: 02 Aug 2025
https://github.com/idaraabasiudoh/credit_card_fraud_detection
This repository contains a machine learning project focused on detecting credit card fraud using Decision Tree and Support Vector Machine (SVM) classifiers.
data-analysis jupyter-notebook machine-learning python3 scikit-learn snapml
Last synced: 19 Feb 2026
https://github.com/thedevreda/jadaerospace
A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/abdullahashfaqvirk/PowerBI-Dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 27 Sep 2025
https://github.com/lc-rezende/eqx_boston_dataset
Exploratory data analysis, clustering, and forecasting on Boston crime data (2011-2015), revealing key crime trends, hotspots, and temporal patterns to support data-driven insights for urban safety and policing strategies.
data-analysis exploratory-data-analysis jupyter-notebook kmeans matplotlib numpy pandas prophet-facebook python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/faint-liebfraumilch101/fraud-detection-sql-unsupervised
🕵️♂️ Detect fraud in bank transactions using SQL for feature engineering and Python's Isolation Forest for unsupervised anomaly detection.
anomaly-detection banking-data data-analysis data-science financial-analytics fraud-detection isolation-forest machine-learning portfolio-project python sql sqlite unsupervised-learning
Last synced: 07 May 2026
https://github.com/prasannnnn/real-time-share-price-scraping-and-analysis
The Stock Sentiment Analyzer is a web-based application built with Streamlit, BeautifulSoup, and Pandas to help users analyze the sentiment of a stock (BUY, SELL, or HOLD) based on its financial data. The tool extracts key financial metrics like Market Cap, Stock P/E, Dividend Yield, ROCE, ROE, and the 52-week High/Low from Screener.in.
beautifulsoup4 data-analysis python sentiment-analysis streamlit streamlit-dashboard webscraping
Last synced: 03 Aug 2025
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026