Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/tbep-tech/peptools
Materials for wrangling and summarizing data from the Peconic Estuary
data-analysis package pep water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/rookery-bay-training
Materials for R training at Rookery Bay Monitoring Workshop 2020
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/arju10/exploring-hacker-news-posts
data-analysis hacker-news jupyter-notebook python
Last synced: 19 May 2026
https://github.com/rohitblaze10/survey_monkey_analysis--using-ipython
This data analysis project focused on extracting insights from survey responses. It involves data cleaning, merging, and transformation using iPython (Pandas,OS) and SQL. The goal is to identify trends and patterns in survey data for better decision-making.
data-analysis ipynb ipython-notebook
Last synced: 28 Jul 2025
https://github.com/mohitsai/mohitsai.github.io
My Personal Project Portfolio - simple SPA with basic HTML, CSS & Javascript
data-analysis data-engineering portfolio portfolio-page portfolio-website project-portfolio single-page-app software-engineering
Last synced: 28 Jul 2025
https://github.com/alikhalajii/text-classification-life-sciences
Text classification of Life Science apps
data-analysis data-science datasets feature-importance jupyter-notebook pandas sbert scikit-learn word2vec
Last synced: 05 May 2026
https://github.com/ashwin331133/sql-project--sales-data-analysis--walmart
This SQL-based Walmart data analysis project aims to identify top-performing branches and products, optimize sales strategies using Kaggle's Walmart Sales Forecasting Competition dataset.
Last synced: 03 Jan 2026
https://github.com/labex-labs/sqlite-intermediate-to-advanced
In this course, delve into advanced SQLite techniques. Master constraints, indexing, joins, subqueries, transactions, triggers, views, full-text search, JSON, backups, PRAGMA tuning, CTEs, window functions, and more!
advanced-sql course data-analysis data-integrity data-manipulation data-modeling database database-design hands-on labex labs performance-tuning programming query-optimization relational-database schema-management sql sqlite stored-procedures transaction-management
Last synced: 18 May 2026
https://github.com/lucashomuniz/project-09
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 28 Jul 2025
https://github.com/odinsride/monpl
PL/SQL Data Load Monitoring Tools
data-analysis database etl logging logging-framework monitoring oracle plsql
Last synced: 28 Jul 2025
https://github.com/brad-cannell/waiver_evaluation
Waiver Evaluation
data-analysis evaluation hospital-admissions medicaid
Last synced: 19 Feb 2026
https://github.com/archanakokate/eda_amazon_products_and_discounts_2023
Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data
data-analysis data-mining data-visualization exploratory-data-analysis
Last synced: 03 Jan 2026
https://github.com/a-iceberg/clustering_and_naming_categories
Summarization, clastering and characterization of text categories using LLM
bertscore clustering data-analysis data-science deep-learning gpt llm mssqlserver nlp openai prompt-engineering python summarization transformers
Last synced: 08 Feb 2026
https://github.com/swethajoseph/statistical-stock-performance-analysis
Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.
advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis
Last synced: 03 Jan 2026
https://github.com/prateek5525/retail-sales-analysis-project
This project involves analyzing retail sales data using SQL to uncover insights into sales patterns, customer behavior, and product performance. It serves as an exercise to develop foundational SQL skills in data exploration, cleaning, and analysis.
data-analysis data-cleaning retail-sales-data sql
Last synced: 03 Jan 2026
https://github.com/aabbtree77/uci-marketing-analysis-cart
UCI bank marketing data analysis with decision trees (CART).
cart chatgpt commerce conversion-rate data-analysis decision-trees deepseek grok kovnatsky marketing-analytics miniconda scikit-learn-python uci-machine-learning
Last synced: 29 Jul 2025
https://github.com/hasinii12/-chocolate-analysis-dashboard
This Power BI report provides a comprehensive analysis of chocolate ratings and related attributes.
data-analysis data-visualization powerbi
Last synced: 09 Feb 2026
https://github.com/codeonthespectrum/web-scrap
Este projeto realiza o web scraping da Wikipédia para obter dados sobre os municípios mais populosos do estado do Rio de Janeiro.
data-analysis data-visualization webscraping
Last synced: 16 Feb 2026
https://github.com/kanasia-moore/sql-project
bigquery data-analysis data-analysis-project data-analytics dataset homelessness sql
Last synced: 21 Sep 2025
https://github.com/noorulhudaajmal/business-performance-analytics
Python-Streamlit based interactive dashboard to analyze and visualize key business metrics for an online store.
business-analytics dashboard data-analysis python-streamlit
Last synced: 29 Jul 2025
https://github.com/malakasupun/crime-data-analysis-of-lapd
This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).
data-analysis data-visualization llm nlp sql
Last synced: 29 Jul 2025
https://github.com/yash22222/literacy-exploration-analysis
Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.
csv data-analysis data-visualization government-data india literacy literacy-analysis states
Last synced: 29 Jul 2025
https://github.com/nguyenda18/ppp-data-tool
Command line tool (could later be used as lambda function) to download CSV files from SBA and generate JSON
data-analysis nodejs-server ppp-files ppp-loans
Last synced: 29 Jul 2025
https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds
With recommendations to UK Department for Education of 10 Local Authorities where National Tutoring Programme (NTP) should be intensified and a response to UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target which seems far-fetched.
data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics
Last synced: 21 Sep 2025
https://github.com/maazie-khan/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
azure big-data data-analysis dataengineering devops pipeline
Last synced: 13 May 2026
https://github.com/naso7y/twitter-sentiment-analysis
Classifies airline-related tweets as positive, negative, or neutral using machine learning and NLP.
data-analysis machine-learning nlp sentiment-analysis
Last synced: 29 Jul 2025
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/prestonjohnson-portfolio/marketing-data-portfolio-project
Analyzing Marketing Data for Future Improvements
data data-analysis data-visualization powerbi sql sql-server
Last synced: 21 Sep 2025
https://github.com/jpcadena/tweets-classification-frontend
Frontend project for the Classification Tweets project
api axios css data-analysis data-science data-visualization ecuador eslint frontend html insecurity json machine-learning node npm openapi-typescript-generator react tweets-classification twitter typescript
Last synced: 06 Apr 2026
https://github.com/dadvaiahpavan/ai-data-scientist-
AI-powered tool for dataset analysis, featuring data preprocessing, classification, regression, anomaly detection, and text analysis. Built with scikit-learn, pandas, and Plotly for visualization. Includes an interactive Streamlit web interface for real-time data analysis.
ai anomaly-detection classification data-analysis data-science machine-learning panda plotu regression scikit-learn sentiment-analysis streamlit
Last synced: 03 May 2026
https://github.com/ozep/genshincharacteranalysis
Uses a spreadsheet with Character Data and organizes it into readable graphs.
data-analysis jypyternotebook python
Last synced: 18 Apr 2026
https://github.com/sandergi/ekichabi
A digital phonebook to connect sustenance farmers in Tanzania. Works via USSD so farmers without an internet connection can use it (via their Telecom). Build with Django in Python and a MySQL database. This is a public copy of the private repo with user information stripped.
android data-analysis ict4d research ussd
Last synced: 14 May 2026
https://github.com/nagasai123-k/twitter-sentiment-analysis
Twitter sentiment analysis
csv data-analysis data-analysis-python data-analytics data-science data-visualization ipynb ipynb-jupyter-notebook jyputer-notebook jypyternotebook jyutping md python raw-data
Last synced: 02 Mar 2025
https://github.com/hemangsharma/bookingdataanalysisreport
The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.
analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard
Last synced: 14 May 2026
https://github.com/antrita/stroke_prediction_model
A model that combines Kaggle's Stroke Prediction Dataset with live weather/air quality data to implement FDA-compliant MLOps pipeline and shows expertise in healthcare regulations and real-time inference.
ai data-analysis deep-learning kaggle-dataset machine-learning prediction-model random-forest real-time scikit-learn streamlit weather-api xgboost
Last synced: 07 May 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/cyberoctane29/epa-air-quality-aqi-analysis
This project involved analyzing air quality data from the EPA, focusing on the Air Quality Index (AQI). I used Python data structures like dictionaries and sets to manage and process the data, simulating real-world data analysis to assess pollution levels and their health implications.
data-analysis numpy pandas python statistics
Last synced: 10 Apr 2026
https://github.com/sinsunsan/earth-survival-kit
Global warning data visualisation app to make everyone understand global warning and take actions that matter
angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts
Last synced: 05 May 2026
https://github.com/nathadriele/transaction_fraud_prevention_pipeline
Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.
data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost
Last synced: 10 Apr 2026
https://github.com/timbeechey/clubpro
Classification using binary procrustes rotation
classification data-analysis psychology-experiments r r-package rcpp rstats statistical-analysis statistics
Last synced: 19 Feb 2026
https://github.com/benmar2406/rent-in-germany
Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.
charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte
Last synced: 26 Mar 2025
https://github.com/Madhuresh2011/Career-Aspiration-Of-Gen-Z-Project-Using-Excel
Career Aspiration Of Gen-Z ,To explore the industries, roles, and pathways using Excel .
dashboards data-analysis data-visualization datacleaning dataset designing excel functional-dashboard gen-z kpi pivot-charts pivot-tables project-using-excel
Last synced: 22 Sep 2025
https://github.com/jaymax01/website-performance-analysis
Analyzing retail performance
data-analysis data-visualization feature-engineering google-colab metrics presentations python
Last synced: 19 May 2026
https://github.com/karencofre/riesgorelativo-lookerstudio
proyecto de análisis de datos y análisis perdicitvo en looker studio y google colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 03 Jan 2026
https://github.com/sadratehranian/pem-fuel-cell
The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.
anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code
Last synced: 26 Mar 2025
https://github.com/phomint/udacity_free_datavisualization_with_tableau
Udacity free course using Tableau
data-analysis tableau udacity-course
Last synced: 09 Mar 2026
https://github.com/chen0040/python-data-analytics-feature-selection
Python project on feature selection
data-analysis feature-engineering feature-selection
Last synced: 03 Apr 2025
https://github.com/kartikey2807/bike-classification-1rt700
Binary classification problem involving Logistic regression, SMOTE and feature expansion.
data-analysis data-engineering data-visualization logistic-regression
Last synced: 30 Jul 2025
https://github.com/sidsin0809/hmdb-endo-flagger
A Python toolkit to identify and score endogenous human metabolites from HMDB XML metadata
data-analysis hmdb metabolomics ontology pipeline python-3 streaming-parser xml-parsing
Last synced: 06 Jul 2025
https://github.com/amoghkori/deeplabcut-package-for-animal-pose-estimation
DeepLabCut Mouse Location Prediction: Training a deep neural network to predict the location of a mouse using annotated joint positions.
data-analysis data-annotations data-preprocessing deep-learning machine-learning model-evaluation python-programming research research-project
Last synced: 17 Mar 2025
https://github.com/sanveed-adnan/supermarket-sales-sql-project
SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.
business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite
Last synced: 08 Nov 2025
https://github.com/alanmenchaca/getting-and-cleaning-data-course-project
The purpose of this project is to demonstrate how to collect, work with, and clean a data set.
data-analysis getting-and-cleaning-data rstudio tidy-data
Last synced: 31 Jul 2025
https://github.com/teamtigers/echartify
A web application built with .net core 2.2 that has come with the idea of reading the National Election's Data-set of Bangladesh in a fastest possible time and then representing the data-set with different statistical charts.
bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages
Last synced: 16 Apr 2026
https://github.com/jackmnob/python-tableau-eda-stockdash
Data cleaning, preparation, and manipulation (EDA) for an interactive stock market dashboard with Tableau - using pandas (Python) via JupyterLab
cleaning-data dashboard data-analysis data-preparation eda jupyter-notebook jupyterlab python tableau-public
Last synced: 14 May 2026
https://github.com/dieegogutierrez/cyclistic
Google Data Analytics Capstone Project
data-analysis datavisualization googleslides kaggle rprogramming rstudio
Last synced: 02 Apr 2025
https://github.com/alekszhs/data-analyst-portfolio
Data Analytics Portfolio
data-analysis data-visualization excel mysql problem-solving python tableau
Last synced: 08 May 2026
https://github.com/alrza2003/google-data-analysis-case-study-cyclistic
This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.
business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau
Last synced: 09 May 2026
https://github.com/ayeshathoi/simulation-sessional-412
Simulation of SSQS, Inventory System, Transient State, PERT, Monte Carlo Alo etc.
data-analysis excel inventory-system monte-carlo python simulation ssqs triangle-distributions
Last synced: 31 Jul 2025
https://github.com/rosa-lpz/data-analysis-handbook
Data Analysis base knowledge and practical applications
data data-analysis data-visualization database dax documentation power-bi python r sql tableau tableau-public
Last synced: 06 Apr 2026
https://github.com/mainak-97/netflix-content-analysis-project
SQL-based analysis of Netflix’s movies and TV shows dataset to uncover content trends, popular genres, geographical insights, and audience preferences. Includes data queries, findings, and a presentation of key insights.
data-analysis mysql mysql-workbench powerpoint presentation-slides sql
Last synced: 23 Sep 2025
https://github.com/derogative404/google_data_analytics_capstone
Capstone project part of the Google Data Analytics Certificate Program
Last synced: 26 Mar 2025
https://github.com/remram44/apex-legends-ocr-data
Get data from Apex Legends streams using OCR
apex-legends data-analysis video-games
Last synced: 31 Jul 2025
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/pauliorandall/airline-passenger-satisfaction-r
Analysing the Airline Passenger Satisfaction dataset from Maven Analytics
data-analysis data-analytics r
Last synced: 01 Aug 2025
https://github.com/computingvictor/mercadona_agent
Web app to explore supermarket products with advanced filters, search, favorites, and nutritional info. Includes data analysis notebooks for deeper insights.
css data-analysis data-science data-visualization filtering html interactive-ui javascript notebooks nutritional-info pandas product-catalog python supermarket webapp
Last synced: 09 Apr 2026
https://github.com/darkdk123/handwashing-discovery-analysis
A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 09 Apr 2026
https://github.com/celineboutinon/chicken-run
OpenClassrooms Data Analyst 2022-2023 - Projet 9
data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot missingno numpy pandas plotly python scikit-learn scipy seaborn statsmodels
Last synced: 09 Apr 2026
https://github.com/aygp-dr/claude-log-stream
Advanced analytics engine for Claude Code logs with real-time processing capabilities
claude-api clojure data-analysis monitoring
Last synced: 24 Sep 2025
https://github.com/palwisha-18/time_series_analysis_lex_vs_gdp
Analyzes how a country’s GDP per capita correlates with the life expectancy of its citizens over a period of about 100+ years
data-analysis data-visualization pandas plotl time
Last synced: 19 May 2026
https://github.com/owenl0000/housepricesproject
Kaggle Project
data-analysis data-science data-visualization gridsearchcv kaggle-competition kaggle-dataset linear-regression machine-learning machine-learning-algorithms numpy onehot-encoding ordinal-encoding pandas python random-forest-regression sckit-learn seaborn streamlit xgboost-regressor
Last synced: 09 Apr 2026
https://github.com/yousefmohammad/american_collage_quickanalysis
Quick Anaylsis about American Colleges
data-analysis data-visualisation data-visualization datanalysis datavisualisation datavisualization excel microsoft-excel
Last synced: 09 Mar 2026
https://github.com/xenon1919/credit-card-fraud-detection
Credit Card Fraud Detection is a machine learning project to predict fraudulent credit card transactions. It handles imbalanced data using undersampling and applies Logistic Regression and XGBoost models. With an AUC of 0.98, it offers robust fraud detection. Includes a Streamlit app for real-time predictions.
data-analysis machine-learning python
Last synced: 14 May 2026
https://github.com/aravind2060/employee_engagement_analysis_spark
Using Spark Structured APIs to analyze employee data and extract insights related to employee satisfaction, engagement, concerns, and job titles within an organization.
apache-spark data-analysis data-preprocessing docker docker-compose python
Last synced: 09 Apr 2026
https://github.com/jasoncobra3/finops-copilot
An end-to-end AI-powered FinOps platform that ingests cloud billing data, analyzes cost trends, answers natural-language questions using a RAG pipeline (LangChain + FAISS + sentence-transformers + Groq), and provides actionable cost optimization recommendations. Includes a FastAPI backend and Streamlit dashboard UI - fully containerized with Docker
ai-assistant cloud-cost-optimization cloud-enginee cost-analytics data-analysis devops docker faiss faiss-vector-database fastapi finops groq langchain llm pandas rag rag-pipeline sentence-transformers sqlite3 streamlit
Last synced: 13 Apr 2026
https://github.com/kaushik-puttaswamy/amazon-sales-dashboard-using-tableau
The Amazon Sales Data Analysis Dashboard provides insights into key sales metrics like profit, revenue, shipment days, and units sold. It includes visualizations to assess performance by region, country, and sales channel. The dashboard helps stakeholders optimize strategies and improve profitability through data-driven analysis.
dashboard data-analysis data-visualization tableau
Last synced: 11 Jan 2026
https://github.com/rodolfo-brandao/pos-graduacao
[pt-BR] Repositório para armazenar alguns materiais e projetos de cada módulo da minha especialização em Ciência de Dados (2025–2027)
artificial-intelligence data-analysis data-science data-visualization databases deep-learning jupyter linear-algebra machine-learning python r statistics
Last synced: 09 Apr 2026
https://github.com/0xunkn0wn4m1r/data_engineering_banking_project
🏦 Build a complete data engineering workflow for a banking system, showcasing ETL processes, data transformations, and an interactive financial dashboard.
automation data-analysis data-cleaning data-science feature-engineering fintech-bank flask-api loan-default-prediction machine-learning mlops model-explainability numpy postgresql scikit-learn segmentation shap sql unsupervised-learning
Last synced: 09 Apr 2026
https://github.com/analyst-lochan/flight-delay-and-cancellation-dataset-2019-2023-
This project demonstrates a complete data analytics pipeline starting from raw real-world flight data to professional visual dashboards using SQL Server and Power BI. It showcases data import, cleaning, optimization, transformation, and dynamic DAX-based visual reporting.
airline-performance business-intelligence data-analysis data-cleaning data-modeling data-visualization dax etl flight-data kaggle-dataset portfolio-project powerbi powerbi-dashboard sql sql-server
Last synced: 09 Sep 2025
https://github.com/vishal-bhandary/sql-data-analytics
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql
Last synced: 30 Jun 2026
https://github.com/kevingastelum/mydataanalysis
My DataAnalyst Projects | Python, SQL, Excel, PowerBI & Tableau
data-analysis python sql visualization
Last synced: 20 May 2026
https://github.com/ashwin331133/powerbi-data_professional_survey_breakdown
This project analyzes survey data from individuals interested in transitioning to the data field. The survey aims to understand their backgrounds, motivations, and the challenges they face. Using Power BI for data visualization, the project provides insights into the demographics and preferences of these aspirants.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/phanchenh/supplychaindashboard_datacpsupplychain
Tracking Trends in Supply Chain – A Sales and Profit Review (2015-2017)
business-analytics business-intelligence data-analysis data-visualization dax-languague dax-query mssql mssqlserver powerbi supply-chain supply-chain-management
Last synced: 01 Aug 2025
https://github.com/ritap03/neuralnetwork-shapeclassifier
Feedforward neural network system in MATLAB for geometric shape classification. Includes data preprocessing, network training and evaluation, confusion matrix analysis, and a graphical interface for user interaction and model testing.
ai data-analysis deep-learning feedforward-network gui image-classification machine-learning matlab neural-network pattern-recognition
Last synced: 14 May 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/lucashomuniz/project-15
[Dashboard] Enhancing Business Intelligence: Leveraging SQL, Python, and DAX for Strategic Insights in Sales Analysis
business-analytics business-intelligence data-analysis data-science data-visualization dax-languague machine-learning powerbi python
Last synced: 12 Jul 2025
https://github.com/jimohola/flight-price-predict-deployment
Deploying Machine Learning Models via Microsoft Azure
cloud-computing css data-analysis data-preprocessing data-visualization flask html machine-learning python3
Last synced: 05 May 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/dlozeve/topological-persistence
Topological persistence diagram (barcode) of a triangulation
data-analysis persistence topology
Last synced: 02 Aug 2025
https://github.com/ausaaf-rh/movie-recommendation-system-collaborative-filtering
🎬 A comprehensive movie recommendation system implementing item-based collaborative filtering with cosine similarity. Features real-time recommendations, performance evaluation metrics (Precision@K, Recall@K), and interactive user interface. Built with Python, scikit-learn, and MovieLens dataset for academic research and learning purposes.
agents data-analysis jupyter-notebook python python3
Last synced: 17 Apr 2026
https://github.com/baslia/zillow_exploration
data-analysis data-science ds machine-learning pandas python
Last synced: 09 Apr 2026
https://github.com/jedrzej-wydra/competition-cooperation
Competition, cooperation, and parental effects in larval aggregations formed on carrion by communally breeding beetles Necrodes littoralis (Staphylinidae: Silphinae)
data-analysis non-linear-regression r
Last synced: 20 Aug 2025
https://github.com/takshak26/predict_blood_donations-
About The title of the project is “Predict Blood Donations”. It uses python as language, data science, and machine learning as the field of operation, TPOT library for model selection, logistic regression for model building, and jupyter notebook as the code editor.
data-analysis data-visualization datascience machine-learning python3
Last synced: 16 May 2026
https://github.com/yuvrajs2003/formula-1-performance-analysis
Analysis of F1 races and their drivers
data-analysis data-science data-visualization hyperparameter-tuning pandas python
Last synced: 09 Apr 2026
https://github.com/jotstolu/netflix-sql-data-analysis-project
This project explores the Netflix dataset using SQL queries to uncover trends, patterns, and business insights that could help stakeholders understand content distribution, viewer preferences, and platform optimization
data-analysis sql sql-server tsql
Last synced: 02 Aug 2025
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/nahiyanhkhan/stock-market-data-analysis_capstone-project
In this course, learned and solved assignments on SQL and Python. Final capstone project was on analyzing "Stock Market Data". Achieved 100% score in every assignment.
data-analysis data-analytics matplotlib mysql mysql-database numpy pandas python sql
Last synced: 09 Apr 2026
https://github.com/aakk23/netflix_sql_project
This SQL project provides an analytical overview of Netflix's movies and TV shows dataset, uncovering key insights related to content types, ratings, release trends, and geographic distribution. It helps explore patterns in content availability, audience targeting, and regional preferences to support data-driven decisions.
data-analysis netflix-data-analysis postgresql sql
Last synced: 10 Apr 2025