Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/tawfikhammad/sql-leetcode-solutions
The solutions of SQL 50 LeetCode problems
data-analysis data-engineering database leetcode leetcode-solutions sql
Last synced: 15 Jun 2026
https://github.com/prathmesh2507/global-stock-intelligence-dashboard
Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.
dashboard data-analysis data-visualization python stock-analysis streamlit
Last synced: 15 Jun 2026
https://github.com/victoryfanfare/car-price-prediction
ML модель для определения рыночной стоимости автомобилей с пробегом. Проект включает анализ данных, feature engineering и сравнение различных алгоритмов машинного обучения.
catboost data-analysis jupyter-notebook lightgbm machine-learning pandas python regression
Last synced: 15 Jun 2026
https://github.com/mattsebastianh/Analyze-Data-with-Python-Portfolio-Project
Analyze Data with Python
barplot categories chi-square-test conservation contingency-table crosstab data-analysis data-cleaning-and-preprocessing eda endangered-species matplotlib national-parks pandas-dataframe species species-conservation
Last synced: 18 Jun 2026
https://github.com/ddsuhaimi/turkiye-student-evaluation-eda
A little bit of exploration of well-known Turkiye Student Evaluation dataset
data-analysis data-science data-visualization-project exploratory-data-analysis exploratory-data-visualizations
Last synced: 18 Jun 2026
https://github.com/ilhanseyhanx/car-price-prediction-with-machine-learning
🚗 ML-powered car price prediction model with 95.88% accuracy using Random Forest and comprehensive data preprocessing
car-price-prediction data-analysis data-science machine-learning pandas python random-forest regression sklearn
Last synced: 19 Jun 2026
https://github.com/dgraves4/cms-hospital-quality-analytics
Python analytics project using CMS hospital quality data to clean, summarize, and visualize hospital ratings, reporting patterns, and facility characteristics.
cms-data data-analysis eda healthcare-analytics matplotlib pandas python
Last synced: 19 Jun 2026
https://github.com/dcs-training/r-visualisation-and-stats
This repository contains material from a 8 classes course on Data Visualisation and statistics with R
data-analysis data-visualisation data-wrangling intro-to-programming r statistics
Last synced: 20 Jun 2026
https://github.com/prestonjohnson-portfolio/marketing-data-portfolio-project
Analyzing Marketing Data for Future Improvements
data data-analysis data-visualization powerbi sql sql-server
Last synced: 21 Sep 2025
https://github.com/emaleckova/emaleckova.github.io
My personal website created with Quarto
biology data-analysis data-viz quarto r
Last synced: 23 Jun 2026
https://github.com/ozep/genshincharacteranalysis
Uses a spreadsheet with Character Data and organizes it into readable graphs.
data-analysis jypyternotebook python
Last synced: 18 Apr 2026
https://github.com/antrita/stroke_prediction_model
A model that combines Kaggle's Stroke Prediction Dataset with live weather/air quality data to implement FDA-compliant MLOps pipeline and shows expertise in healthcare regulations and real-time inference.
ai data-analysis deep-learning kaggle-dataset machine-learning prediction-model random-forest real-time scikit-learn streamlit weather-api xgboost
Last synced: 07 May 2026
https://github.com/sinsunsan/earth-survival-kit
Global warning data visualisation app to make everyone understand global warning and take actions that matter
angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts
Last synced: 05 May 2026
https://github.com/phomint/udacity_free_datavisualization_with_tableau
Udacity free course using Tableau
data-analysis tableau udacity-course
Last synced: 09 Mar 2026
https://github.com/kartikey2807/bike-classification-1rt700
Binary classification problem involving Logistic regression, SMOTE and feature expansion.
data-analysis data-engineering data-visualization logistic-regression
Last synced: 30 Jul 2025
https://github.com/sanveed-adnan/supermarket-sales-sql-project
SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.
business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite
Last synced: 08 Nov 2025
https://github.com/darkdk123/handwashing-discovery-analysis
A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 09 Apr 2026
https://github.com/celineboutinon/chicken-run
OpenClassrooms Data Analyst 2022-2023 - Projet 9
data-analysis data-analytics data-visualisation dataframes matplotlib-pyplot missingno numpy pandas plotly python scikit-learn scipy seaborn statsmodels
Last synced: 09 Apr 2026
https://github.com/owenl0000/housepricesproject
Kaggle Project
data-analysis data-science data-visualization gridsearchcv kaggle-competition kaggle-dataset linear-regression machine-learning machine-learning-algorithms numpy onehot-encoding ordinal-encoding pandas python random-forest-regression sckit-learn seaborn streamlit xgboost-regressor
Last synced: 09 Apr 2026
https://github.com/jasoncobra3/finops-copilot
An end-to-end AI-powered FinOps platform that ingests cloud billing data, analyzes cost trends, answers natural-language questions using a RAG pipeline (LangChain + FAISS + sentence-transformers + Groq), and provides actionable cost optimization recommendations. Includes a FastAPI backend and Streamlit dashboard UI - fully containerized with Docker
ai-assistant cloud-cost-optimization cloud-enginee cost-analytics data-analysis devops docker faiss faiss-vector-database fastapi finops groq langchain llm pandas rag rag-pipeline sentence-transformers sqlite3 streamlit
Last synced: 13 Apr 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/dlozeve/topological-persistence
Topological persistence diagram (barcode) of a triangulation
data-analysis persistence topology
Last synced: 02 Aug 2025
https://github.com/takshak26/predict_blood_donations-
About The title of the project is “Predict Blood Donations”. It uses python as language, data science, and machine learning as the field of operation, TPOT library for model selection, logistic regression for model building, and jupyter notebook as the code editor.
data-analysis data-visualization datascience machine-learning python3
Last synced: 16 May 2026
https://github.com/jotstolu/netflix-sql-data-analysis-project
This project explores the Netflix dataset using SQL queries to uncover trends, patterns, and business insights that could help stakeholders understand content distribution, viewer preferences, and platform optimization
data-analysis sql sql-server tsql
Last synced: 02 Aug 2025
https://github.com/syed-amjad-ali/restaurant-sales-sql-project
This was a simple SQL project where I analyzed restaurant sales data, showcasing skills in data creation and querying. The project explores menu performance, order trends, and customer insights.
aggregations business-intelligence data-analysis guided-project joins maven-analytics querying restaurant-sales sales-data sql subqueries
Last synced: 03 Jan 2026
https://github.com/waghraj1699/car-price-prediction
Implementation of ML algorithm to predict the car price
artificial-intelligence data-analysis data-science data-visualization feature-engineering linear-regression machine-learning machine-learning-algorithms regression-models
Last synced: 02 Aug 2025
https://github.com/abdullahashfaqvirk/PowerBI-Dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 27 Sep 2025
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/0xnu/england-house-prices
Predict house prices for the next five years across all English local authorities.
data-analysis england england-house-prices housing-market housing-market-analysis predictive-modeling regression
Last synced: 03 Aug 2025
https://github.com/asghar-rizvi/hotel_reservation_data_analysis
This project involves a comprehensive data analysis of a hotel reservation dataset using Excel. The primary focus is on examining reservation cancellations. Through detailed analysis and visual representation.
dashboard dashboard-templates data-analysis data-analysis-excel data-representation data-science excel
Last synced: 02 Mar 2026
https://github.com/phanchenh/datacosupplychain_sqlproject
Supply Chain Optimization – Tackling Delivery Delays and Profitability Challenges (2015-2017)
business-analytics business-intelligence data-analysis insights jupyter-notebook mssql mssqlserver python supply-chain supply-chain-analytics supply-chain-optimization
Last synced: 09 Mar 2026
https://github.com/theashishmavii/job-trends-analyzer-automation
End-to-end automation: job scraping, data analysis, and trends reporting for job seekers and researchers.
automation beautifulsoup data-analysis open-source pandas python selenium webscraping
Last synced: 07 Aug 2025
https://github.com/sebastianofazzino/ibm-data-science-professional-certificate
In this repository I've stored exercises and projects I've been working on while attending IBM Data Science Professional Certificate, using Python and its libraries.
data-analysis data-mining data-science data-structures data-visualization database machine-learning matplotlib numpy pandas python regression seaborn sql
Last synced: 09 Apr 2026
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/jagoda11/elastic-vision
This repository contains a full-stack application designed to explore data from ElasticSearch🧐indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered🚀 by React.
backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables
Last synced: 09 Apr 2026
https://github.com/lorennmarque/logistics-data-exploratory-analysis-iflow
Delivery Data Exploratory Analysis
case-study data-analysis data-science data-visualization eda
Last synced: 20 Jan 2026
https://github.com/busradeveci/odev2-branching
This project is prepared for Artificial Intelligence and Technology Academy Git GitHub Assignment 2. Using the “Wine Reviews” dataset from Kaggle, it converts wine ratings into star ratings and analyzes them.
data-analysis kaggle-dataset python wine-reviews-dataset
Last synced: 03 Oct 2025
https://github.com/svetlanam/pycon-workshop
Pycon CZ workshop: Better data analyses and product recommendations with Instagram data
data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop
Last synced: 09 Apr 2026
https://github.com/blackcub3s/msc-finalthesis
The most important programming files, code functions and data processing pipelines for the Machine learning final thesis of my Master's degree. Also, the LaTeX code of the thesis.
data-analysis latex machine-learning numpy python sklearn
Last synced: 09 Apr 2026
https://github.com/alan-oliveir/state-of-data-2022
Neste projeto faço a análise da distribuição das faixas salariais para os profissionais de nível júnior para o cargo de analista, cientista e engenheiro de dados.
data-analysis jupyter-notebook pandas-python seaborn-python
Last synced: 03 Oct 2025
https://github.com/hemangsharma/hotel-revenue-booking-analysis
This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.
analysis data-analysis data-visualization hotel
Last synced: 10 Aug 2025
https://github.com/nuraj250/datainsighthub
A Node.js backend application that processes and analyzes personal user data to generate personalized insights and recommendations. It features secure user authentication, data upload and storage, custom algorithms for data analysis, and optional real-time notifications and third-party API integrations. Perfect for showcasing backend development
api-development backend-development bcrypt data-analysis data-analytics data-insights dotenv express jwt-authentication mongodb nodejs passport secure-api user-authentication
Last synced: 09 Apr 2026
https://github.com/erayagdogan/simplecharts
Simple Charts is a chart maker compose app with material 3 design. Charts are created using the lets-plot-compose library.
android android-app charts data-analysis data-visualization jetpack-compose lets-plot-kotlin material-3 viewmodel
Last synced: 11 Aug 2025
https://github.com/mindlessmuse666/eda-pandas
Проект по разведочному анализу данных (EDA) о пассажирах Титаника с использованием библиотеки Pandas. Включает в себя загрузку данных, предобработку, статистический анализ, визуализацию и создание сводных таблиц. Цель проекта - демонстрация основных методов и инструментов EDA для анализа и понимания данных.
data-analysis data-processing data-science data-visualization eda exploratory-data-analysis matplotlib pandas python titanic
Last synced: 18 Apr 2026
https://github.com/r12habh/canada-imigration-data-analysis
Dataset: Immigration to Canada from 1980 to 2013 - International migration flows to and from selected countries - The 2015 revision from United Nation's website. (Cognitive Class Data Analysis with Python)
canada data-analysis data-science data-visualization datascience python python3
Last synced: 23 May 2026
https://github.com/arun-data-analyst/finance-reporting-sql
End-to-end SQL project for project/portfolio finance: schema, seed data, validation, data-quality checks, business queries, and KPI views (Power BI–ready).
data-analysis data-modeling data-quality database finance kpi portfolio-management powerbi sql sql-server ssms
Last synced: 18 May 2026
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/baguilar6174/python-jupyter-notebooks
Explore data analysis projects with Python, Jupyter and more tools. Discover stunning visualizations and reveal meaningful information in datasets to make informed decisions.
data-analysis jupyter-notebook kaggle pandas python
Last synced: 09 Apr 2026
https://github.com/emmarhoffmann/analysis-of-sleep-patterns-and-psychological-well-being-among-college-students
Explores the relationship between sleep patterns, psychological well-being, and lifestyle choices among college students using statistical analysis on 253 observations.
college-students data-analysis r statistical-models
Last synced: 04 Oct 2025
https://github.com/Solrikk/PicTrace-Web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 15 Aug 2025
https://github.com/mothraa/etl-marketanalysis-webscraping
OC project 2
data-analysis etl python web-scraping
Last synced: 15 Aug 2025
https://github.com/ggarciajavier/udacity-dalf-project1-investigate-dataset
Work performed for the 1st project of Udacity Data Analyst Nanodegree: exploratory data analysis of a football dataset.
data-analysis football-analytics python python36 udacity-data-analyst-nanodegree
Last synced: 15 May 2026
https://github.com/douglasdavis/twaml
tW Analysis Machine Learning
data-analysis high-energy-physics machine-learning python
Last synced: 16 Aug 2025
https://github.com/sebastiansauer/hans-hackathon2025
Materials for a course on the evaluation of the AI student learn tool "HaNS"
Last synced: 04 Oct 2025
https://github.com/edoardotosin/january-2025-southern-california-wildfires-burn-severity-sentinel2
Scripts and data for analyzing burn severity of the January 2025 Southern California wildfires using Sentinel-2 satellite imagery. This project explores the use of the Differenced Normalized Burn Ratio (dNBR) and Relativized Burn Ratio (RBR) to classify burn severity, leveraging publicly available satellite data.
burn-severity copernicus data-analysis earth-observation satellite-imagery sentinel-2 wildfire wildfire-detection wildfires
Last synced: 09 Feb 2026
https://github.com/ccoolbaugh/individualized_cooling_data_analysis
Matlab code to analyze data collected during a brown adipose tissue individualized cooling protocol.
brown-adipose-tissue cold-exposure data-analysis ibutton matlab skin-temperature thermoregulation
Last synced: 18 Aug 2025
https://github.com/berkekaragoz/media-investments-data-analysis
Advertisement Investments Distribution of Turkey by Medium
Last synced: 19 Aug 2025
https://github.com/shadz23/smart-energy-dashboard
Power BI dashboard analyzing household electricity consumption to reveal usage patterns, peak hours, and estimated costs for smarter energy management and reduced bills. 🐙
chart data-analysis data-visualization dax energy-consumption hs110 hs300 ibm ibm-cloud influxdb jupyter-notebook kasa kp115 linuxone observability photovoltaics-dashboard plotly sense
Last synced: 19 Aug 2025
https://github.com/rahmamohammad/retail_project
Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.
data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling
Last synced: 17 May 2026
https://github.com/jailsonsb2/kit-analise-de-dados
🚀 Um kit de ferramentas Python para acelerar a análise de dados. Carregue arquivos de forma inteligente (CSV, Excel, etc.) e converta notebooks Jupyter para scripts de produção sem esforço.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados automation automations dados data-analysis data-cleaning etl etl-automation jupyter-notebook pandas powerquery python toolkit
Last synced: 29 Apr 2026
https://github.com/mjshubham21/ny_yellow_taxi_python_da_project
A data analysis project of New York Yellow Taxi (Feb of 2025) using Python and its libraries for analytics like : NumPy, MatPlotLib, Pandas and Seaborn.
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 08 May 2026
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/nickenshidqia/sql-for-financial-data-analysis
Design SQL queries to generate accurate and timely financial reports including Profit and Loss statements, Balance Sheets, and Cash Flow statements
azure-data-studio data-analysis finance microsoft-sql-server sql
Last synced: 09 Mar 2026
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/harshnevse/performance_analysis_of_solar_plants_in_india
A Data Analysis project using Tableau
Last synced: 03 Jan 2026
https://github.com/lauratrigo/fft_matlab
📡Análise de Fourier para Dados Ionosféricos é um script MATLAB que aplica FFT para gerar espectros unilaterais e bilaterais de parâmetros ionosféricos (hF, f0F2, hmF2), identificando periodicidades e comparando assinaturas espectrais com resolução de 15 minutos, útil para estudos de variações e distúrbios ionosféricos.
data-analysis fast-fourier-transform fft fourier ionosphere matlab scientific scientific-initiation
Last synced: 29 Aug 2025
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/nischay002/us-honey-production-analysis
Analysis of US honey production (1995–2021) using Python & data visualization. Identifies trends in honey yield, pricing, and colony distribution across states.
data-analysis data-visualization exploratory-data-analysis honey-production matplotlib pandas python seaborn us-agriculture
Last synced: 26 Feb 2025
https://github.com/singhs05/global-youtube-trends
Understand the impact of Likes, comments, dislikes on the video consumption for the videos that were trending.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/luminati-io/target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 04 Jan 2026
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MatLab scripts over multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 14 May 2025
https://github.com/mysftz/statistical-analysis
A in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 14 May 2025
https://github.com/scailfin/benchmark-templates
Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)
benchmarks data-analysis reproducibility
Last synced: 16 Jan 2026
https://github.com/iness000/online-retail-customer-segmentation
This project performs comprehensive customer segmentation analysis on an online retail dataset using machine learning clustering techniques and RFM (Recency, Frequency, Monetary) analysis. The goal is to identify distinct customer segments to drive better customer relationship management strategies and business insights.
customer-segmentation data-analysis k-means
Last synced: 31 Aug 2025
https://github.com/rdrahul123/ecommerce-sales-dashboard
This project focuses on analyzing e-commerce sales data to uncover actionable insights and improve business decision-making. Using interactive dashboards and data analysis techniques, the project evaluates key performance metrics, customer behavior, sales trends, and payment modes across different categories and regions.
data-analysis data-science excel powerbi
Last synced: 22 Mar 2025
https://github.com/zimmi48/nixpkgs-issues
Analysis on nixpkgs issue lifetime.
data-analysis github-api nixpkgs
Last synced: 10 May 2026
https://github.com/evanwporter/sloth
Faster Pandas Dataframe
cython data-analysis dataframe pandas
Last synced: 14 Mar 2025
https://github.com/hi-jin2/data-analysis-basics
데이터분석기초(R) 수업 중에 작성한 소스코드 모음입니다. 『모두를 위한 R 데이터 분석 입문』 교재를 통해 R언어를 학습하였습니다.
Last synced: 19 Jul 2025
https://github.com/akmj1011/hill-and-valley-prediction-using-logistic-regression
Created A Prediction System Using Logistic Regression For Figuring Out The Hall And Valley From The Given Datasets
cloud-computing data-analysis data-manipulation data-preprocessing data-transformation data-visualization google-colab
Last synced: 13 May 2026
https://github.com/farhad-here/median-performance-comparison
Benchmarking the performance of median calculation using vanilla Python vs NumPy.
data-analysis matplotlib numpy python
Last synced: 18 Apr 2026
https://github.com/satyam4229/omnify-dataanalysis
Our assessment of Omnify focused on data-driven strategies to maximize profitability. We identified "Product X" as the most profitable product and recommended leveraging the "Wellness Solutions" keyword category for optimal keyword strategy.
data-analysis data-science data-visualization excel omnify
Last synced: 04 Jan 2026
https://github.com/andrii04/ga4-gcs-to-bigquery-etl
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 18 May 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 29 Jul 2025
https://github.com/nevermendel/revolut-analysis
Python script to analyse Revolut transactions
data-analysis revolut revolut-analysis
Last synced: 12 Apr 2025
https://github.com/cnoret/retail-data-analysis
Let's analyze historical sales data from a large retail chain and predict weekly sales using machine learning on a Streamlit web app
data-analysis data-analyst data-science data-vizualisation pandas python streamlit streamlit-webapp
Last synced: 10 Apr 2026
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 24 Jan 2026
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/grlyntng/rpims
Django Code and documentation for the Retail Pharmacy Inventory Management System (best final year project award)
data-analysis django erp forecasting-models lstm-neural-networks reporting
Last synced: 26 May 2026
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics
This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.
ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset
Last synced: 02 May 2026
https://github.com/tenifayo/analysis-of-fordgobike-trip-data
Data Visualization using Ford GoBike Trip Data
data-analysis matplotlib pandas
Last synced: 11 Jul 2025
https://github.com/bationoa/how_does_a_bike_share_navigate_speedy_success
Bike rendting case study
analytics business-intelligence cleaning-data data-analysis data-collection data-visualization r
Last synced: 26 May 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026