Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/jcaperella29/jc_bioinformatics_hub
A personal hub to showcase my bioinformatics applications including RNA-Seq, ATAC-Seq, and miRNA-Seq analysis tools. Powered by simple HTML, CSS, and JavaScript with a biotech-themed design.
atac-seq bioinformatics biotech data-analysis github-pages portal rna-seq webapp
Last synced: 25 Feb 2026
https://github.com/simranjeet97/restaurant_data_analysis_covid_impact
Restaurant data analysis during the coronavirus period, checking the impact on food and restaurant sales and year-over-year (YOY) figures.
coronavirus covid-19 covid-impact data-analysis data-analytics data-cleaning data-manipulation data-science data-structures data-structures-and-algorithms database impact on restaurant-data-analysis restaurant-dataset restaurants
Last synced: 15 Apr 2026
https://github.com/aavishkarmahajan/sql
SQL code assignments and practice questions from SQL courses, SQL data analysis
Last synced: 07 Feb 2026
https://github.com/a-iceberg/whisper-timestamped
Timestamped ASR microservice
asr audio-to-text automatic-speech-recognition data-analysis data-science deep-learning docker fastapi mlops monitoring mssqlserver openai prompt-engineering python resource-management timestamps uvicorn-gunicorn whisper
Last synced: 31 Jan 2026
https://github.com/devexpress-examples/wpf-pivot-grid-connect-to-an-olap-datasource
This example shows how to specify connection settings to the server and create fields that relate to specific measures and dimensions of the cube for the Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf xpf
Last synced: 06 May 2026
https://github.com/auliannee/new-york-uber-pickups-analysis
This repository contains projects related to data collection, quality checks, manipulation, analysis, and visualization.
data-analysis data-science ipython-notebook jupyter-notebook python
Last synced: 07 Feb 2026
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 25 Feb 2025
https://github.com/jofaval/titanic-disaster
Data Analysis of the famous Titanic Disaster in 1912 with Machine Learning
classification data-analysis data-science data-visualization google-colab kaggle machine-learning python scikit-learn
Last synced: 15 Apr 2026
https://github.com/abelarduu/power_bi_analyst
Power BI project for financial data reporting, with intuitive navigation and interactive features. It offers a complete user experience, combining sophisticated presentation with effective functionality for data analysis.
dashboard data-analysis data-analytics modelagem-de-dados powerbi tratamento-de-dados
Last synced: 08 Sep 2025
https://github.com/nurulashraf/customer-segmentation-hierarchical-clustering
A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.
business-analytics clustering-algorithm customer-segmentation data-analysis hierarchical-clustering machine-learning python unsupervised-learning
Last synced: 18 Apr 2025
https://github.com/traore-07/fedex-sales-analysis
Analysis of FedEx sales transactions
data-analysis data-visualization sales-analysis tabeau
Last synced: 31 Jan 2026
https://github.com/cca/panopto-session-data
analyzing Panopto session data for retention purposes
data-analysis ipython-notebook video
Last synced: 07 Feb 2026
https://github.com/shafaq-aslam/pandas-lab
A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.
analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series
Last synced: 15 Apr 2026
https://github.com/ginanti-riski/streamlit_datapenyewaansepeda
Bike Sharing Analysis is a project that aims to understand bike rental patterns based on factors such as weather, season, and day. The project uses data analysis techniques to gain deeper insight into bike rental trends.
data-analysis data-analysis-python data-science data-visualization python streamlit
Last synced: 15 Apr 2026
https://github.com/abdullahashfaqvirk/powerbi-dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data-driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 10 Mar 2026
https://github.com/annnieglez/nlp-stock-market-and-news
This project focuses on detecting fake news from news headlines using advanced Natural Language Processing (NLP) techniques. It combines sentiment analysis with news headlines embeddings, generated from Hugging Face transformer models, to train a binary classification model that distinguishes between real and fake news.
classification-model data-analysis embeddings machine-learning machine-learning-models nlp nlp-deep-learning nlp-machine-learning python scraping-websites sentiment-analysis
Last synced: 25 Apr 2026
https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting
This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.
data-analysis data-visualization mechine-learing plotly python streamlit
Last synced: 07 May 2026
https://github.com/al-ghaly/iti-project
ITI Final/Graduation Project.
data-analysis data-cleaning data-visualization data-warehousing machine-learning power-bi python-data-analysis sql statistical-analysis
Last synced: 15 Mar 2025
https://github.com/llnl/cap
HPC workflow that automates the tedious actions of compiling, analyzing, and parsing with bincfg
data-analysis hpc python workflows
Last synced: 17 Jun 2026
https://github.com/biginformatics/git-basics
Hands-on Git and GitHub lessons for analysts and statisticians
data-analysis git github public-health training
Last synced: 10 Jun 2026
https://github.com/anshulkansal121/music_store_analysis_sql
SQL project to analyze online music store data
beginner-friendly contributions-welcome data-analysis database mysql sql
Last synced: 07 May 2026
https://github.com/ajmannust41288/data-analyst
Data Analyst, Microsoft Professional expert, Power BI Desktop, Tableau, and dashboards, with use of ChatGPT-4 AI
business-analytics data-analysis data-analyst data-analytics eda
Last synced: 01 Feb 2026
https://github.com/axsk/geekgraph
parse, cluster and visualize boardgamegeek.com user profiles
Last synced: 01 Feb 2026
https://github.com/bineet-ratna-shakya/data-science-salary-analysis
Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.
data-analysis data-science data-visualization jupyter numpy pandas python
Last synced: 01 Feb 2026
https://github.com/k178412/sql-data-warehouse-project
A hands-on data warehouse project using SQL Server, covering ETL processes and data modeling.
bronze-layer data-analysis data-analytics data-cleaning data-engineering data-warehouse database datalake dataset datawarehouse etl etl-pipeline etl-process gold-layer silver-layer sql sql-query sql-server sqlserver
Last synced: 25 Apr 2026
https://github.com/ludreinsalvador/life-expectancy-data-analysis
Contains Power BI dashboards analyzing global life expectancy trends, mortality rates, and health expenditures. Using a dataset sourced from Google Sheets, the project explores the impact of economic and healthcare factors on longevity.
dashboard data-analysis data-visualization healthcare-analysis life-expectancy powerbi
Last synced: 25 Feb 2026
https://github.com/riborings/python_projects
Python projects and other programming experiences
data-analysis machine-learning project python regression-analysis
Last synced: 08 May 2026
https://github.com/ejw-data/pandas-school
Analysis of school data with Pandas
Last synced: 08 May 2026
https://github.com/steviecurran/dashboards
Compilation of Links to the dashboards in the other repositories
dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau
Last synced: 21 Feb 2026
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/stas1f1/methods-and-models-for-multivariate-data-analysis
Completed tasks for the course on methods of multivariate data analysis, 1st year of master's, FDT ITMO
data-analysis multivariate-analysis python
Last synced: 10 Mar 2026
https://github.com/omkar2503/credit-risk-dashboard
A SQL-based Credit Risk Scoring System visualized using Metabase
credit-risk dashboard data-analysis data-analytics metabase postgresql sql
Last synced: 01 Jul 2025
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/fahamidur/cuisine-analysis
This project analyzes recipes from AllRecipes.com to reveal global cooking patterns, nutritional trends, and cultural food differences, offering data-driven insights for food enthusiasts and researchers.
beautifulsoup data-analysis datavisualization pandas selenium tableau-public webscraping
Last synced: 08 May 2026
https://github.com/tameronline/ai-financial-analyst
AI-driven financial analyst system utilizing LangChain and Ollama for real-time stock analysis, market trends, and financial insights.
ai data-analysis finance financial-analysis langchain machine-learning nlp ollama stock-market
Last synced: 02 Feb 2026
https://github.com/kernelshreyak/kaggle-notebooks
Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets
data-analysis data-science data-visualization kaggle kaggle-competition machine-learning
Last synced: 27 Apr 2026
https://github.com/manisharora96/instagram-reach-analysis
This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively
data-analysis data-visualization instagram-reach python-tools
Last synced: 23 Mar 2025
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program: 197 lessons in 9 parts. Advanced syllabus derived from Géron, VanderPlas, Huyen, ISLP, and Barocas/Hardt/Narayanan. A personal resource for learning, teaching, and continuous improvement.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 15 Apr 2026
https://github.com/amanraghuvanshi/adidas-western-zone-sales
Adidas United States Sales Report Analysis
data-analysis datatable pandas plotly statsmodels time-series
Last synced: 08 Feb 2026
https://github.com/suhail25/hotel-booking-analysis
Analyzed hotel booking cancellations and summarized insights for the hotel manager to increase profit by 30%. Demonstrated data exploration, cleaning, and analysis using Python and its libraries: pandas, seaborn, and matplotlib. Documented the results in a PDF report: reducing cancellations by 30% and releasing discounts for 10 days a month.
data-analysis ipynb-notebook matplotlib pandas python seaborn
Last synced: 08 Feb 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/prakashjha1/stock-investment-analysis
The Stock Investment Analysis project can help investors select better-performing stocks.
data-analysis data-science numpy pandas pandas-datareader parallel-programming python
Last synced: 08 May 2026
https://github.com/fahadnasir13/financial_data-analyzer_tool
A Python-based framework for analyzing, cleaning, and reconciling financial data stored in Excel workbooks.
data-analysis excel financial python store
Last synced: 17 Jun 2026
https://github.com/grindelfp/datasets-analysis
A task from the Machine Learning and Data Analysis course, dedicated to practicing data normalization and preprocessing.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/djm158/learning-microsoft-r
Working through https://www.gitbook.com/book/smott/introduction-to-microsoft-r-server/details and creating samples
data-analysis data-science microsoft microsoft-sql-server r
Last synced: 15 Apr 2026
https://github.com/josericodata/statisticsapp
Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.
alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test
Last synced: 26 Feb 2026
https://github.com/jedrzej-wydra/improving-accuracy
Improving accuracy of age estimates for insect evidence—calibration of physiological age at emergence (k) using insect size but without “k versus size” model
Last synced: 02 Sep 2025
https://github.com/wittyicon29/kritika-iit-b-2023
Selection task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy
In this project, we analyze space flight data for the Falcon 9 rocket of the space company SpaceX.
data-analysis data-science data-visualization eda elonmusk spacex
Last synced: 23 Mar 2025
https://github.com/an1mch1k-theone/project_1_hh_analyze
Project: analysis of résumés from HeadHunter
data-analysis data-analysis-project python
Last synced: 15 Apr 2026
https://github.com/barraharrison/airbnb-price-trends
Looking at how Airbnbs differ in price when it comes to location, room type and host activity
data-analysis data-science pandas plotly python streamlit
Last synced: 09 Feb 2026
https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python
In this project, earthquakes in Turkey from 1915 to 2021 were analyzed. The data taken from the data set, which you can find in the repo, was first organized using data cleaning algorithms. Afterwards, these cleaned data were printed out as graphics and animation using data visualization algorithms.
data-analysis data-cleaning data-visualization jupyter-notebook
Last synced: 23 Mar 2025
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Apr 2026
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 10 Mar 2026
https://github.com/evgeniyarbatov/singapore-streets
Exploring Singapore street names
data-analysis geospatial gis mapping osm python singapore street
Last synced: 15 Apr 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualize and analyze global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/haroontrailblazer/machine_learning
A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/purushothamadluru/kpi-driven-insights-dashboard-customer-churn-analysis
This repository features a Power BI project designed to deliver KPI-driven insights into customer churn patterns. Leveraging a robust dataset and advanced data modeling techniques, this project uncovers trends, identifies key drivers of churn, and enables businesses to make data-driven decisions.
customer-churn-analysis data-analysis insights-dashboard kpi powerbi
Last synced: 09 Feb 2026
https://github.com/riju18/data-analysis-and-visualizaton
Complex data analysis covering clustering, data preparation, complex calculations, joins, cross-overs, and more for data science.
data-analysis data-mining data-science data-visualization powerbi tableau
Last synced: 04 Jan 2026
https://github.com/animesh-chourey/power-bi
Various projects from my attempt to learn Power BI
business-analytics data-analysis data-visualization powerbi
Last synced: 10 Feb 2026
https://github.com/tushar2704/imdb-movie-analysis
This project extracts meaningful insights, trends, and patterns from the data, shedding light on various aspects of the movie industry. By leveraging this analysis, filmmakers, studios, and enthusiasts can gain valuable information to inform decision-making, understand audience preferences, and contribute to the creation of successful movies.
artificial-intelligence data-analysis data-science imdb project tushar2704
Last synced: 10 Feb 2026
https://github.com/gnneto/nf-analyzer
Python script to extract data from Electronic Invoices (Notas Fiscais Eletrônicas, XML) and generate a consolidated Excel file, focused on extracting financial information such as due dates and amounts for a more detailed and efficient analysis, while preserving numeric formatting.
data-analysis excel finance nf-analyzer pandas python xlm
Last synced: 16 Apr 2026
https://github.com/ezmiller/esd-viz
Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)
clojure data-analysis visualization
Last synced: 28 May 2026
https://github.com/0290192029/apartment-price-predictor
Python project for predicting apartment rental prices using linear regression. Practical work on the topic "Fundamentals of Machine Learning" for the discipline "MDK 13.01: Fundamentals of Applying Artificial Intelligence Methods in Programming".
apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn
Last synced: 08 May 2026
https://github.com/bcko/ud-da-eda-redwinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : Red Wine Quality dataset
data-analysis data-analyst-nanodegree exploratory-data-analysis r-markdown rstudio udacity udacity-data-analyst-nanodegree udacity-nanodegree
Last synced: 10 Feb 2026
https://github.com/jesuserro/ab-testing-ui-redesign-vanguard
A/B testing analysis to evaluate the impact of a user interface redesign at Vanguard.
a-b-testing data-analysis eda exploratory-data-analysis testing ui-design ux-design
Last synced: 08 Jul 2025
https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals
Analyzed data by creating an interactive dashboard using MS Excel
data-analysis data-cleaning data-visualization excel pivot-tables powerquery
Last synced: 11 Feb 2026
https://github.com/haonamnguyen/data-science-job-analysis
Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 16 Apr 2026
https://github.com/multitagging/benchmarks
Provides benchmarks to test the MultiTagging framework
benchmarks data-analysis ethereum smart-contracts vulnerabilities
Last synced: 11 Feb 2026
https://github.com/praveen-devknight/event-registration-analytics-dashboard
This project presents an interactive and visually-rich Power BI dashboard that analyzes registration data from a college-level technical and non-technical event, Teciton. The dashboard provides comprehensive insights into participant demographics, event preferences, food choices, and time-based trends.
data-analysis data-visualization excel powerbi sql
Last synced: 11 Feb 2026
https://github.com/bagusperdanay7/fcc-da-mean-variance-standard-deviation-calculator
One of the Data Analysis with Python (freeCodeCamp) tasks: creating a Mean-Variance-Standard Deviation Calculator.
data-analysis freecodecamp-project numpy python
Last synced: 06 May 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repository for storing exercises done in the Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/mborrillo/ranking-ciudades-espana
End-to-end multi-criteria analysis system that evaluates 50 Spanish cities on quality of life using official data
business-intelligence data-analysis multi-criteria-decision-analysis pandas python3 quality-of-life ranking-system scikit-learn scoring-models
Last synced: 13 Jan 2026
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 27 Feb 2026
https://github.com/zeshanfareed/spam_mail_detection_ml_django_project
Machine learning projects through Django
css data-analysis data-visualization django-framework frontend html machine-learning mailcsvfile matplotlib pandas project-repository projects python
Last synced: 11 Feb 2026
https://github.com/trim0500/fe-stats-classifier
An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.
creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton
Last synced: 11 Apr 2026
https://github.com/bala-1409/sql-projects
This repository contains Structured Query Language (SQL) scripts for various projects, covering data cleaning, data pre-processing, data processing, data transformation, and gaining insights through the query language.
data-analysis data-mining data-science data-transformation database eda etl-framework exploratory-data-analysis microsoft-sql-server query-language sql sql-server sql-server-database sql-server-management-studio
Last synced: 27 Feb 2026
https://github.com/ancapitigoi/portfolio
This repository is my portfolio containing past and current projects.
analitycs dashboard data-analysis data-cleaning data-mining data-visualization excel exploratory-data-analysis r-programming sql story-telling tableau
Last synced: 12 Feb 2026
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Feb 2026
https://github.com/mattsebastianh/Analyze-Data-with-Python-Portfolio-Project
Analyze Data with Python
barplot categories chi-square-test conservation contingency-table crosstab data-analysis data-cleaning-and-preprocessing eda endangered-species matplotlib national-parks pandas-dataframe species species-conservation
Last synced: 18 Jun 2026
https://github.com/preetesh21/spotme
This repository uses the web-based API provided by Spotify to retrieve data and then analyse it.
Last synced: 18 Jun 2026
https://github.com/praveingk/lipidanalysis
data-analysis data-visualisation
Last synced: 17 Mar 2025
https://github.com/koldlight/bluetab-data-science-2017
Repository for sharing material and publishing the challenges
course data-analysis data-science exercises
Last synced: 12 Feb 2026
https://github.com/kzon94/torn-market-analyzer
Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.
data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api
Last synced: 11 Apr 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/martachesnova/big-data
Finding out whether reviews from Amazon's Vine program are trustworthy. Performed ETL process in the Cloud and uploaded a DataFrame to an RDS instance. Used PySpark and Spark SQL to perform a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
Last synced: 16 Apr 2026
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for supplementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 28 Feb 2026
https://github.com/sabaasif2501/netflix-data-analysis
Exploratory data analysis of Netflix content using Python and pandas. Covers content types, genres, countries, and release years.
data-analysis netflix pandas portfolio-project python
Last synced: 08 May 2026
https://github.com/satvikpraveen/numpymasterpro
A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.
broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization
Last synced: 08 May 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026
https://github.com/aicorsair/python-case-study-365-data-science-subscription-purchase-prediction
This repository contains a comprehensive case study on predicting 365 Data Science customer subscriptions using real-world student engagement data.
data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization decision-tree feature-engineering feature-selection hyperparameter-optimization hyperparameter-tuning k-nearest-neighbors logistic-regression machine-learning purchase-prediction python random-forest scikit-learn statsmodels svc
Last synced: 08 May 2026
https://github.com/josewebdev2000/ztm-python-course
Challenges and Guided Projects from ZTM Python Course
automation data-analysis functional-programming oop python python3 regex scripting testing web-development
Last synced: 10 Jun 2026
https://github.com/virajbhutada/credit-card-transaction-analysis-sql
This project provides a structured database schema and SQL scripts to analyze credit card data. It includes tools for managing and analyzing transaction data, helping to identify spending patterns and trends. The project features visual schema diagrams and supporting documentation for easy understanding.
creditcard customer data-analysis data-cleaning data-modeling database database-management insights performance-optimization postgresql query-language schema-design schema-diagram scripts sql transactions trends
Last synced: 15 May 2026