Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/bertiewooster/ipywidgets
Interactive data visualizations in a Jupyter Notebook per tutorial https://python.plainenglish.io/interactive-visualizations-with-pandas-seaborn-and-ipywidgets-173e5d7d6a5e
data-analysis data-science data-visualization ipython-notebook ipywidgets juypter-notebook python
Last synced: 06 Mar 2026
https://github.com/hyperentangledqubit/shellplot
shellplot -- Generate plot(s) directly from terminal via matplotlib or ggplot2 (plotnine)!
data-analysis ggplot2 graphics matplotlib plotnine plotting pyplot terminal
Last synced: 10 May 2026
https://github.com/alex-pierron/ekip-enedis-genai
Repository for the team "Ekip" during the H-GenAI Hackathon 2025 organized at SIA Partners, Paris, France
amazon-nova artificial-intelligence aws aws-lambda data-analysis database generative-ai mistral nlp
Last synced: 15 Apr 2026
https://github.com/ajmannust41288/data-analyst
Data Analyst ,Microsoft Professional expert,Desktop PowerBi ,Tablue and Dashboards with ChatGP4 AI uses
business-analytics data-analysis data-analyst data-analytics eda
Last synced: 01 Feb 2026
https://github.com/axsk/geekgraph
parse, cluster and visualize boardgamegeek.com user profiles
Last synced: 01 Feb 2026
https://github.com/emediongfrancis/unified-data-lake-implementation-gcp-kafka-airflow-snowflake
This project demonstrates the integration of data from multiple sources into a unified data lake. The project showcases the use of Apache Airflow for ETL tasks, Google Cloud Storage as a data lake, Apache Kafka for data movement automation, Snowflake for data warehousing, and Google BigQuery for analysis.
airflow data-analysis data-warehousing etl etl-pipeline gcp-storage kafka snowflake value variety
Last synced: 07 Feb 2026
https://github.com/tapas-gope/global-superstore-sales
This repository contains a Power BI dashboard designed to provide comprehensive insights into sales performance across various regions, segments, and products. The dashboard utilizes a variety of visualizations, including bar charts, line charts, maps, and tables, to effectively communicate key metrics and trends.
business-intelligence data-analysis data-modeling data-visualization financial-reporting powerbi sales-analysis
Last synced: 07 Feb 2026
https://github.com/wsu-carbon-lab/ezfit
Fitting in python made dead simple
data-analysis experimental-physics fitting pandas-accessor
Last synced: 14 Jun 2025
https://github.com/ludreinsalvador/life-expectancy-data-analysis
Contains Power BI dashboards analyzing global life expectancy trends, mortality rates, and health expenditures. Using a dataset sourced from Google Sheets, the project explores the impact of economic and healthcare factors on longevity.
dashboard data-analysis data-visualization healthcare-analysis life-expectancy powerbi
Last synced: 25 Feb 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/shrikantnaidu/sql-for-data-analysis
SQL for Data Analysis
data-analysis parch-and-posey postgresql
Last synced: 27 Feb 2025
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/keneandita/exploratory-data-analysis-eda-
Explore EDA on 5 datasets: Titanic 🚢, Heart Disease ❤️, Wine Quality 🍷, Car Price 🚗, and NBA Players 🏀. Includes data cleaning, preprocessing, and visualizations to uncover insights. Perfect for beginners to learn data analysis with Pandas, Matplotlib, and Seaborn! 🎨📈
data-analysis data-visualization eda matplotlib pandas python seaborn sklearn
Last synced: 15 Apr 2026
https://github.com/marina-gal/sql-business-questions
A collection of SQL queries designed to strengthen analytical problem-solving skills using the AdventureWorks2019 sample database. tested and optimized in SQL Server Management Studio (SSMS).
adventureworks data-analysis data-analyst interview-preparation learning microsoft-sql-server practice sql sql-queries
Last synced: 30 May 2026
https://github.com/0290192029/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction apartments-for-rent api correios-api data-analysis feature-engineering feature-enginering linear-regression linear-regression-models mlops numpy prediction-model r seaborn
Last synced: 08 May 2026
https://github.com/nagar2nd/jenson-usa-mysql-analysis
We are analyzing Jenson USA's dataset to gain valuable insights into customer behavior, staff performance, inventory management, and store operations. By crafting advanced SQL queries, the analysis explores key metrics such as product sales, customer spending, and order patterns, ultimately guiding strategic decision-making and operations.
data-analysis problem-solving sql
Last synced: 01 Feb 2026
https://github.com/sumit-sinha9/sales-analysis
Analyzing 12 months worth fo Sales data
data-analysis pandas python visualization
Last synced: 08 May 2026
https://github.com/prakshal0809/power-bi-analytics-dashboard
I have developed a dashboard in Power BI utilizing data from an Excel file. The dashboard effectively visualizes and analyzes the given data.
Last synced: 22 Feb 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/isaqueiros/newspapersales-predictions-linearregression_and_regularisation
This notebook is a study on the sales of newspapers of a local stand, with intention to predict the newspaper sales performance based on the different features available. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.
data-analysis data-science linear-regression machine-learning python regularization-methods sklearn-library sklearn-linear-regression
Last synced: 02 May 2026
https://github.com/khanovico/python-stock-analyzer
This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.
amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn
Last synced: 02 Feb 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-lite
A cookiecutter template for data journalism projects that offers a simplified and beginner-friendly structure.
cookiecutter data-analysis data-journalism project-template python
Last synced: 14 Jun 2025
https://github.com/abhisek-13/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.
data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 15 Apr 2026
https://github.com/amanraghuvanshi/adidas-western-zone-sales
Adidas United States Sales Report Analysis
data-analysis datatable pandas plotly statsmodels time-series
Last synced: 08 Feb 2026
https://github.com/suhail25/hotel-booking-analysis
Analyzed the cancelling of booking of hotels and summarized insights to the Hotel Manager to increase profit by 30%. Demonstrated data exploration, cleaning, analysis using Python and its libraries: pandas, seaborn, matplot. Documented the results in PDF report: reduced cancellation by 30% and releasing discounts for 10 days in a month.
data-analysis ipynb-notebook matplotlib pandas python seaborn
Last synced: 08 Feb 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/noorulhudaajmal/customer-segmentation-analysis
Customer segmentation and analysis of purchasing behaviour
cluster-analysis customer-segmentation data-analysis
Last synced: 07 Oct 2025
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/grindelfp/datasets-analysis
The Machine Learning and Data Analysis course task dedicated to training skills of data normalizing and preprocessing.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/mdaltamashalam/uber-fare-prediction-models
Predicts the fare amount of Uber rides based on various factors such as pickup/drop-off coordinates, passenger count, and trip distance.
catboost data-analysis data-cleaning data-visualization lgbm-regressor machine-learning matplotlib numpy pandas python random-forest regression-models skit-learn xgboost-algorithm
Last synced: 26 Feb 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/sabaasif2501/netflix-data-analysis
Exploratory data analysis of Netflix content using Python and pandas. Content types, genres, countries, and release years.
data-analysis netflix pandas portfolio-project python
Last synced: 08 May 2026
https://github.com/michalspano/maturitna-skuska-proj
Maturitná skúška 2021/2022 - objektívna spracovanie a analýza dát
Last synced: 19 Mar 2026
https://github.com/jkaardal/csvnav
A memory-efficient python class for navigating large CSV/text files.
csv data-analysis data-science machine-learning memory-management
Last synced: 14 Jan 2026
https://github.com/jweinst1/xenon
A processing based language
data-analysis interpreter reactive-programming
Last synced: 15 Apr 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/weisswuerste/polars-eurovision-analytics
Analytics example using both the Pandas and Polars libraries
data-analysis data-analytics pandas polars python python-3 python3
Last synced: 08 May 2026
https://github.com/sadia-khan13/data-preprocessing
Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.
artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn
Last synced: 11 Apr 2026
https://github.com/vatshayan/pokemon-analysis
Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/an1mch1k-theone/project_1_hh_analyze
Проект: анализ резюме из HeadHunter
data-analysis data-analysis-project python
Last synced: 15 Apr 2026
https://github.com/shubham200137/spotify-listening-habits-analytics
Spotify Listening Habits Analytics is a project aimed at analyzing personalized Spotify listening habits and music trends. It involves Exploratory Data Analysis (EDA) with Python Pandas, data processing using SQL Server, and creating visualizations with Power BI. The goal is to uncover insights into listening patterns, track popularity, and artist.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas power-bi-dashboard sqlserver
Last synced: 18 Mar 2026
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 26 Feb 2026
https://github.com/naninsv/apple-retail-sales-warranty-analysis
An advanced SQL project analyzing over 1 million rows of Apple retail sales data to solve real-world business problems, optimize query performance, and extract actionable insights. The analysis includes sales trends, warranty claims, product performance, and year-over-year growth
business-intelligence data-analysis data-science etl insights retailanalytics sql sqladvance
Last synced: 26 Feb 2026
https://github.com/fer-aguirre/covid19-venezuela
Análisis de datos de muertes por covid-19 en Venezuela
covid-19 data-analysis dataviz line-chart
Last synced: 09 Apr 2025
https://github.com/tatilimongi/first_python_project
Este repositório contém um estudo de caso de automação de planilhas em Python para análise de vendas de carros por fabricante ao longo dos anos
data-analysis email-sending file-manipulation graphical-visualization spreadsheet-automation
Last synced: 26 Mar 2025
https://github.com/josewebdev2000/ztm-python-course
Challenges and Guided Projects from ZTM Python Course
automation data-analysis functional-programming oop python python3 regex scripting testing web-development
Last synced: 10 Jun 2026
https://github.com/rajeev2806/netflix-data-analysis
In this project i have implemented ETL . I used netflix dataset to clean and analyze using postgresql and python
data-analysis data-cleaning postgresql python
Last synced: 15 Apr 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/mathusanm6/critics-vs-players-analysis
This data analysis examines the relationship between critic scores, sales (owners), player engagement, and pricing to determine the ROI of critic reviews.
data-analysis data-science data-visualization game-reviews games-sales jupyter-notebook python-3 steam-games
Last synced: 16 Apr 2026
https://github.com/haroontrailblazer/machine_learning
About This Repository A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/shruti23-ui/blinkit-powerbi-dashboard
A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.
data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql
Last synced: 09 Feb 2026
https://github.com/tillbiskup/trepr
A Python package based on the ASpecD framework for handling TREPR data.
data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science spectroscopy time-resolved
Last synced: 06 Sep 2025
https://github.com/aakk23/perfomance-dashboard-tableau
This Tableau dashboard provides an interactive analysis of Superstore sales data, covering key metrics like sales, profit, orders, and customer trends. It helps visualize business performance across product categories, customer segments, and geographic regions.
data-analysis data-visualization superstore-data-analysis tableau tableau-dashboards
Last synced: 10 Feb 2026
https://github.com/tushar2704/imdb-movie-analysis
This project extracts meaningful insights, trends, and patterns from the data, shedding light on various aspects of the movie industry. By leveraging this analysis, filmmakers, studios, and enthusiasts can gain valuable information to inform decision-making, understand audience preferences, and contribute to the creation of successful movies.
artificial-intelligence data-analysis data-science imdb project tushar2704
Last synced: 10 Feb 2026
https://github.com/1401dev/customer-lifetime-value-prediction
A data science project leveraging Python and Scikit-Learn to build predictive models that estimate customer lifetime value (CLV). Includes data cleaning, feature engineering, and model selection to identify key drivers of CLV, supporting strategic decision-making in customer retention and marketing.
clv clv-analysis customer-retention data-analysis dataprocessing feature-engineering machine-learning marketing-analytics predictive-modeling python regression-analysis scikit-learn
Last synced: 06 May 2026
https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis
About This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios using sql and tableau for visualization
data-analysis data-visualization sql tableau
Last synced: 27 Feb 2026
https://github.com/bcko/ud-da-eda-redwinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : Red Wine Quality dataset
data-analysis data-analyst-nanodegree exploratory-data-analysis r-markdown rstudio udacity udacity-data-analyst-nanodegree udacity-nanodegree
Last synced: 10 Feb 2026
https://github.com/saro0307/exploratory-data-analysis-terrorism
Phase 1 of Data Science project (program) to perform Exploratory Data Analysis on Terrorism using Python On Google Colab for Coderscave Internship sept 2023
colaboratory data-analysis datascience machine-learning numpy pandas python seaborn skit-learn visualization
Last synced: 13 Apr 2026
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/georgehanymilad/mobile-usage-behavior-analysis
Excel Project for Data Analysis
data-analysis data-visualization dataanalyst dataanalytics excel-dashboard pivot-tables powerquery storytelling
Last synced: 11 Feb 2026
https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals
Analyzed Data By Creating Interactive Dashboard Using MS Excel
data-analysis data-cleaning data-visualization excel pivot-tables powerquery
Last synced: 11 Feb 2026
https://github.com/nickenshidqia/startup-venture-funding-dashboard-data-analysis
The Startup Venture Funding Dashboard is a comprehensive visual representation of the dynamic landscape of startup funding, providing valuable insights into the top startups, funding round types, markets, startup statuses, and investor details.
dashboard data-analysis tableau tableau-dashboards
Last synced: 11 Feb 2026
https://github.com/shrutiijoshi/crm-sales-analysis
The dataset contained records exported from MavenTech's CRM from October 2016 to December 2017. It held details of opportunities with associated information such as product, account, and whether the sale was won or lost.
data-analysis data-visualization dax-functions powerbi powerquery
Last synced: 11 Feb 2026
https://github.com/vikktor93/proyecto-final-python-datascience
Dataset analysis of worldwide sales of video games on different platforms in 2020
data-analysis data-science jupyter-notebook kaggle matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/ohimoiza1205/mastercard-cybersecurity-simulation
Served as an analyst on Mastercard’s Security Awareness Team to identify and report security threats
cybersecurity data-analysis data-presentation security-awareness-training technical-security-awareness
Last synced: 11 Feb 2026
https://github.com/diligencefrozen/dcinside-data
Analyzing the Dcinside Frozen Gallery Dataset. #디시
Last synced: 30 May 2026
https://github.com/joemull/pyjade
A data curation script for the Jane Addams Digital Edition
data-analysis digital-humanities
Last synced: 11 Feb 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repositório para armazenar exercícios realizados na linguagem Python / Repository to organize exercises with Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 09 Apr 2026
https://github.com/rishitabansal9/adult-census-income-prediction
This is a project made for data analysis and income prediction using random forest classifier with 91% accuracy.
data data-analysis data-science feature-engineering random-forest-classifier
Last synced: 25 Mar 2025
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 27 Feb 2026
https://github.com/pedrosfaria2/analisandopostshn
Projeto para analisar as postagens da comunidade HackerNews
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 08 May 2026
https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation
A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.
data-analysis data-analysis-python machine-learning python random-forest
Last synced: 18 Mar 2026
https://github.com/dbriane208/python-for-data-science
Machine Learning and Data Science repository. Love crafting Machine Learning models.
data-analysis data-science data-visualization machine-learning numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/bala-1409/sql-projects
The repository contains Structured Query Language (SQL) Scripts. The Multiple SQL scripts for various projects which includes data cleaning, data pre-processing, data processing, data transformation and insights gaining through Query Language.
data-analysis data-mining data-science data-transformation database eda etl-framework exploratory-data-analysis microsoft-sql-server query-language sql sql-server sql-server-database sql-server-management-studio
Last synced: 27 Feb 2026
https://github.com/thlindustries/mortalidade_neonatal_python_react
Uma plataforma de visualização de dados montada utilizando Python e React com a library de visualização do Plotly
data-analysis data-visualization plotly python python3 react reactjs
Last synced: 16 Apr 2026
https://github.com/ancapitigoi/portfolio
This repository is my portfolio containing past and current projects.
analitycs dashboard data-analysis data-cleaning data-mining data-visualization excel exploratory-data-analysis r-programming sql story-telling tableau
Last synced: 12 Feb 2026
https://github.com/l1ght14/e-commerce-sales-analysis
Interactive Power BI dashboard analyzing e-commerce sales, profit trends, top products, and customer segments using the Sample Superstore dataset.
dashboard data-analysis powerbi
Last synced: 12 Feb 2026
https://github.com/shreshthvashisht/hiring-process-analytics
Statistics Using Excel
advanced-excel data-analysis data-science data-visualization excel hr-analytics statistics
Last synced: 27 Feb 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/an4pdm/relatorio-de-vendas
O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".
data-analysis data-visualization database etl powerbi
Last synced: 20 Jun 2026
https://github.com/rohitblaze10/-excel-_seller_store_analysis
A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!
data-analysis data-visualization excel excel-export
Last synced: 12 Feb 2026
https://github.com/koldlight/bluetab-data-science-2017
Repositorio para compartir material y publicar los retos
course data-analysis data-science exercises
Last synced: 12 Feb 2026
https://github.com/nhoiyee/other-python-projects
using Python in Jupyter Notebook
data-analysis data-engineering data-mining jupyter jupyter-notebook jupyter-notebooks python python3
Last synced: 12 Feb 2026
https://github.com/krzysikd/uber_fare_prediction
Predicting uber fares using advanced machine learning models and feature engineering techniques
data-analysis data-processing eda hyperparameter-tuning jupyter machine-learning regression-models
Last synced: 02 Apr 2025
https://github.com/yalai92/alfalfa_imp_exp_analysis
This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.
data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau
Last synced: 12 Feb 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/martachesnova/big-data
Finding out whether reviews from Amazon's Vine program are trustworthy. Performed ETL process in the Cloud and uploaded a DataFrame to an RDS instance. Used PySpark and Spark SQL to perform a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
Last synced: 16 Apr 2026
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for suppelementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 28 Feb 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that allows them to make a data-driven guess when encountering words that they aren't sure where to stress
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 20 Jun 2026
https://github.com/ryan-wong1/nyc-job-postings-data-analysis
City of New York Current Job Postings 2024
data-analysis data-cleaning exploratory-data-analysis sql
Last synced: 13 Feb 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026