Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.
Last synced: 11 Feb 2026
https://github.com/ancapitigoi/portfolio
This repository is my portfolio containing past and current projects.
analitycs dashboard data-analysis data-cleaning data-mining data-visualization excel exploratory-data-analysis r-programming sql story-telling tableau
Last synced: 12 Feb 2026
https://github.com/shreshthvashisht/hiring-process-analytics
Statistics Using Excel
advanced-excel data-analysis data-science data-visualization excel hr-analytics statistics
Last synced: 27 Feb 2026
https://github.com/rohitblaze10/-excel-_seller_store_analysis
A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!
data-analysis data-visualization excel excel-export
Last synced: 12 Feb 2026
https://github.com/koldlight/bluetab-data-science-2017
Repositorio para compartir material y publicar los retos
course data-analysis data-science exercises
Last synced: 12 Feb 2026
https://github.com/yalai92/alfalfa_imp_exp_analysis
This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.
data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau
Last synced: 12 Feb 2026
https://github.com/projects-developer/ransomware-prediction-using-machine-learning-project
The project aims to develop a machine learning-based system to predict and detect ransomware attacks on computer systems. Ransomware is a type of malware that encrypts a victim's files and demands a ransom in exchange for the decryption key. Project Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials
artificial-intelligence btechproject computerscienceproject cybersecurity-malware data-analysis data-mining deep-learning machinelearning mtechproject neural-networks ransomware-machine-learning
Last synced: 12 Feb 2026
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for suppelementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 28 Feb 2026
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 13 Feb 2026
https://github.com/malakaburamila/power-bi-dashboards
A portfolio of interactive Power BI dashboards I developed, showcasing data visualization, analytics, and data-driven insights.
amazonsalesanalysis analytics dashboards data-analysis data-visualization datasets hranalytics power-bi
Last synced: 14 Feb 2026
https://github.com/kambleakash0/mubi_eda
Mini Project #1 for EAS503 course at SUNY Buffalo
data-analysis data-visualization eda
Last synced: 16 Apr 2026
https://github.com/guermoud98/data-analysis-with-python-projects
data-analysis matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/k-bloch/car-theft-analysis
A dashboard created to inform the public about car theft, providing insights extracted from real-world police stats.
data-analysis maven-analytics tableau
Last synced: 19 Mar 2026
https://github.com/devexpress-examples/aspxpivotgrid-group-date-time-values
This example shows how to group date-time values in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 01 Mar 2026
https://github.com/arnoudbuzing/iowa-caucus
Data Analysis on 2020 Iowa Caucus results
caucus data-analysis iowa iowa-caucus mathematica primaries primary-election wolfram-language
Last synced: 01 Mar 2026
https://github.com/oscarmtr/metrov
Interactive viewer for tropospheric meteorological soundings
climate data-analysis meteorology skew-t soundings temperature tropospheric web
Last synced: 01 Mar 2026
https://github.com/aleks-andrs/bigdataanalytics
Public repository for CM3111: Big Data Analytics Coursework (Meteorite landings analysis)
data-analysis data-science machine-learning
Last synced: 02 Mar 2026
https://github.com/yash22222/pwc-power-bi-virtual-case-experience
The Power BI PwC Virtual Case Experience is an exciting and educational program designed to provide participants with hands-on exposure to Power BI, a prominent business intelligence and data visualization tool, within the context of consulting at PwC.
business-analyst business-analytics business-intelligence dashboard data-analysis data-analyst data-analytics dax microsoft-power-bi powerbi powerbi-dashboards powerbi-visuals pwc
Last synced: 02 Mar 2026
https://github.com/mayankyadav23/amazon-sales-data-analysis
Diving into Amazon sales data to uncover hidden gems! 📈 Analyzing iNeuron's dataset to optimize sales strategies and boost performance 💡 Driving business growth with data-driven decisions! 💻
amazon data-analysis data-visualization ineuron-ai internship-project
Last synced: 02 Mar 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If its useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/mugambi645/exploring-ebay-car-sales-data
Exploring ebay car sales dataset
car-sales data-analysis numpy pandas
Last synced: 16 Apr 2026
https://github.com/ibrahimceyisakar/hotel-finder-streamlit-dashboard
Streamlit dashboard of hotel-finder
data-analysis data-science data-visualization pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/grindelfp/logistic-regression-study
Example of logical regression data analysis and exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/banner-19/extraction-and-analysis-of-text
The objective is to analyze text content from a list of URLs. This involves extracting article titles and text, then performing natural language processing to generate metrics like sentiment, readability, and word usage. Finally, the results are stored for further analysis or visualization.
data-analysis data-analytics data-science nlp nltk python3 text-analysis text-extraction
Last synced: 03 May 2026
https://github.com/dpb24/netflix-global-top-10-performance
Using Machine Learning to predict Netflix Global Top 10 viewership trends (Python & R)
data-analysis data-science data-visualization decision-tree-regression gradient-boosting-regressor machine-learning media netflix predictive-analytics predictive-modeling python r random-forest random-forest-regression regression-models sklearn streaming-video xgboost-regression
Last synced: 16 Apr 2026
https://github.com/santiago-giordano/ahora12project
Excel, SQL and Python processing from excel files
data-analysis excel jupyter-notebook microsoft-sql-server pandas sql sqlalchemy sqlserver
Last synced: 16 Apr 2026
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 04 Mar 2026
https://github.com/danpoynor/omdb-api-data-analysis
Gathers data for Oscar-winning movies using their IMDB ids, saves the information to a CSV file, and answers a few data analysis questions about the movies using JupyterLab.
analytics csv data-analysis jupyter-notebook matplotlib omdb-api pandas-dataframe python-dotenv python3 seaborn-plots
Last synced: 16 Apr 2026
https://github.com/e1washere/weather-spark-pipeline
Scalable pipeline using Apache Spark to process and analyze weather data.
apache-spark batch-processing big-data data-analysis data-engineering data-pipeline data-processing etl python spark-sql weather-data
Last synced: 17 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/ruajean/netflixmoviescraper
🎬 A powerful tool for gathering movie data and user reviews from FilmAffinity's Netflix category. This script scrapes movie details and iterates through user reviews, saving structured information to a CSV file for analysis. Ideal for insights into user sentiments and movie popularity on FilmAffinity.
data-analysis data-visualization dataset jupyter-notebook python scraping
Last synced: 17 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/shimazadeh/ft_linear_regression
Implementing a modular linear regression from scratch to predict the price of cars using a gradient descent algorithm.
data-analysis data-science hyperparameter-tuning linear-regression predictive-modeling
Last synced: 03 Jun 2026
https://github.com/nathaliacosim/migration-patrim
Automação para extração, conversão e migração de dados patrimoniais para o sistema patrimônio cloud da betha sistemas. O projeto garante um fluxo estruturado e seguro de transferência de informações, utilizando C# (.NET Framework), PostgreSQL e integração via API.
conversion-tool data-analysis data-conversion data-transformation dotnet dotnet-code dotnet-console-app migration-tool
Last synced: 17 Apr 2026
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Prédire si un invidu est positif au COVID19 ou non.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/phanchenh/businessdashboard_powerbiproject_pizzadataset
Pizza Business Performance Analysis and Growth Strategies 2015
business-analytics business-intelligence dashboard data-analysis data-visualization insights powerbi python
Last synced: 17 Apr 2026
https://github.com/sevilaymuni/project-no.3-seaborn-plots
Pandas and Seaborn Mediated Comprehensive Analysis on Differentiated Thyroid Cancer
data-analysis data-structures data-visualization mathplotlib pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/awanraskall/retail-demand-analysis
Data analysis of retail meal orders, fulfillment centers, and product demand using Python
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/prakhar-ff13/finding-donors-for-charityml
Udacity Machine Learning Engineer Nanodegree project 2
data-analysis data-science machine-learning supervised-learning udacity udacity-machine-learning-nanodegree udacity-nanodegree
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/kwokhing/visualizing-datasets-with-facets
Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative
anaconda data-analysis data-visualization facets jupyter-notebook missing-data open-source python skewness unbalanced-data visualisation visualization
Last synced: 18 Apr 2026
https://github.com/bolshovaelizaveta/covid19_spark_analysis
Учебный проект по дисциплине 'Базы данных для компьютерного зрения'. Разработка аналитической платформы для эпидемиологического мониторинга COVID-19 с использованием Apache Hadoop и Spark
apache-hadoop apache-spark covid-19 data-analysis jupyter-notebook machine-learning medical-imaging pyspark sql
Last synced: 18 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/scanf-s/basic_dataanalysis
data-analysis jupyter-notebook matplotlib pandas python
Last synced: 19 Apr 2026
https://github.com/souraevshing/data-science-01
Data analysis using jupyter notebook.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 19 Apr 2026
https://github.com/ajmannust41288/python-developer
all about python backend ,data analytics,Data-Visualization
artificial-intelligence data-analysis data-science data-visualization-project deeplearning machine-learning machine-learning-algorithms matplotlib mongodb-database mongoose mongoosejs pandas-python python-library python-packages python-script python3 seaborn-plots
Last synced: 19 Apr 2026
https://github.com/mlucifer27/bilateral-visualization
Streamlit app visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.
d3blocks data-analysis data-visualization plotly-python python streamlit
Last synced: 04 Jun 2026
https://github.com/kmbuki/uk_police_data
R programming - Using open data about crime and policing in England, Wales and Northern Ireland.
data-analysis data-visualization r
Last synced: 04 Jun 2026
https://github.com/nikolaos-mavromatis/etf-data-analysis-dashboard
Insights into SPY ETF performance with an interactive Streamlit dashboard powered by Alpha Vantage data.
api data-analysis data-visualization financial-analysis pandas plotly python streamlit
Last synced: 20 Apr 2026
https://github.com/misaghmomenib/soccer-match-analysis
This Project Predicts Football Match Outcomes (Home Win, Away Win, or Draw) Using Historical Match Data. It Involves Data Preprocessing, Exploratory Analysis, and Training a Random Forest Model to Predict Results Based on Features Like Shots, Possession, and Passes.
data-analysis git open-source python
Last synced: 20 Apr 2026
https://github.com/jbalooshie/school_district_analysis
Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.
data-analysis data-science dataframes jupyter-notebook numpy pandas python
Last synced: 20 Apr 2026
https://github.com/hugo-hattori/customer_profile_analysis
Data Analysis Project.
data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python plotly plotly-express plotly-io python
Last synced: 20 Apr 2026
https://github.com/ak-pydev/python_practice
Documenting my learning journey from python -> ML -> DL -> LLM/GenAI -> Agents exercises solved daily from Udemy/Kaggle/YouTube.
data-analysis data-science feature-engineering llms machine-learning mlflow mlops-workflow modeling python3 streamlit uvicorn
Last synced: 20 Apr 2026
https://github.com/robinmillford/hr-analytics-employee-performance-analysis
HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.
data-analysis data-visualization jupyter-notebook powerbi python3 sql
Last synced: 20 Apr 2026
https://github.com/docuvesta/la-mer-skincare-chicago-duty-free-analysis
Comparing La Mer product selection, availability and pricing from 3 different purchase locations ✈️
analytics cremedelamer data-analysis data-analytics data-science data-visualization lamer luxury plotly python seaborn skincare
Last synced: 21 Apr 2026
https://github.com/florence-nyokabi/house-power-consumption
Machine Learning: Exploring Regression Analysis
data-analysis data-cleaning data-science data-visualization feature-engineering jupyter-notebook jupyterlab machine-learning pandas-python regression-analysis regression-models
Last synced: 05 Jun 2026
https://github.com/anushkundu/churn-prediction
Telecom Customer Churn Prediction Using Machine Learning!
accuracy-score classification-algorithm classification-report data-analysis data-science deep-learning gradient-boosting-classifier keras-tensorflow logistic-regression machine-learning random-forest-classifier recall-precision roc-auc-score smote-sampling svm-classifier
Last synced: 21 Apr 2026
https://github.com/danpoynor/data-analysis-spotify-songs-2010-2019
Spotify data analysis for songs between 2010 and 2019 using Jupyter Notebooks including pandas and Seaborn plots.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots spotify
Last synced: 22 Apr 2026
https://github.com/robinmillford/optimizing-treatment-plans-through-data-analysis
The primary focus was on understanding customer health, treatment, and associated charges over multiple years.
data-analysis data-visualization healthcare mysql powerbi sql
Last synced: 22 Apr 2026
https://github.com/rajesh9943/sentiment-analysis-of-consumer-opinions-on-amazon-products
Developed a comprehensive Sentiment Analysis System aimed at classifying Amazon product reviews into positive, neutral, and negative sentiments. The project leveraged advanced Natural Language Processing (NLP) techniques alongside machine learning algorithms to deliver accurate and actionable insights from customer feedback
amazon data-analysis data-manipulation data-preprocessing data-presentation data-visualization machine-learning nlp nlp-library nltk product-reviews-analysis sentiment-analysis sklearn-library word-cloud-generator-in-python-3
Last synced: 05 Jun 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/al-ogr/sf_pr1_job_analysis_hh
SkillFactory DataScience PROJECT-1. Анализ резюме из HeadHunter
data-analysis data-science ipynb plotly python
Last synced: 23 Apr 2026
https://github.com/ihnokim/datk
Data Analysis Toolkit (DATK)
data-analysis data-engineering data-science deep-learning image-processing pandas signal-processing
Last synced: 24 Apr 2026
https://github.com/strixion/demoversion_ai
The demoversion of StrixionAI
ai csv data-analysis data-analytics json python txt
Last synced: 24 Apr 2026
https://github.com/datalopes1/bank_marketing
Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/yxuco/ethdecoder
This CLI decodes Ethereum transactions and events, stores results in CouchDB, and then exports customized views to CSV files for data visualization and analysis.
data-analysis decoding ethereum
Last synced: 24 Apr 2026
https://github.com/pedrohdosanjos/economic-data-analysis
This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.
api data-analysis data-visualization jupyter-notebook python
Last synced: 24 Apr 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017, around the topic of "Product Title Quality". The goal is to take SKUs and rank its title's clarity and conciseness. Referenced papers are attached to this repository. And as such, the aim is to craft ensemble models that either try to replicate results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/ddihora1604/iit_patna
A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.
data-analysis data-visualization datamodelling machine-learning-algorithms python
Last synced: 25 Apr 2026
https://github.com/devexpress-examples/winforms-create-a-custom-exporter-for-pivotgridcontrol-with-xtrareport
This example illustrates how to dynamically create a custom report based on PivotGridControl content in WinForms.
data-analysis dotnet pivot-grid pivot-grid-for-winforms winforms
Last synced: 26 Apr 2026
https://github.com/ys1f/geothermal_project
Geothermal Data Analysis & Visualization for Texas – well data, temperature gradients & zone mapping
bht bottom-hole-temperature data-analysis folium geopandas geospatial geothermal gis interpolation irena jupyter-notebook mapping python rasterio spatial-analysis temperature-gradient texas visualization well-data zone-mapping
Last synced: 26 Apr 2026
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 27 Apr 2026
https://github.com/pararang/nams-thesis-fuzzy
A specialized data processing tool designed to help with Fuzzy Delphi Method calculations for thesis research data analysis. Then extended with some new features for data processing with different method.
data-analysis dematel hacktoberfest hacktoberfest-accepted house-of-quality python sustainability vibecoding
Last synced: 27 Apr 2026
https://github.com/sohamb21/analysis-of-superstore-dataset
I completed the IBM SkillsBuild Data Analytics Internship Program to develop my Data Analytics skills and apply them to a real-world problem by working on this project.
Last synced: 27 Apr 2026
https://github.com/busesimsek/sql-projects
A collection of my SQL projects with insights into real-world datasets.
data-analysis data-analytics mysql sql
Last synced: 07 Jun 2026
https://github.com/garcane/exodus_analysis
This project analyses cryptocurrency transaction data exported from the Exodus wallet. The goal is to explore and visualize the inflows and outflows of assets, the types of transactions, and other key metrics over time.
bitcoin btc crypto cryptocurrencies cryptocurrency data-analysis data-visualization eth ethereum pandas seaborn
Last synced: 27 Apr 2026
https://github.com/hfzdzakii/dicoding-airqualityanalysisdata
This repo is a master submission for my Dicoding Final Project. Air Quality Dataset is being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
data-analysis data-visualization streamlit
Last synced: 27 Apr 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/simranshaikh20/diwali-sales-analysis-for-business-insights
A data analyst project on diwali sales . In this state according state , gender, age we are able to know how much sale it done.
data-analysis data-visualization python
Last synced: 28 Apr 2026
https://github.com/hadson0/chess-live-ratings-data
A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.
beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping
Last synced: 28 Apr 2026
https://github.com/elmezianech/autoinventory
This project is an end-to-end, fully automated warehouse management solution designed to tackle real-world inventory challenges in the FMCG sector. From real-time data ingestion and predictive analytics to interactive dashboards, this project combines cutting-edge technologies and an event-driven architecture to simulate a business-ready system.
automation dashboard data-analysis data-engineering-pipeline docker etl glue-job inventory-management kafka kpis lambda-functions lstm ml-pipeline mlflow power-bi pytorch redshift s3 streamlit warehouse-management
Last synced: 28 Apr 2026
https://github.com/abdeldjalilchafai/us-flight-delay-eda
Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.
data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach
Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.
data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost
Last synced: 28 Apr 2026
https://github.com/bala-1409/titanic-survived-prediction-datascience-classification-project
This projects predicts whether a passenger on the titanic survived or not using machine learning algorithms with the given details of the passenger data.
classification-algorithm data-analysis data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning-algorithms matplotlib predictive-modeling python3 seaborn
Last synced: 28 Apr 2026
https://github.com/prady2309/sales-prediction-using-python
Implemented using Multiple Linear Regression
data-analysis data-science machine-learning python
Last synced: 29 Apr 2026
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 29 Apr 2026
https://github.com/marcinz20/anomaly-detection-in-credo-dataset
University project, which goal is to build a system, that detects anomalies in CREDO dataset
credo data-analysis data-science encoder-decoder-model jupiter-notebook pca-analysis python3
Last synced: 29 Apr 2026
https://github.com/vanshuchaudhary/zomato
This Jupyter Notebook contains an exploratory data analysis (EDA) of Zomato restaurant data. It includes data cleaning, visualization, and insights into restaurant ratings, pricing, cuisine distribution, and location-based trends.
business-analytics data-analysis data-mining data-science data-visualization datascience matplotlib pandas-dataframe pandas-python python python-3 python-library
Last synced: 29 Apr 2026
https://github.com/jakebrehm/ezpz-plotting
📈 Easily visualize and manipulate plots from multiple data files.
data-analysis data-visualization engineering matplotlib matplotlib-pyplot pandas plotting python python-3 software software-engineering tkinter tkinter-gui
Last synced: 29 Apr 2026