Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/athul64/exploratory-data-analysis
To preprocess and analyze the given employee dataset, present the findings graphically, and derive meaningful insights to help better understand the company’s workforce.
colab-notebook data-analysis data-visualization matplotlib numpy pandas python seaborn statistical-analysis
Last synced: 25 Feb 2026
https://github.com/v6ntage/sql-sales_data-analytics-project
This repository contains a SQL scripts demonstration analytical techniques.
analytics business-analytics data data-analysis database query sql sql-server
Last synced: 12 Apr 2026
https://github.com/jahnavigupta06/zepto-delivery-customer-analytics
Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.
business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server
Last synced: 02 Aug 2025
https://github.com/thomasshikalepo/sql-data-warehouse-project
Building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics
data-analysis data-cleaning data-engineering data-lakehouse data-science data-warehouse data-warehousing datascience datawarehousing etl-pipeline medallion-architecture sql sql-query sql-server
Last synced: 02 Aug 2025
https://github.com/ituvtu/Data-Science-AB-Testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 26 Sep 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 02 Aug 2025
https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents
The "Data Analysis on Tamil Nadu Road Accidents" is a project deals with analysis of data on Road Accidents encountered by Tamil Nadu ( one of the states of India ) in the year of 2020 and 2021. But the dataset is most recently created (created on February 15, 2023 with source form TN Police).
dashboard data-analysis data-science data-visualization jupyter-notebook tableau
Last synced: 03 Aug 2025
https://github.com/juliusmarkwei/iris-dataset-analysis
Data analysis, data visualization and model training using the popular Iris Dataset
data-analysis data-visualisation linear-regression machine-learning
Last synced: 03 Aug 2025
https://github.com/pinedah/loan-approval-predictor-excercise
Proyecto de Machine Learning para predecir la aprobación de tarjetas de crédito utilizando dos datasets. Incluye limpieza, análisis exploratorio, imputación de datos sintéticos y modelado con algoritmos como Random Forest, Gradient Boosting y Árboles de Decisión.
data-analysis data-science decision-tree escom gradient-boosting machine-learning predictor random-forest school-project
Last synced: 11 Oct 2025
https://github.com/vimal0156/ruaroa-ai
🧙♂️ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.
ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization
Last synced: 09 Apr 2026
https://github.com/ganesh2409/cricket-player-performance
This repository contains a comprehensive project focused on analyzing cricket player performance using various datasets, including batting, bowling, and match results. The project involves data preprocessing, feature engineering, and model training to predict and evaluate player performance scores. It includes detailed scripts for data analysis
cricket-performance-analysis data-analysis machine-learning sports-analytics
Last synced: 05 Aug 2025
https://github.com/cuadernin/coffeeanalysis
Análisis de datos correspondiente a la tercera etapa de la certificación de Datacamp.
coffee data-analysis datacamp python
Last synced: 07 Aug 2025
https://github.com/nafisalawalidris/northwind-traders-sales-analysis
Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.
business-insights data-analysis database northwind-traders sales sql
Last synced: 07 Aug 2025
https://github.com/turquetti/projeto5-vamoai
Projeto final da Resilia + iFood <3
Last synced: 14 May 2026
https://github.com/the-tech-idea/beepdm
A Library for Managing your Connection to Different DataSources . Still in Alpha.please be patient
data-analysis data-management data-management-platform data-science database dataset information
Last synced: 08 Aug 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 08 May 2026
https://github.com/jayita11/atliqo-bank-credit-card-launch-eda
This project involves exploratory data analysis and statistical testing for AtliQo Bank's new credit card launch. Key insights include targeting high-income occupations and the 18-25 age group. Recommendations focus on tailored marketing campaigns, education, and incentives to enhance credit card adoption and usage among young adults.
data-analysis hypothesis-testing matplotlib p-value pandas python seaborn statistics z-test
Last synced: 09 Apr 2026
https://github.com/rafaumeu/super-trunfo
🎮 Super Trunfo Card Game in C | 🌍 Country Comparison | 📊 Data Analysis | 🎓 Educational Project
beginner-friendly brazilian-dev c-programming card-game cli-app command-line comparison-game computer-science data-analysis educational educational-project estacio game-development learning-c learning-to-code open-source programming-learning super-trunfo
Last synced: 10 Aug 2025
https://github.com/leocornus/leocornus-visualdata
JavaScript libraries to make data visualization simpler and easier.
data-analysis data-mining data-visualization data-visualization-simpler javascript-library
Last synced: 10 Aug 2025
https://github.com/filiplangiewicz/businessintelligence
🏭 Data warehouses and business intelligence project
airbnb business-intelligence data-analysis data-warehouse
Last synced: 09 Mar 2026
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/virajbhutada/walmart-retail-analyzer
Gain valuable insights into retail sales with the "Walmart Retail Performance Dashboard" in MS Excel. This user-friendly tool facilitates an in-depth analysis of key sales metrics, providing a comprehensive view of Walmart's performance. Make data-driven decisions for informed and strategic business outcomes.
analytics data-analysis data-science data-visualization excel insights interactive-visualizations performance-analysis retail-sales walmart
Last synced: 04 Mar 2026
https://github.com/Narius2030/Hive-DataWarehouse-Analysis
Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 12 Aug 2025
https://github.com/mahdi-eth/covid-analysis
Covid-19 data analysis project using python, numpy, pandas, matplotlib
data-analysis data-science python
Last synced: 13 Aug 2025
https://github.com/bcko/ud-da-eda-whitewinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset
data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree
Last synced: 03 Jan 2026
https://github.com/shibam120302/heart-disease-data-analysis-by-shibam
You can read more on the heart disease statistics and causes for self-understanding. This project covers manual exploratory data analysis
analysis data-analysis scraper
Last synced: 13 Aug 2025
https://github.com/musbi8788/free_python_book_for_gambian_dev
Free, beginner-friendly Python books for Gambian learners and devs 📘🐍
algorithms automation data-analysis data-science django flask machine-learning oops-in-python programming-language python python27 python3 testing-automation web-development
Last synced: 14 Aug 2025
https://github.com/nelsonkariuki/dataanalysis
This project involves data analysis of vido game sales from https://www.kaggle.com/gregorut/videogamesales/download
data-analysis data-visualization python
Last synced: 11 Jun 2026
https://github.com/mariam-badr-mb/student-score-prediction
This project predicts students' exam scores based on study-related and demographic factors using machine learning models.
data-analysis data-visualization explore linear-regression machine-learning mean-square-error student-project supervised-learning
Last synced: 16 Aug 2025
https://github.com/reusjimenez/sqlserver-data-analysis
Laboratorios prácticos sobre el análisis de datos utilizando SQL Server. 📊
data-analysis database-scripts joins queries sql sql-server stored-procedures transactions views
Last synced: 16 Aug 2025
https://github.com/floressek/data_analysis_and_visualization
This repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.
data-analysis data-visualization
Last synced: 05 Oct 2025
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 19 Apr 2026
https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning
An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.
class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote
Last synced: 02 May 2026
https://github.com/ilchen/eu_economic_data_analysis
Jupyter notebooks for analysis of Eurozone GDP, yields on government bonds, inflation expectations, unemployment and participation rates, money supply, personal consumption and savings, stock market. Using APIs from Eurostat, ECB, OECD and Yahoo-Finance.
data-analysis disposable-income finance gdp hicp inflation interest-rates jupyter-notebook money-supply participation-rate risk-free-interest-rate savings stock-market unemployment-rate
Last synced: 10 Oct 2025
https://github.com/pavelgrigoryevds/olist-deep-dive
🌊 Deep Sales Analysis of Olist E-Commerce: EDA | Time Series| Viz | RFM | NLP | Geospatial | Segmentation & Actionable Business Recommendations.
business-recommendations clusterization data-analysis data-analytics data-science deep-analysis e-commerce eda feature-engineering geospatial jupyter-notebook nlp pandas plotly preprocessing python rfm statistics time-series visualization
Last synced: 07 May 2026
https://github.com/airscholar/data_analysis_with_ai
A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python
chatgpt data-analysis gpt4 openai pandas pandasai python
Last synced: 10 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-implement-editable-aspxpivotgrid
This example demonstrates how to allow end-users to modify data cell values in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 09 Mar 2026
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 22 Aug 2025
https://github.com/mohamed3nan/udacity
Udacity Data Analysis Nanodegree Program
data-analysis data-visualization numpy pandas python
Last synced: 10 Apr 2026
https://github.com/realorangeone/docker-cyberchef
A containerized deployment of CyberChef, with additional protections
cyberchef data-analysis data-manipulation docker encoding
Last synced: 24 Aug 2025
https://github.com/mathurutkarsh/zomato_bangalore_restaurants_kaggle
Exploratory data analysis
data-analysis data-visualization machine-learning
Last synced: 25 Aug 2025
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/luminati-io/airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 04 Jan 2026
https://github.com/unnatmalik/dattavism-ai-powered-data-insight-generator-
Dattavism is an AI-powered data insight platform that transforms raw CSV files into comprehensive, contextualized reports—complete with visualizations, statistical summaries, and natural language insights. Dattavism is designed to handle datasets across diverse domains. it is Built using Python, Streamlit, Gemini API, Pandas, Matplotlib, NumPy,
data-analysis python streamlit
Last synced: 24 Jul 2025
https://github.com/jossimmar/ensa-ss25
Repositorio destinado al manejo de datos de consumo de los Clientes Mayores de ENSA del Grupo Distriluz.
data-analysis electrical-engineering python sqlite
Last synced: 30 Mar 2025
https://github.com/prernarohra/heart-disease-prediction
This project develops a machine learning model to predict heart disease risk based on symptoms and medical history. The model achieved the best accuracy with Logistic Regression, as it works well for binary classification problems.
artificial-intelligence data-analysis data-science dataset heartdisease-prediction machine-learning models
Last synced: 06 Nov 2025
https://github.com/cr-mao/machine-learning
机器学习笔记
data-analysis data-handling machine-learning math numpy pandas
Last synced: 12 Nov 2025
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 10 Apr 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel
This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.
capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql
Last synced: 20 Jun 2025
https://github.com/ibnaleem/cyberchef-discord
A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server
compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam
Last synced: 28 Jan 2026
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 18 Mar 2025
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/antonijn/polyfit
Fits a polygon to a given data input
c data-analysis linear-algebra toy
Last synced: 16 Jul 2025
https://github.com/hifza-khalid/book-management-system-sql
A Book Management System SQL project 📚 featuring tables for Authors ✍️, Books 📖, Customers 👤, and Orders 🛒. Includes sample queries for tracking book sales 💰, pricing by genre 🎭, and customer order history 📅.
book-management data-analysis database-management sql sql-queries
Last synced: 03 Feb 2026
https://github.com/gesiscss/wikipedia-language-olga-master
Measuring Gender Inequalities of German Professions on Wikipedia
bias crowdflower data-analysis data-science gender images python statistics wikipedia
Last synced: 10 Apr 2026
https://github.com/sauravbhattacharya001/biobots
A simple tool for checking 3D printer run statistics.
3d-bioprinting aspnet-web-api biobot bioinformatics bioprinting csharp data-analysis dotnet rest-api tissue-engineering
Last synced: 19 May 2026
https://github.com/satvikvirmani/engineering-graduate-salary-analysis
A Data Science/Machine Learning project to analyse and study salary patterns of a engineering graduate in India.
data-analysis data-science jupyter-notebook machine-learning prediction python regression
Last synced: 19 Jun 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/paezha/isdas
Companion package for An Introduction to Spatial Data Analysis and Statistics with R
data-analysis gis rstats spatial-analysis spatial-statistics
Last synced: 04 Jan 2026
https://github.com/5ekastanx/data-analysis
Extracting data from parsing, for example, like hacking using Python using all sorts of function methods
Last synced: 14 Mar 2025
https://github.com/matthewgrosman/messenger-analytics
Project that ingests Facebook Messenger conversations and generates analytics.
analytics data-analysis excel facebook facebook-messenger java mongodb
Last synced: 15 Apr 2025
https://github.com/kishlayjeet/zomato-data-exploration
In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.
data-analysis eda matplotlib numpy pandas zomato-data-exploration
Last synced: 10 Apr 2026
https://github.com/gf712/abpytools-qt
Qt interface of AbPyTools
antibody-numbering antibody-sequences cpp11 data-analysis python3 qt5
Last synced: 10 Apr 2026
https://github.com/greed2411/ndl
Numbers Don't Lie, attempt on Data Analysis using pandas and matplotlib.
cities data-analysis data-science data-visualization india kaggle
Last synced: 19 Apr 2026
https://github.com/vineet416/airbnb_data_analysis
Ineuron.ai Internship Project
data-analysis exploratory-data-analysis matplotlib numpy pandas-python python seaborn statistical-analysis statistics
Last synced: 10 Apr 2026
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 11 Apr 2026
https://github.com/ivanildobarauna-dev/api-to-dataframe
Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 06 Mar 2025
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/happybono/sonatasmooth
Provides three different noise reduction algorithms for smoothing out data : Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from a list and displays the results in another list.
algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms
Last synced: 30 Oct 2025
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a csv or excel file in a graph or charts form
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/ondrejhruby/countries-of-the-world
Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.
data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics
Last synced: 16 Apr 2026
https://github.com/luochang212/weibo-analysis
Data analysis based on sina weibo.
Last synced: 03 Apr 2026
https://github.com/ryannapp12/quant_trading_engine
A modular, and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.
algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies
Last synced: 11 Apr 2026
https://github.com/mokeddembillel/student-performance-prediction
Using Machine learning to predict a student final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 15 Mar 2025
https://github.com/ndohvich/ibm-data-science-professional-certificate
Kickstart your career in data science & ML. Build data science skills, learn Python & SQL, analyze & visualize data, build machine learning models. No degree or prior experience required.
coursera dash data-analysis data-science html5 ibm ibm-professional-certificate javascript machine-learnng python sql
Last synced: 16 Nov 2025
https://github.com/akankshaaa013/30-day-machine-learning-deep-learning
To practically Learn, Explore, and Share my Insights on the Libraries and Tools that power Machine Learning.
data-analysis machine-learning python
Last synced: 15 Mar 2025
https://github.com/ahmednasef3/udemy-courses-full-eda
Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.
data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project
Last synced: 01 May 2026
https://github.com/vitia-fritelle/analise_dieese
Análise realizada com base nos dados extraídos do site https://www.dieese.org.br/analisecestabasica/salarioMinimo.html
Last synced: 09 Apr 2025
https://github.com/vatshayan/hospital-discharge-analysis
Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc
data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn
Last synced: 04 Mar 2025
https://github.com/mafda/seattle_airbnb_data_analysis
This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.
crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data
Last synced: 29 May 2026
https://github.com/faisal-khann/diwali-sales-analysis
The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future
csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/sunnybibyan/call_centre_power_bi_dashboard
Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.
call-centre-analysis dashboard data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/luabagg/worldwide-trends
Worldwide Google Trends visualization and classification
data-analysis data-visualization google-trends trends
Last synced: 03 Feb 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expentancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 24 Jun 2026
https://github.com/equicirco/cirquant
Code and data delivering for quantifying circularity through open data and digital innovation.
circular-economy data-analysis database julialang official-statistics
Last synced: 13 Jan 2026
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 19 Apr 2026
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 18 Apr 2026
https://github.com/rcv911/lyapunov-indicators
Calculating Lyapunov indicators with multiprocessing in Python
data-analysis lyapunov lyapunov-indicators multiprocessing
Last synced: 18 Jan 2026
https://github.com/babak2/synthea-data-analysis
Synthea Data Analysis
data-analysis data-visualization jupyter-notebook jupytext matplotlib numpy pandas python3 seaborn synthea
Last synced: 11 Apr 2026
https://github.com/dcs-training/good-data-visualisation-with-r
Our guide on how we create data visualisations through R. Go to the readme file
data-analysis data-visualisation r rmarkdown
Last synced: 16 Jun 2026
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 13 May 2026