Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/bala-1409/loan-clustering-datascience-projects
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning
Last synced: 22 Mar 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 06 May 2026
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - DIEEI@unict.it - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 04 Nov 2025
https://github.com/beolawork-art/novabank-churn-analysis
NovaBank has noticed that customers are closing accounts or going inactive, and they want to understand why.
data-analysis data-science-projects data-visualization eda machine-learning numpy pandas python scikit-learn sql
Last synced: 08 Apr 2026
https://github.com/ragedunicorn/mantisx-notebook
A repository for Jupyter notebooks analysing mantisx data
data-analysis data-visualization mantis mantisx shooting training
Last synced: 24 Jul 2025
https://github.com/jakebrehm/lemons
🍋 A Python package which makes building GUIs easy peasy lemon squeezy.
data-analysis data-science gui python python3 python37 tkinter tkinter-gui tkinter-python
Last synced: 27 Mar 2025
https://github.com/smohanta23/ev-trendanalytics-24
This Tableau project analyzes EV adoption trends using data up to May 2024. Visualizations cover growth, geography, market share, CAFV eligibility, and consumer preferences, supporting data-driven decisions with detailed drill-downs. Data is meticulously cleaned, offering stakeholders valuable insights into EV market dynamics and trends for future.
business-intelligence data-analysis data-engineering electric-vehicles feature-engineering kpianalysis predictive-analytics tableau trendanalysis
Last synced: 27 Mar 2026
https://github.com/dcs-training/much-ado-about-nothing-missing-data-in-research
Repo for the Much ado about nothing workshop. Go to the Readme file
data-analysis data-cleaning data-wrangling r
Last synced: 15 Jun 2025
https://github.com/vladstudennikov/diabetes-prediction-app
ML-powered web app built with Laravel and Vue.js to predict diabetes risk based on users' daily habits and behavior
cypress data-analysis diabetes-prediction fastapi inertiajs laravel matplotlib medicine ml pandas php scikit-learn seaborn vuejs
Last synced: 08 Apr 2026
https://github.com/wb-az/sql-for-data-analysis-udacity
This repository contains SQL queries for the SQL for Data Analysis given by Udacity. The queries include commands to define, select, manipulate, control access, aggregate, and join data and data tables.
aggregation data-analysis data-cleaning erd joints postgresql sql subqueries-and-joins window-functions-in-sql
Last synced: 23 May 2026
https://github.com/hrolive/patc-big-data-analytics-bsc
Introduction to the main concepts and technologies related to Big Data and Data Analytics and its applications to real projects.
analytics bias big-data data-analysis hadoop hpc machine-learning mapreduce nosql python spark spark-streaming visualization
Last synced: 12 Apr 2026
https://github.com/yousef-jaber-abdelaziz/electrical-vehicles-data-analysis-project
A full stack Data Engineering f\project from Getting the data to the Data warehousing and then the Dashboard using Power BI
data-analysis data-engineering data-modeling data-visualization data-warehouse data-warehousing fabric microsoft-azure microsoft-fabric-data-engineer powerbi sql-server
Last synced: 23 Jun 2026
https://github.com/esr-style/stylegrid
A free alternative to AG grid built by me for personal use case.
aggrid data-analysis grid pivot-chart pivot-grid table
Last synced: 16 Sep 2025
https://github.com/dsarceno/portfolio
Portafolio de Científico de Datos. Proyectos realizados por Diego Sarceño.
computer-vision data-analysis data-science deep-learning docker graph-algorithms investing keras-tensorflow machine-learning markdown neural-networks optimization-algorithms pipelines python sentiment-analysis sklearn tensorflow voice-recognition
Last synced: 06 Mar 2026
https://github.com/ansh-info/literaturesurvey
Literature Survey Engine, leverages the powerful Semantic Scholar's Recommendation API to provide you with highly relevant research article recommendations based on your curated lists of articles.
api api-integration automation data-analysis data-visualization docker docker-compose literature-survey machine-learning mysql paper-recommendations python recommendation-system research-tools semantic-scholar streamlit zotero
Last synced: 10 Apr 2026
https://github.com/faris771/identify_customer_segments
This project is part of the Palestine Launchpad by Spark, and Udacity with Google. It uses unsupervised learning to identify customer segments for a mail-order company in Germany. The goal is to direct marketing campaigns towards the most promising audiences. The data is provided by Bertelsmann Arvato Analytics.
clustering data-analysis decomposition feature-engineering machine-learning unsupervised-learning
Last synced: 08 Aug 2025
https://github.com/leabrodyheine/ml-kaggle-cirrhosis-data
This project showcases skills in machine learning, data preprocessing, and model evaluation using Python libraries such as scikit-learn, XGBoost, and Optuna. It involves implementing various machine learning models, handling imbalanced data, and employing imputation techniques to enhance model performance for predicting cirrhosis outcomes.
data-analysis data-pre imbalanced-data imputation machine-learning optuna pipeline scikit-learn xgboost
Last synced: 14 May 2026
https://github.com/rohithay/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 15 May 2026
https://github.com/kingflow-23/association-matching
Recherche et Structuration d'Opportunités de Financement pour les Associations
association data-analysis data-engineering excel fondation pyqt5 python webscraping
Last synced: 07 Apr 2025
https://github.com/hatamiarash7/ir-system
IR System for Reuters DB
data-analysis data-mining ir python
Last synced: 29 Mar 2025
https://github.com/rajesh9943/visualizing-global-development-trends-an-animated-analysis-of-life-expectancy-and-fertility-rates
To clean and analyze data to find trends in global population, fertility, and life expectancy from 1960 to 2016. This idea was inspired by hans rosling . To analyze the data, I used a scatter bubble chart, which clearly shows how's the population increased and the fertility rate decreased from 1960 to 2016.
data-analysis data-cleaning-and-preprocessing data-exploration expolatory-data-analysis identify-patterns reporting vizualisation
Last synced: 08 Oct 2025
https://github.com/mindlessmuse666/eda-explorer
Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
csv-visualization data-analysis data-science data-visualization exploratory-data-analysis json-visualization matplotlib oop pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/diliprk/smartcityvisualization
Data Wrangling and Data Visualization Works done for Smart City Project at HBK Saar
bokeh data-analysis data-visualization python3
Last synced: 15 May 2026
https://github.com/smehra1208/certifications
data-analysis data-visualization excel postgres powerbi python sql
Last synced: 14 May 2026
https://github.com/cadedupont/mlb-data-analysis
Performing analysis on dataset of active MLB players in R
baseball-analytics data-analysis data-science mlb-stats-api r
Last synced: 23 Jun 2026
https://github.com/advestis/adadjust
Package allowing to fit any mathematical function to (for now 1-D only) data.
Last synced: 17 May 2026
https://github.com/dimits-ts/visualization-assignments
Visualizing and analyzing results from the PISA-2018 competitions with regards to Greek performance and gender gap.
data-analysis data-visualization interactive-graphs presentation-slides r-language tableau
Last synced: 06 Nov 2025
https://github.com/oubiche-ishak19/stock_evaluation_python
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
backtesting-frameworks classification csv-processing data-analysis expert-system finance financial-analysis-tools python rule-based-classifier stock stock-market streamlit tkinter-gui yahoo-finance
Last synced: 15 May 2026
https://github.com/noor188/preswald-data-app
A data app to visualize and manipulate the graduate admission dataset
data-analysis data-visualization open-source
Last synced: 04 Jul 2025
https://github.com/georgehanymilad/end-to-end-shopping-trends-data-analysis
SQL+ Python + Power BI Project for Data Analysis
data-analysis data-visualization datacleaning mssql powerbi python sql
Last synced: 17 May 2026
https://github.com/kefilweditse/awesome-matchem-datasets
Awesome-matchem-datasets is a curated collection of high-quality datasets for machine learning and data analysis in the field of chemistry. This repository includes various datasets, ranging from molecular structures to experimental results, suitable for both research and educational purposes.
awesome awesome-dataset awesome-dataset-collection awesome-match-data awesome-matchem data-analysis data-matching dataset dataset-collection dataset-research dataset-samples match match-data match-dataset-analysis match-examples
Last synced: 07 Apr 2025
https://github.com/tejaswirupa/data-analysis-of-departure-delays-at-united-airlines
Explored how weather and time factors influence delays in 58,000+ UA flights. Used permutation testing and visual analytics to show how temperature, visibility, and time of day affect departure punctuality.
Last synced: 25 Jan 2026
https://github.com/bpkaur/exploring-67-years-of-lego
Exploring 67 years of LEGO
data-analysis datacamp pandas python3
Last synced: 10 May 2026
https://github.com/leosimoes/datascienceacademy-python-analisededados
Atividades do curso Análise de Dados com Linguagem Python da DataScienceAcademy.
data-analysis data-science jupyter-notebook python sql
Last synced: 29 Apr 2026
https://github.com/kaoutarmi/analyse-des-ventes-pour-optimiser-la-performance
Analyse des données de ventes pour identifier des opportunités d'amélioration des performances commerciales. Utilisation de Pandas pour le traitement des données, et Matplotlib/Seaborn pour la visualisation des tendances et des résultats.
business-intelligence data-analysis data-visualization jupyter-notebook matplotlib pandas sales-optimization seaborn
Last synced: 01 Jul 2026
https://github.com/saob007/tablero_subsidios_servicio_agua
Se construye un dashboard para el análisis de la distribución y asignación de subsidios para agua potable y alcantarillado otorgados por la Secretaría de Planeación de la Alcaldía de Sincelejo en 2020, con el objetivo de identificar patrones en cobertura, consumo, facturación y subsidios, facilitando la toma de decisiones en políticas públicas
dashboard data-analysis data-visualization looker-studio
Last synced: 31 Jan 2026
https://github.com/ddjain/jsonl-visualizer
A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes
data-analysis json jsonl viusal
Last synced: 01 Jul 2026
https://github.com/silianpan/python-data-analysis-course
python data analysis course of drotion-lega
data-analysis jupyter-notebook panda
Last synced: 11 Apr 2025
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/manish506/loan-approval-prediction
Explore predictive modeling in this project by applying classification techniques to a loan approval dataset. Analyze and preprocess the data, then use models like K-Nearest Neighbors, Random Forest, SVC, and Logistic Regression to predict loan outcomes. Gain insights into approval factors and enhance prediction accuracy.
classification classification-models data-analysis data-science jupyter-notebook loan-approval-prediction machine-learning predictive-analytics predictive-modeling project python
Last synced: 19 Jan 2026
https://github.com/shz-code/diwali_sales_data_analysis
Customer Product Purchase Behavior Analysis
behavior-analysis data-analysis matplotlib ml sales seaborn
Last synced: 14 Mar 2025
https://github.com/brunomontezano/sleep-cognition-and-functioning
💤 Data analysis of a brief communication published in Psychiatry Research Communications journal by Montezano et al (2023).
bipolar-disorder cognition data-analysis data-visualization data-viz depression ggplot2 pelotasrs psychiatry psychology published-article r sleep ucpel
Last synced: 13 Jun 2026
https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis
Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.
9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks
Last synced: 26 Jun 2026
https://github.com/muthukumar0908/cardekho_used_car_price_prediction
The project aim is to build a machine learning model that offers users to find current valuations for used cars.
data-analysis data-visualization datacleaning eda machine-learning python streamlit
Last synced: 30 Mar 2025
https://github.com/marcomadera/test-for-random-numbers
Test for random number between 0 and 1
Last synced: 09 Jul 2025
https://github.com/gonzalofuentes28/dpeek
Interactive terminal data viewer for CSV, TSV, JSON, and JSONL files
bubbletea cli csv csv-viewer data-analysis data-viewer golang json json-viewer sqlite terminal tui
Last synced: 06 Apr 2026
https://github.com/sivkri/shiny-scatter-plot-app
This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.
data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny
Last synced: 22 Mar 2025
https://github.com/sivkri/rnaseq-analysis-junctionseq-qorts
This repository provides scripts for RNA-Seq data analysis using JunctionSeq and QoRTs, enabling quality control, differential splicing analysis, and generation of browser tracks.
bioinformatics data-analysis differential-splicing genomics junctionseq qorts quality-control rna-seq rna-seq-analysis splice-junctions splice-variants spliced-alignment transcriptomics
Last synced: 22 Mar 2025
https://github.com/habiburrahman-mu/exploratory-data-analysis
Methods to see if certain characteristics or features can be used to predict.
data-analysis data-mining data-science data-visualization
Last synced: 20 Jan 2026
https://github.com/rohitha-tata/bike-sales
This project focuses on data cleaning, transformation, and dashboard creation using a bike buyers dataset. It includes Pivot Tables, slicers, visualizations, and statistical insights to analyze trends based on income, age, occupation, and other key factors. Insights help understand customer behavior, purchasing patterns, and decision-making trends.
data-analysis data-cleaning excel-dashboards interactive-slicers pivot-charts pivot-tables
Last synced: 08 Mar 2026
https://github.com/hasnathjami/data-analysis-of-covid-19
An Oracle PL/SQL-based project on COVID-19 data analysis. It is my CSE 4.1 project of Distributive Database Management System LAB.
data-analysis naive-bayes-classifier oracle-database probability-statistics sqlplus
Last synced: 08 Mar 2026
https://github.com/nimomach/amazon-sales-data
This is a small dataset containing Amazon sales data analysis for few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/jatin-mehra119/sales-analysis
Sales Analysis of super market
data-analysis salesanalysis visualization
Last synced: 30 Jun 2026
https://github.com/shubhammittal-data/hr_dashboard_tableau
An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.
chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public
Last synced: 17 May 2026
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 24 Jan 2026
https://github.com/faith99/water_pollution_dashboard
A data visualization project exploring water access, contamination and health outcomes
data-analysis data-visualization powerbi public-health publichealth
Last synced: 02 Feb 2026
https://github.com/tolumie/web-scraping-rest-api-stock-data-operations
Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.
api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping
Last synced: 19 May 2026
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 24 Jul 2025
https://github.com/poglolopez/prueba_tecnica_inlaze
Este repositorio muestra mis habilidades en análisis de datos a través de una prueba técnica para Inlaze. Incluye flujos de trabajo con Python, SQLite y Power BI para analizar el comportamiento de jugadores, depósitos y rendimiento de fuentes de tráfico, destacando eficiencia operativa e información estratégica.
data-analysis data-v etl jupyter powerbi python sqlite
Last synced: 26 Feb 2025
https://github.com/dzakwanalifi/reglins
regLins is an R package designed for performing linear regression analysis using various optimization methods. It also provides an interactive Shiny application for a more dynamic analysis experience.
data-analysis linear-regression optimization r shiny-app
Last synced: 09 Jul 2025
https://github.com/bris0yzbekaye/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 25 Jul 2025
https://github.com/bhiogade/customer-purchase-analysis
Comprehensive Customer Purchase Analysis Across Multiple Dimensions
data-analysis data-visualization tableau tableau-desktop
Last synced: 02 Feb 2026
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 24 Jul 2025
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 24 Jul 2025
https://github.com/matte34/auto-insurance-analysis
Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset that I found from Kaggle. I performed a permutation test and generated data visualizations.
data-analysis data-visualization permutation-test python3 scipy seaborn
Last synced: 06 May 2026
https://github.com/devanshsahu47/talentscape-glassdoor-analysis
TalentScape is an end-to-end Python project that cleans and analyzes a comprehensive Glassdoor Jobs dataset. It features robust data wrangling and 20 insightful visualizations to uncover trends in job titles, salary ranges, company ratings, and more—providing actionable recommendations to optimize recruitment and compensation strategies.
business-intelligence data-analysis data-vizualisation jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/muhammadhussain-2009/stock-price-prediction-using-stacked-lstm
Predicting Google Stock Prices using Deep Learning Techniques.
data-analysis data-science data-visualization deep-learning jupyter-notebook keras lstm-neural-networks machine-learning-algorithms python stock-data stock-price-prediction tensorflow
Last synced: 16 Apr 2026
https://github.com/ryan-wong1/72-years-of-shark-incidents-in-california-data-analysis
Shark Incidents in California 1950 - 2022
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/serlo/data-pipeline-interactive-exercises
processing pipeline for exercise dashboards
Last synced: 26 Feb 2025
https://github.com/anastasius21/creditcardfrauddetection
This repository contains a Jupyter Notebook for Credit Card Fraud Detection Model and a csv dataset on which it is being trained
credit-card-fraud data-analysis data-science data-visualization fraud-detection logistic-regression machine-learning
Last synced: 16 Jun 2025
https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard
Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach
data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report
Last synced: 03 Jan 2026
https://github.com/phanchenh/youtube_analysis_rlanguage
Insights into YouTube Channel Performance - A Data-Driven Approach
business-analytics data-analysis data-driven data-visualization etl-pipeline preprocessing r-language r-programming-language
Last synced: 10 Mar 2026
https://github.com/ayberkyavuz/body_type_estimator
This repository is a tutorial for all levels who want to learn how to develop end to end machine learning system.
backend classification css data-analysis dataset end-to-end flask flask-application frontend html javascript machine-learning machine-learning-application material-design materializecss pandas python tutorial webapp xgboost
Last synced: 10 Apr 2026
https://github.com/tomy-jr98/air-quality-sql-project
Air pollution analysis using BigQuery and Tableau, with data cleaning, aggregation, and visualization.
air-pollution bigquery data-analysis portfolio sql tableau
Last synced: 25 Jul 2025
https://github.com/singhs05/global-youtube-trends
Understand the impact of Likes, comments, dislikes on the video consumption for the videos that were trending.
data-analysis mssqlserver query sql
Last synced: 18 Mar 2026
https://github.com/cescedes/medical-insurance-costs-with-python
Investigate how different factors affect the prediction of medical insurance costs by practicing many python concepts.
codecademy data-analysis python python-dictionaries python-functions python-lists python-loops python-strings
Last synced: 19 May 2026
https://github.com/shreeparab1890/india-gdp-rate-1960-to-2021-data-analysis
This ipython notebook is the Exploratory data analysis (EDA) of the India GDP Rate 1960 to 2021.
analysis data-analysis eda exploratory-data-analysis ipython-notebook jyputer-notebook matplotlib matplotlib-pyplot pandas python
Last synced: 06 Mar 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/andersoncrs/clasificacion-propina-restaurante
Este informe desarrolla, de manera clara y práctica, un análisis completo del conocido conjunto de datos de propinas (tips), mostrando paso a paso cómo transformar la información cruda en modelos predictivos útiles.
clasification data-analysis data-visualization tips
Last synced: 26 Jul 2025
https://github.com/farzeennimran/apriori-algorithm
Apriori Algorithm for Association Rule Mining
algorithm apriori apriori-algorithm apyori association-rule-mining association-rules data-analysis data-mining data-science numpy pandas python
Last synced: 06 Apr 2026
https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 02 Mar 2025
https://github.com/wassimhedfi/exploring-the-evolution-of-linux
Datacamp guided Project
data-analysis data-science ml python
Last synced: 15 May 2026
https://github.com/riyajain255/customer-segmentation-for-e-commerce
This project analyzes online retail data to segment customers using K-Means clustering and build classification models to predict those segments based on purchasing behavior.
customer-segmentation data-analysis kmeans-clustering logistic-regression machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn-plots
Last synced: 02 Apr 2026
https://github.com/zakintaliban/indonesia-train-passenger-forecasting
A project to analyze and forecast train passenger numbers in Indonesia using Python, Pandas, and Scikit-learn.
bps data-analysis data-science dataanalysis datascience forecasting indonesia kereta-api machine-learning machinelearning numpy pandas python scikit-learn scikitlearn seaborn time-series
Last synced: 29 Apr 2026
https://github.com/kushalagarwalla/netflix-movie-data-analysis
🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.
data-analysis data-visualization jupyter-notebook numpy pandas python seaborn
Last synced: 06 May 2026
https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python
This project analyzes Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.
data-analysis data-visualization matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/labex-labs/numpy-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.
array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations
Last synced: 20 Jun 2026
https://github.com/vitor-ace/sunspots-data-analysis
This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.
data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python
Last synced: 06 May 2026
https://github.com/shrikantnaidu/greyatom-projects
GreyAtom Projects.
data-analysis data-science greyatom machine-learning portfolio
Last synced: 24 Jul 2025
https://github.com/sanafagal/wsp-msg-automation
An intuitive application for managing and analyzing customer and reseller data stored in Google Sheets, providing insights and streamlined data organization.
automation cloud-credentials data-analysis google-sheets-api python
Last synced: 16 Jun 2025
https://github.com/hecatops/ad_libs
A real time advertisement data analytics platforming, displaying important metrics in easy to understand language.
dashboard data-analysis data-visualization kpi plotly-dash python
Last synced: 07 Nov 2025
https://github.com/chahelgupta/hospital-readmission-prediction-and-analysis
The Hospital Readmission Prediction project uses clinical data to predict diabetic readmissions. SVM + SMOTE achieved 61.16% accuracy, with key predictors including hospital stay, lab tests, and medications.
data-analysis knn-classification logistic-regression machine-learning prediction prediction-model python random-forest-classifier smote svm-classifier
Last synced: 15 May 2026
https://github.com/zwelz3/unofficial-survivor-knowledge-graph
A comprehensive RDF knowledge graph covering all 50 seasons of Survivor (US), with 23,000+ triples across 749 named graphs.
Last synced: 23 May 2026
https://github.com/amruthadevops/stock-market-analysis
To analyze market trends and predict future market behavior using machine learning techniques
data-analysis data-science jupyter-notebook machine-learning powerbi-desktop python stock-market
Last synced: 15 May 2026
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/carlosvinimsouza/full-tutorial-python
My tutorial Python completed
data-analysis data-science data-structures django django-framework fastapi fastapi-framework flask flask-web frameworks learn-to-code learning python python3 roadmap tutorial tutorial-code
Last synced: 10 Apr 2026