Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
- JSON Representation
https://github.com/esr-style/stylegrid
A free alternative to AG grid built by me for personal use case.
aggrid data-analysis grid pivot-chart pivot-grid table
Last synced: 16 Sep 2025
https://github.com/hrolive/patc-big-data-analytics-bsc
Introduction to the main concepts and technologies related to Big Data and Data Analytics and its applications to real projects.
analytics bias big-data data-analysis hadoop hpc machine-learning mapreduce nosql python spark spark-streaming visualization
Last synced: 12 Apr 2026
https://github.com/dsarceno/portfolio
Portafolio de Científico de Datos. Proyectos realizados por Diego Sarceño.
computer-vision data-analysis data-science deep-learning docker graph-algorithms investing keras-tensorflow machine-learning markdown neural-networks optimization-algorithms pipelines python sentiment-analysis sklearn tensorflow voice-recognition
Last synced: 06 Mar 2026
https://github.com/faris771/identify_customer_segments
This project is part of the Palestine Launchpad by Spark, and Udacity with Google. It uses unsupervised learning to identify customer segments for a mail-order company in Germany. The goal is to direct marketing campaigns towards the most promising audiences. The data is provided by Bertelsmann Arvato Analytics.
clustering data-analysis decomposition feature-engineering machine-learning unsupervised-learning
Last synced: 08 Aug 2025
https://github.com/natgluons/fmcg-data-modeling
SQL, ARIMA, and K-Means Clustering for data analysis dan customer segmentation regarding sales data
arima-forecasting arima-model customer-segmentation data-analysis data-science-projects kmeans-clustering sales-forecasting
Last synced: 13 Aug 2025
https://github.com/ansh-info/literaturesurvey
Literature Survey Engine, leverages the powerful Semantic Scholar's Recommendation API to provide you with highly relevant research article recommendations based on your curated lists of articles.
api api-integration automation data-analysis data-visualization docker docker-compose literature-survey machine-learning mysql paper-recommendations python recommendation-system research-tools semantic-scholar streamlit zotero
Last synced: 10 Apr 2026
https://github.com/kingflow-23/association-matching
Recherche et Structuration d'Opportunités de Financement pour les Associations
association data-analysis data-engineering excel fondation pyqt5 python webscraping
Last synced: 07 Apr 2025
https://github.com/hatamiarash7/ir-system
IR System for Reuters DB
data-analysis data-mining ir python
Last synced: 29 Mar 2025
https://github.com/bhaveshbhakta/movie-recommendation-system-using-ml
Movie Recommendation System
data-analysis data-visualization eda machine-learning movie-recommedation
Last synced: 10 Nov 2025
https://github.com/mindlessmuse666/eda-explorer
Инструмент на Python для разведочного анализа данных (EDA) и визуализации, поддерживающий загрузку данных CSV и JSON, с модульной архитектурой ООП. Практическая работа по теме: "Обнаружение и визуализация данных для понимания их сущности" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
csv-visualization data-analysis data-science data-visualization exploratory-data-analysis json-visualization matplotlib oop pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/rohithay/titanic-data-analysis
Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.
data-analysis machine-learning matplotlib pandas scipy-stats statistical-models
Last synced: 15 May 2026
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/diliprk/smartcityvisualization
Data Wrangling and Data Visualization Works done for Smart City Project at HBK Saar
bokeh data-analysis data-visualization python3
Last synced: 15 May 2026
https://github.com/cadedupont/mlb-data-analysis
Performing analysis on dataset of active MLB players in R
baseball-analytics data-analysis data-science mlb-stats-api r
Last synced: 23 Jun 2026
https://github.com/dimits-ts/visualization-assignments
Visualizing and analyzing results from the PISA-2018 competitions with regards to Greek performance and gender gap.
data-analysis data-visualization interactive-graphs presentation-slides r-language tableau
Last synced: 06 Nov 2025
https://github.com/advestis/adadjust
Package allowing to fit any mathematical function to (for now 1-D only) data.
Last synced: 17 May 2026
https://github.com/noor188/preswald-data-app
A data app to visualize and manipulate the graduate admission dataset
data-analysis data-visualization open-source
Last synced: 04 Jul 2025
https://github.com/oubiche-ishak19/stock_evaluation_python
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
backtesting-frameworks classification csv-processing data-analysis expert-system finance financial-analysis-tools python rule-based-classifier stock stock-market streamlit tkinter-gui yahoo-finance
Last synced: 15 May 2026
https://github.com/kefilweditse/awesome-matchem-datasets
Awesome-matchem-datasets is a curated collection of high-quality datasets for machine learning and data analysis in the field of chemistry. This repository includes various datasets, ranging from molecular structures to experimental results, suitable for both research and educational purposes.
awesome awesome-dataset awesome-dataset-collection awesome-match-data awesome-matchem data-analysis data-matching dataset dataset-collection dataset-research dataset-samples match match-data match-dataset-analysis match-examples
Last synced: 07 Apr 2025
https://github.com/tejaswirupa/data-analysis-of-departure-delays-at-united-airlines
Explored how weather and time factors influence delays in 58,000+ UA flights. Used permutation testing and visual analytics to show how temperature, visibility, and time of day affect departure punctuality.
Last synced: 25 Jan 2026
https://github.com/jailsonsb2/kit-analise-de-dados
🚀 Um kit de ferramentas Python para acelerar a análise de dados. Carregue arquivos de forma inteligente (CSV, Excel, etc.) e converta notebooks Jupyter para scripts de produção sem esforço.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados automation automations dados data-analysis data-cleaning etl etl-automation jupyter-notebook pandas powerquery python toolkit
Last synced: 29 Apr 2026
https://github.com/leosimoes/datascienceacademy-python-analisededados
Atividades do curso Análise de Dados com Linguagem Python da DataScienceAcademy.
data-analysis data-science jupyter-notebook python sql
Last synced: 29 Apr 2026
https://github.com/georgehanymilad/end-to-end-shopping-trends-data-analysis
SQL+ Python + Power BI Project for Data Analysis
data-analysis data-visualization datacleaning mssql powerbi python sql
Last synced: 17 May 2026
https://github.com/saob007/tablero_subsidios_servicio_agua
Se construye un dashboard para el análisis de la distribución y asignación de subsidios para agua potable y alcantarillado otorgados por la Secretaría de Planeación de la Alcaldía de Sincelejo en 2020, con el objetivo de identificar patrones en cobertura, consumo, facturación y subsidios, facilitando la toma de decisiones en políticas públicas
dashboard data-analysis data-visualization looker-studio
Last synced: 31 Jan 2026
https://github.com/silianpan/python-data-analysis-course
python data analysis course of drotion-lega
data-analysis jupyter-notebook panda
Last synced: 11 Apr 2025
https://github.com/brunomontezano/sleep-cognition-and-functioning
💤 Data analysis of a brief communication published in Psychiatry Research Communications journal by Montezano et al (2023).
bipolar-disorder cognition data-analysis data-visualization data-viz depression ggplot2 pelotasrs psychiatry psychology published-article r sleep ucpel
Last synced: 13 Jun 2026
https://github.com/byte7/fifa17-analysis-and-prediction
⚽ FIFA 17 Analysis and Prediction⚽
data-analysis dreamteam fifa17 fifa17-analysis ultimate-team
Last synced: 26 Mar 2025
https://github.com/apostolis-bloutsos-data/employee-data-eda
Mini EDA project on synthetic employee records using Python, pandas, and matplotlib
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/shz-code/diwali_sales_data_analysis
Customer Product Purchase Behavior Analysis
behavior-analysis data-analysis matplotlib ml sales seaborn
Last synced: 14 Mar 2025
https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis
Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.
9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks
Last synced: 26 Jun 2026
https://github.com/marcomadera/test-for-random-numbers
Test for random number between 0 and 1
Last synced: 09 Jul 2025
https://github.com/muthukumar0908/cardekho_used_car_price_prediction
The project aim is to build a machine learning model that offers users to find current valuations for used cars.
data-analysis data-visualization datacleaning eda machine-learning python streamlit
Last synced: 30 Mar 2025
https://github.com/gonzalofuentes28/dpeek
Interactive terminal data viewer for CSV, TSV, JSON, and JSONL files
bubbletea cli csv csv-viewer data-analysis data-viewer golang json json-viewer sqlite terminal tui
Last synced: 06 Apr 2026
https://github.com/faith99/water_pollution_dashboard
A data visualization project exploring water access, contamination and health outcomes
data-analysis data-visualization powerbi public-health publichealth
Last synced: 02 Feb 2026
https://github.com/sivkri/shiny-scatter-plot-app
This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.
data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny
Last synced: 22 Mar 2025
https://github.com/sivkri/rnaseq-analysis-junctionseq-qorts
This repository provides scripts for RNA-Seq data analysis using JunctionSeq and QoRTs, enabling quality control, differential splicing analysis, and generation of browser tracks.
bioinformatics data-analysis differential-splicing genomics junctionseq qorts quality-control rna-seq rna-seq-analysis splice-junctions splice-variants spliced-alignment transcriptomics
Last synced: 22 Mar 2025
https://github.com/habiburrahman-mu/exploratory-data-analysis
Methods to see if certain characteristics or features can be used to predict.
data-analysis data-mining data-science data-visualization
Last synced: 20 Jan 2026
https://github.com/rohitha-tata/bike-sales
This project focuses on data cleaning, transformation, and dashboard creation using a bike buyers dataset. It includes Pivot Tables, slicers, visualizations, and statistical insights to analyze trends based on income, age, occupation, and other key factors. Insights help understand customer behavior, purchasing patterns, and decision-making trends.
data-analysis data-cleaning excel-dashboards interactive-slicers pivot-charts pivot-tables
Last synced: 08 Mar 2026
https://github.com/hasnathjami/data-analysis-of-covid-19
An Oracle PL/SQL-based project on COVID-19 data analysis. It is my CSE 4.1 project of Distributive Database Management System LAB.
data-analysis naive-bayes-classifier oracle-database probability-statistics sqlplus
Last synced: 08 Mar 2026
https://github.com/nimomach/amazon-sales-data
This is a small dataset containing Amazon sales data analysis for few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/dzakwanalifi/reglins
regLins is an R package designed for performing linear regression analysis using various optimization methods. It also provides an interactive Shiny application for a more dynamic analysis experience.
data-analysis linear-regression optimization r shiny-app
Last synced: 09 Jul 2025
https://github.com/baguilar6174/python-jupyter-notebooks
Explore data analysis projects with Python, Jupyter and more tools. Discover stunning visualizations and reveal meaningful information in datasets to make informed decisions.
data-analysis jupyter-notebook kaggle pandas python
Last synced: 09 Apr 2026
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 24 Jan 2026
https://github.com/bhiogade/customer-purchase-analysis
Comprehensive Customer Purchase Analysis Across Multiple Dimensions
data-analysis data-visualization tableau tableau-desktop
Last synced: 02 Feb 2026
https://github.com/tolumie/web-scraping-rest-api-stock-data-operations
Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.
api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping
Last synced: 19 May 2026
https://github.com/mrham17/spotify_streaming_analytics
Project is stable & documentation will be completed soon. Thank you for your understanding and patience.
big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics
Last synced: 24 Jul 2025
https://github.com/ryan-wong1/analyzing-stress-and-fatigue-drivers-in-railroad-workforces-data-analysis
Railroad dispatcher data on demographics, work, lifestyle, and stress factors
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/bris0yzbekaye/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 25 Jul 2025
https://github.com/ryan-wong1/72-years-of-shark-incidents-in-california-data-analysis
Shark Incidents in California 1950 - 2022
data-analysis data-cleaning data-visualization exploratory-data-analysis sql
Last synced: 25 Jan 2026
https://github.com/anastasius21/creditcardfrauddetection
This repository contains a Jupyter Notebook for Credit Card Fraud Detection Model and a csv dataset on which it is being trained
credit-card-fraud data-analysis data-science data-visualization fraud-detection logistic-regression machine-learning
Last synced: 16 Jun 2025
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 17 Sep 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 24 Jul 2025
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 24 Jul 2025
https://github.com/matte34/auto-insurance-analysis
Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset that I found from Kaggle. I performed a permutation test and generated data visualizations.
data-analysis data-visualization permutation-test python3 scipy seaborn
Last synced: 06 May 2026
https://github.com/emmarhoffmann/analysis-of-sleep-patterns-and-psychological-well-being-among-college-students
Explores the relationship between sleep patterns, psychological well-being, and lifestyle choices among college students using statistical analysis on 253 observations.
college-students data-analysis r statistical-models
Last synced: 04 Oct 2025
https://github.com/muhammadhussain-2009/stock-price-prediction-using-stacked-lstm
Predicting Google Stock Prices using Deep Learning Techniques.
data-analysis data-science data-visualization deep-learning jupyter-notebook keras lstm-neural-networks machine-learning-algorithms python stock-data stock-price-prediction tensorflow
Last synced: 16 Apr 2026
https://github.com/shreeparab1890/india-gdp-rate-1960-to-2021-data-analysis
This ipython notebook is the Exploratory data analysis (EDA) of the India GDP Rate 1960 to 2021.
analysis data-analysis eda exploratory-data-analysis ipython-notebook jyputer-notebook matplotlib matplotlib-pyplot pandas python
Last synced: 06 Mar 2026
https://github.com/siddhartha-padhy/heart-disease-predictor
data-analysis machine-learning pandas python
Last synced: 20 Apr 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard
Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach
data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report
Last synced: 03 Jan 2026
https://github.com/misszeferino/us-traffic-accidents-analysis
Exploratory Data Analysis using Python
data-analysis matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2026
https://github.com/applicativesystem/numpy-builder
code getter and numpty operator for numpy operations
data-analysis numpy numpy-python shell-script
Last synced: 15 Aug 2025
https://github.com/tomy-jr98/air-quality-sql-project
Air pollution analysis using BigQuery and Tableau, with data cleaning, aggregation, and visualization.
air-pollution bigquery data-analysis portfolio sql tableau
Last synced: 25 Jul 2025
https://github.com/lashawnfofung/super-heroes-analysis-project
This portfolio project involves a detailed analysis of 732 superhero records from the heroes_information.csv dataset, comprising 11 columns of unique characteristics for each hero. The primary goal is to showcase key insights derived from this rich dataset, demonstrating proficiency in data analysis using SQL.
data-analysis datasets mysql-database mysql-server mysql-workbench sql
Last synced: 07 Jul 2025
https://github.com/dimits-ts/sport-repression-repl-study
A replication Study for the recent paper "International Sports Events and Repression in Autocracies: Evidence from the 1978 FIFA World Cup" paper.
data-analysis jupyter regression-models replication-study statistical-analysis
Last synced: 25 Jul 2025
https://github.com/cescedes/medical-insurance-costs-with-python
Investigate how different factors affect the prediction of medical insurance costs by practicing many python concepts.
codecademy data-analysis python python-dictionaries python-functions python-lists python-loops python-strings
Last synced: 19 May 2026
https://github.com/farzeennimran/apriori-algorithm
Apriori Algorithm for Association Rule Mining
algorithm apriori apriori-algorithm apyori association-rule-mining association-rules data-analysis data-mining data-science numpy pandas python
Last synced: 06 Apr 2026
https://github.com/vishal-verma-96/Pre-Owned-Car-Price-prediction-using-Streamlit-App
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 02 Mar 2025
https://github.com/andersoncrs/clasificacion-propina-restaurante
Este informe desarrolla, de manera clara y práctica, un análisis completo del conocido conjunto de datos de propinas (tips), mostrando paso a paso cómo transformar la información cruda en modelos predictivos útiles.
clasification data-analysis data-visualization tips
Last synced: 26 Jul 2025
https://github.com/wassimhedfi/exploring-the-evolution-of-linux
Datacamp guided Project
data-analysis data-science ml python
Last synced: 15 May 2026
https://github.com/sanafagal/wsp-msg-automation
An intuitive application for managing and analyzing customer and reseller data stored in Google Sheets, providing insights and streamlined data organization.
automation cloud-credentials data-analysis google-sheets-api python
Last synced: 16 Jun 2025
https://github.com/chahelgupta/hospital-readmission-prediction-and-analysis
The Hospital Readmission Prediction project uses clinical data to predict diabetic readmissions. SVM + SMOTE achieved 61.16% accuracy, with key predictors including hospital stay, lab tests, and medications.
data-analysis knn-classification logistic-regression machine-learning prediction prediction-model python random-forest-classifier smote svm-classifier
Last synced: 15 May 2026
https://github.com/riyajain255/customer-segmentation-for-e-commerce
This project analyzes online retail data to segment customers using K-Means clustering and build classification models to predict those segments based on purchasing behavior.
customer-segmentation data-analysis kmeans-clustering logistic-regression machine-learning matplotlib numpy pandas python random-forest scikit-learn seaborn-plots
Last synced: 02 Apr 2026
https://github.com/zakintaliban/indonesia-train-passenger-forecasting
A project to analyze and forecast train passenger numbers in Indonesia using Python, Pandas, and Scikit-learn.
bps data-analysis data-science dataanalysis datascience forecasting indonesia kereta-api machine-learning machinelearning numpy pandas python scikit-learn scikitlearn seaborn time-series
Last synced: 29 Apr 2026
https://github.com/kushalagarwalla/netflix-movie-data-analysis
🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.
data-analysis data-visualization jupyter-notebook numpy pandas python seaborn
Last synced: 06 May 2026
https://github.com/samiksha29-patil/flipkart-mobiles-data-analysis-visualization-in-python
This project analyzes Flipkart Mobiles Dataset to extract useful insights about mobile phones, their pricing, ratings, discounts, and customer reviews. The analysis and visualization are done using Python to understand market trends and customer preferences.
data-analysis data-visualization matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/labex-labs/numpy-for-beginners
This comprehensive course covers the fundamental concepts and practical techniques of NumPy, the essential library for numerical computing in Python. Learn to create, manipulate, and analyze arrays efficiently.
array-manipulation array-slicing beginner-friendly course data-analysis data-science data-structures fast-computation hands-on labex labs linear-algebra matrix-operations numerical-computing numpy programming python python-programming scientific-computing vectorized-operations
Last synced: 20 Jun 2026
https://github.com/vitor-ace/sunspots-data-analysis
This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.
data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python
Last synced: 06 May 2026
https://github.com/Solrikk/PicTrace-Web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 15 Aug 2025
https://github.com/amruthadevops/stock-market-analysis
To analyze market trends and predict future market behavior using machine learning techniques
data-analysis data-science jupyter-notebook machine-learning powerbi-desktop python stock-market
Last synced: 15 May 2026
https://github.com/hecatops/ad_libs
A real time advertisement data analytics platforming, displaying important metrics in easy to understand language.
dashboard data-analysis data-visualization kpi plotly-dash python
Last synced: 07 Nov 2025
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/zwelz3/unofficial-survivor-knowledge-graph
A comprehensive RDF knowledge graph covering all 50 seasons of Survivor (US), with 23,000+ triples across 749 named graphs.
Last synced: 23 May 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/ddjain/jsonl-visualizer
A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes
data-analysis json jsonl viusal
Last synced: 18 Sep 2025
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/m-coder-umer/sales-dashboard-power-bi-project
An interactive Sales Dashboard built with Power BI using MySQL data, showcasing monthly trends, top-performing products, and key sales KPIs (Key Performance Indicators).
business-intelligence data-analysis data-cleaning data-modeling data-visualization dax interactive-dashboard mysql power-query powerbi sales-dashboard sql time-series-analysis
Last synced: 07 Jul 2025
https://github.com/ariyaarka/sales-analysis
A simple analysis on random dataset of pizza sales using SQL
data-analysis presentation-slides sql
Last synced: 17 Jan 2026
https://github.com/rh01/data-analysis-with-r
Duke University - Data Analysis With R
data-analysis r r-language r-studio rmarkdown
Last synced: 23 May 2026
https://github.com/jpcadena/cancer-classification
Breast cancer classification project.
cancer-detection classification data-analysis data-science deep-learning imblearn machine-learning neuronal-network numpy pandas pylint python scikit-learn supervised-learning tensorflow
Last synced: 09 Apr 2026
https://github.com/mothraa/etl-marketanalysis-webscraping
OC project 2
data-analysis etl python web-scraping
Last synced: 15 Aug 2025
https://github.com/lewismakau/portfolio-projects
This repository contains file data and SQL files for projects used for my Portfolio.
data-analysis data-cleaning data-structures data-visualization database google-analytics microsoft-sql-server mysql powerbi tableau
Last synced: 02 Apr 2026
https://github.com/ljadhav25/healthcare-data-collection-and-analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
data-analysis data-visualization flask-application flask-backend html-css-javascript pycharm-ide python
Last synced: 09 Apr 2026
https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree
This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.
assignments data-analysis python3 statistics udacity-machine-learning-nanodegree
Last synced: 26 Jul 2025
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 27 Jul 2025
https://github.com/mobutolakecondyle107/sql-server-ddw
🚀 Streamline data management with sql-server-ddw, a powerful tool for efficient queries and seamless integration in SQL Server environments.
backup-and-recovery data-analysis data-integration data-modeling database-management database-security performance-tuning query-optimization reporting-tools sql-scripting sql-server sql-server-express stored-procedures table-design transaction-management
Last synced: 12 Jun 2026
https://github.com/hfzdzakii/dicoding-shipclusteringanalysisdataandmodelling
This repo is a master submission for my Dicoding Final Project. Ship Performance Clustering Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
clustering data-analysis machine-learning
Last synced: 27 Jul 2025
https://github.com/susshiii/sql-layoffs-data-cleaning-and-eda
Full SQL project using MySQL to clean and analyze a real-world tech layoff dataset from 2020–2023.
data-analysis data-analytics-project data-cleaning eda layoffs mysql sql
Last synced: 07 Jul 2025