Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/souravxbera/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 15 May 2026
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/m-coder-umer/sales-dashboard-power-bi-project
An interactive Sales Dashboard built with Power BI using MySQL data, showcasing monthly trends, top-performing products, and key sales KPIs (Key Performance Indicators).
business-intelligence data-analysis data-cleaning data-modeling data-visualization dax interactive-dashboard mysql power-query powerbi sales-dashboard sql time-series-analysis
Last synced: 07 Jul 2025
https://github.com/ariyaarka/sales-analysis
A simple analysis on random dataset of pizza sales using SQL
data-analysis presentation-slides sql
Last synced: 17 Jan 2026
https://github.com/rh01/data-analysis-with-r
Duke University - Data Analysis With R
data-analysis r r-language r-studio rmarkdown
Last synced: 23 May 2026
https://github.com/jpcadena/cancer-classification
Breast cancer classification project.
cancer-detection classification data-analysis data-science deep-learning imblearn machine-learning neuronal-network numpy pandas pylint python scikit-learn supervised-learning tensorflow
Last synced: 09 Apr 2026
https://github.com/Solrikk/PicTrace-Web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 15 Aug 2025
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/ljadhav25/healthcare-data-collection-and-analysis
This repository contains a project focused on collecting healthcare data from the web, storing it in a structured format, and performing comprehensive analysis. The objective is to gather valuable health-related information, process and clean the data, and derive insights to support healthcare research and decision-making.
data-analysis data-visualization flask-application flask-backend html-css-javascript pycharm-ide python
Last synced: 09 Apr 2026
https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree
This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.
assignments data-analysis python3 statistics udacity-machine-learning-nanodegree
Last synced: 26 Jul 2025
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 27 Jul 2025
https://github.com/sevilaymuni/project-no.2-pandas-tableau-student-mobility
Pandas assisted Feature Engineering on Study Mobility: Tableau Dashboards on Students' Preferences
data-analysis data-extraction data-visualization feature-engineering pandas python tableau-dashboards tableau-desktop tableau-public
Last synced: 03 May 2026
https://github.com/hfzdzakii/dicoding-shipclusteringanalysisdataandmodelling
This repo is a master submission for my Dicoding Final Project. Ship Performance Clustering Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
clustering data-analysis machine-learning
Last synced: 27 Jul 2025
https://github.com/susshiii/sql-layoffs-data-cleaning-and-eda
Full SQL project using MySQL to clean and analyze a real-world tech layoff dataset from 2020–2023.
data-analysis data-analytics-project data-cleaning eda layoffs mysql sql
Last synced: 07 Jul 2025
https://github.com/revtpark/teamseas_scrapper
Scraping Team Seas for data analysis and visualization.
chartjs data-analysis python webscraping
Last synced: 28 Mar 2025
https://github.com/danymukesha/bioga
Apply multi-objective genetic algorithms to genomic data for biologically informed feature selection and pattern discovery.
data-analysis gene-expression genetic-algorithms genomics optimization-algorithms
Last synced: 18 Sep 2025
https://github.com/dmvianna/python-nix
Trivial Nix environment with pandas and postgresql
Last synced: 27 Jul 2025
https://github.com/grindelfp/data-analysis-example
One of my UNI Artificial Intelligence Systems course's projects.
data-analysis data-preprocessing ipynb
Last synced: 19 Sep 2025
https://github.com/tbep-tech/tbeploads
R Package for estimating nutrient loading to Tampa Bay
data-analysis loads package tampa-bay tbep tbnmc water-quality
Last synced: 19 Feb 2026
https://github.com/jofaval/ionosphere
Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988
data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost
Last synced: 09 Apr 2026
https://github.com/mothraa/etl-marketanalysis-webscraping
OC project 2
data-analysis etl python web-scraping
Last synced: 15 Aug 2025
https://github.com/zeh237/superstore-data-analytics
This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/deliprofesor/customerseg-customer-segmentation-and-shopping-analysis
This project performs data exploration, segmentation, and modeling of wholesale customer data using clustering algorithms, PCA, and decision trees to analyze purchasing behavior and predict customer channel preferences.
clustering customer-segmentation data-analysis data-visualization dbscan decision-tree gmm kmeans machine-learning pca
Last synced: 24 Jun 2025
https://github.com/tbep-tech/tbep-r-training
Repository for miscellaneous R training materials
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/tbep-tech/pep-graphics
Materials for generating PEP graphics
data-analysis pep water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/tberf-oyster
Materials for evaluating TBERF oyster restoration success
ccmp-bh4 ccmp-bh6 data-analysis tampa-bay tbep tberf
Last synced: 19 Feb 2026
https://github.com/tbep-tech/pep-r-training
Materials for PEP R training
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/r8vnhill/hdp
Hentai data processing
data-analysis e-hentai hentai kotlin
Last synced: 02 Apr 2025
https://github.com/scailfin/rob-webapi-flask
Default RESTful Web API implementation for the Reproducible Open Benchmarks for Data Analysis Platform (ROB) using the Flask web framework.
benchmarks data-analysis reproducibility webapi
Last synced: 17 Mar 2026
https://github.com/shubhamgoyal575/tableau-visualization-dashboard
This repository features interactive Tableau dashboards for sales performance and healthcare analysis. It includes insights on revenue trends, regional sales, patient demographics, and hospital occupancy for data-driven decision-making. 🚀
dashborad data-analysis data-cleaning-and-preprocessing healthcare-analysis healthcare-dashboard sales-dashboard sales-data-analysis-project tableau tableau-dashboards tableau-public visualization visualization-tools
Last synced: 20 Feb 2026
https://github.com/shafaq-aslam/predicting-heart-disease-risk-with-logistic-regression-techniques
Develop a predictive model using logistic regression techniques to assess heart disease risk based on patient health metrics and data analysis.
data-analysis heart-disease logistic-regression machine-learning machine-learning-models matplotlib numpy pandas python scikit-learn seaborn
Last synced: 09 Apr 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive
A cookiecutter template for data analysis projects using Python.
cookiecutter data-analysis project-template python
Last synced: 09 Apr 2025
https://github.com/vetrivel07/flight-price-prediction
Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 15 Jun 2025
https://github.com/lucas-mazzolim/superstore-bi
Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Used tools as Mysql, Looker Studio, Google Spreadsheet and Python.
business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet
Last synced: 27 Jul 2025
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 03 May 2026
https://github.com/nandit123/python_on_excel
Data Analysis using python libraries on excel data
csv data-analysis data-science fill fluctuations graph numpy python python-library
Last synced: 16 May 2026
https://github.com/tkhoa2711/twitter-hate-speech
Hate speech detection on Twitter
Last synced: 28 Jul 2025
https://github.com/antononcube/java-tilestats
Java package for statistics over 2D tillings. (Tile binning, aggregation functions application, etc.)
Last synced: 02 Apr 2025
https://github.com/cassiofb-dev/fide-rating-analysis
The plot speaks for itself
chess data-analysis fide hans rating
Last synced: 15 Jun 2025
https://github.com/tbep-tech/fim-seagrass
Materials for analysis of FIM data, seagrass, and other datasets
data-analysis fim seagrass tampa-bay
Last synced: 19 Feb 2026
https://github.com/tbep-tech/seagrass-analysis
Materials for assessing coverage changes and analysis of drivers of change for Tampa Bay seagrass
dashboard data-analysis seagrass tampa-bay water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/piney-point-analysis
Materials for analysis of Piney Point monitoring data
data-analysis open-science piney-point tampa-bay tbep water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/peptools
Materials for wrangling and summarizing data from the Peconic Estuary
data-analysis package pep water-quality
Last synced: 19 Feb 2026
https://github.com/tbep-tech/rookery-bay-training
Materials for R training at Rookery Bay Monitoring Workshop 2020
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/ggarciajavier/udacity-dalf-project1-investigate-dataset
Work performed for the 1st project of Udacity Data Analyst Nanodegree: exploratory data analysis of a football dataset.
data-analysis football-analytics python python36 udacity-data-analyst-nanodegree
Last synced: 15 May 2026
https://github.com/arju10/exploring-hacker-news-posts
data-analysis hacker-news jupyter-notebook python
Last synced: 19 May 2026
https://github.com/rohitblaze10/survey_monkey_analysis--using-ipython
This data analysis project focused on extracting insights from survey responses. It involves data cleaning, merging, and transformation using iPython (Pandas,OS) and SQL. The goal is to identify trends and patterns in survey data for better decision-making.
data-analysis ipynb ipython-notebook
Last synced: 28 Jul 2025
https://github.com/mohitsai/mohitsai.github.io
My Personal Project Portfolio - simple SPA with basic HTML, CSS & Javascript
data-analysis data-engineering portfolio portfolio-page portfolio-website project-portfolio single-page-app software-engineering
Last synced: 28 Jul 2025
https://github.com/kineticloom/plydb-fun-nfl-analyst
Analyze NFL data with your AI agent
data-analysis football-analytics nfl
Last synced: 15 May 2026
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/srummanf/elnino-anomaly-study
Study on El Niño’s impact on Chennai groundwater sustainability
data-analysis machine-learning python satellite-imagery-analysis
Last synced: 15 May 2026
https://github.com/ashwin331133/sql-project--sales-data-analysis--walmart
This SQL-based Walmart data analysis project aims to identify top-performing branches and products, optimize sales strategies using Kaggle's Walmart Sales Forecasting Competition dataset.
Last synced: 03 Jan 2026
https://github.com/j-wu1/analyse_ventes_jeuxvideo_python
Analyse Exploratoire de Données (EDA) sur les ventes de jeux vidéo avec Python, Pandas, Matplotlib et Seaborn dans un Jupyter Notebook.
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 19 Aug 2025
https://github.com/labex-labs/sqlite-intermediate-to-advanced
In this course, delve into advanced SQLite techniques. Master constraints, indexing, joins, subqueries, transactions, triggers, views, full-text search, JSON, backups, PRAGMA tuning, CTEs, window functions, and more!
advanced-sql course data-analysis data-integrity data-manipulation data-modeling database database-design hands-on labex labs performance-tuning programming query-optimization relational-database schema-management sql sqlite stored-procedures transaction-management
Last synced: 18 May 2026
https://github.com/lucashomuniz/project-09
SALES DATA ANALYSIS WITH POWERBI AND PYTHON
business-analytics business-intelligence data-analysis data-science data-visualization excel powerbi python toy-project
Last synced: 28 Jul 2025
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/antononcube/wl-tilestats-paclet
Wolfram Language (aka Mathematica) paclet for statistics over 2D tillings. (Tile binning, aggregation functions application, etc.)
2d-data data-analysis geospatial-data mathematica wolfram-language
Last synced: 20 Mar 2026
https://github.com/jailsonsb2/kit-analise-de-dados
🚀 Um kit de ferramentas Python para acelerar a análise de dados. Carregue arquivos de forma inteligente (CSV, Excel, etc.) e converta notebooks Jupyter para scripts de produção sem esforço.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados automation automations dados data-analysis data-cleaning etl etl-automation jupyter-notebook pandas powerquery python toolkit
Last synced: 29 Apr 2026
https://github.com/odinsride/monpl
PL/SQL Data Load Monitoring Tools
data-analysis database etl logging logging-framework monitoring oracle plsql
Last synced: 28 Jul 2025
https://github.com/brad-cannell/waiver_evaluation
Waiver Evaluation
data-analysis evaluation hospital-admissions medicaid
Last synced: 19 Feb 2026
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 06 Apr 2026
https://github.com/archanakokate/eda_amazon_products_and_discounts_2023
Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data
data-analysis data-mining data-visualization exploratory-data-analysis
Last synced: 03 Jan 2026
https://github.com/a-iceberg/clustering_and_naming_categories
Summarization, clastering and characterization of text categories using LLM
bertscore clustering data-analysis data-science deep-learning gpt llm mssqlserver nlp openai prompt-engineering python summarization transformers
Last synced: 08 Feb 2026
https://github.com/swethajoseph/statistical-stock-performance-analysis
Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.
advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis
Last synced: 03 Jan 2026
https://github.com/prateek5525/retail-sales-analysis-project
This project involves analyzing retail sales data using SQL to uncover insights into sales patterns, customer behavior, and product performance. It serves as an exercise to develop foundational SQL skills in data exploration, cleaning, and analysis.
data-analysis data-cleaning retail-sales-data sql
Last synced: 03 Jan 2026
https://github.com/aabbtree77/uci-marketing-analysis-cart
UCI bank marketing data analysis with decision trees (CART).
cart chatgpt commerce conversion-rate data-analysis decision-trees deepseek grok kovnatsky marketing-analytics miniconda scikit-learn-python uci-machine-learning
Last synced: 29 Jul 2025
https://github.com/hasinii12/-chocolate-analysis-dashboard
This Power BI report provides a comprehensive analysis of chocolate ratings and related attributes.
data-analysis data-visualization powerbi
Last synced: 09 Feb 2026
https://github.com/jofaval/teams-assistance-graph
Visualize in a Graph your Teams Attendance Report
charts data-analysis data-visualization data-viz github-copilot highcharts highcharts-js javascript microsoft-teams
Last synced: 08 Sep 2025
https://github.com/kanasia-moore/sql-project
bigquery data-analysis data-analysis-project data-analytics dataset homelessness sql
Last synced: 21 Sep 2025
https://github.com/noorulhudaajmal/business-performance-analytics
Python-Streamlit based interactive dashboard to analyze and visualize key business metrics for an online store.
business-analytics dashboard data-analysis python-streamlit
Last synced: 29 Jul 2025
https://github.com/malakasupun/crime-data-analysis-of-lapd
This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).
data-analysis data-visualization llm nlp sql
Last synced: 29 Jul 2025
https://github.com/yash22222/literacy-exploration-analysis
Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.
csv data-analysis data-visualization government-data india literacy literacy-analysis states
Last synced: 29 Jul 2025
https://github.com/nguyenda18/ppp-data-tool
Command line tool (could later be used as lambda function) to download CSV files from SBA and generate JSON
data-analysis nodejs-server ppp-files ppp-loans
Last synced: 29 Jul 2025
https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds
With recommendations to UK Department for Education of 10 Local Authorities where National Tutoring Programme (NTP) should be intensified and a response to UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target which seems far-fetched.
data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics
Last synced: 21 Sep 2025
https://github.com/pentalpha/eu-car-emissions-analysis-2015
Analysis of CO² Emissions on Passenger Cars at the E.U. Contries, Year 2015.
data-analysis data-science dataset jupyter-notebook python python3
Last synced: 15 May 2026
https://github.com/naso7y/twitter-sentiment-analysis
Classifies airline-related tweets as positive, negative, or neutral using machine learning and NLP.
data-analysis machine-learning nlp sentiment-analysis
Last synced: 29 Jul 2025
https://github.com/finnishcancerregistry/directadjusting
Compute estimates of weighted averages with confidence intervals.
biostatistics confidence-intervals data-analysis direct-adjusting direct-adjustment epidemiology health-statistics r r-package statistical-adjusting statistical-adjustment weighted-analysis weighted-average weighted-averages
Last synced: 26 Mar 2025
https://github.com/douglasdavis/twaml
tW Analysis Machine Learning
data-analysis high-energy-physics machine-learning python
Last synced: 16 Aug 2025
https://github.com/pentalpha/bti-performance-study
A series of analysis on a large amount of data about the grades of students in the Technology Information course at UFRN
analysis big-data clustering data-analysis data-science data-visualization ipynb ipython jupyter-notebook performance-analysis plot python python3
Last synced: 15 May 2026
https://github.com/mtholahan/advanced-mysqlquery-tuning-mini-project
Analyzed EuroCup 2016 data with advanced SQL queries. Imported CSV datasets into MySQL, designed schema with match, player, and referee details, and implemented queries covering match outcomes, penalty shootouts, player stats, bookings, substitutions, and referee activity to explore tournament dynamics.
bootcamp data-analysis data-engineering data-modeling database eurocup football mysql queries soccer sports springboard sql
Last synced: 15 May 2026
https://github.com/apostolis-bloutsos-data/employee-data-eda
Mini EDA project on synthetic employee records using Python, pandas, and matplotlib
data-analysis eda jupyter-notebook matplotlib pandas python seaborn
Last synced: 09 May 2026
https://github.com/eesunmoon/genai_cor-recom
[Project] Outfit Coordination Recommender System using KoAlpaca
data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium
Last synced: 06 Apr 2026
https://github.com/qalita-io/tutorials
Tutorials how to best use Qalita Products
data-analysis data-engineering data-quality data-quality-checks documentation qalita tutorials
Last synced: 17 Jan 2026
https://github.com/prestonjohnson-portfolio/marketing-data-portfolio-project
Analyzing Marketing Data for Future Improvements
data data-analysis data-visualization powerbi sql sql-server
Last synced: 21 Sep 2025
https://github.com/dcs-training/scottishaccounts
This repo contains various examples of analysis that can be performed on the Statistical Accounts of Scotland dataset. Go to the readme file
data-analysis data-visualisation data-wrangling geographical-data r rmarkdown text-analysis
Last synced: 16 Aug 2025
https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook
using numpy and pandas to analyze a synthetic loan dataset with python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 12 Mar 2025
https://github.com/danicaalana/sales-review-sentiment-analysis
This project is a sentiment analysis project using a machine learning model. It analyzes Amazon product reviews to determine whether the sentiment expressed is positive, negative, or neutral using Multinomial Naive Bayes Method.
amazon data-analysis data-science machine-learning naive-bayes python sales-review sentiment-analysis
Last synced: 15 May 2026
https://github.com/nishumehta/uber-rides-data-analysis
An in-depth analysis of Uber ride data for the year 2016, to uncover patterns in ride behavior, mileage trends, and frequent start locations to generate actionable insights for business decisions.
data-analysis jupyter-notebook matplotlib-pyplot pandas python tableau-dashboards
Last synced: 09 May 2026
https://github.com/syarwinaaa09/exploring-airbnb-market-trends
a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.
airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types
Last synced: 30 Apr 2026
https://github.com/daveornedo/ejercicios-machine-learning
Proyecto académico de Machine Learning realizado con Python
algoritmos data-analysis data-visualization diccionario hyperparameter-tuning jupyter-notebook jupyter-notebooks knn lambda-functions lasso list machine-learning phd rstudio
Last synced: 10 Apr 2025
https://github.com/ozep/genshincharacteranalysis
Uses a spreadsheet with Character Data and organizes it into readable graphs.
data-analysis jypyternotebook python
Last synced: 18 Apr 2026
https://github.com/smsraj2001/sds-datathon
A simple data science project/hackathon done as part of SDS course
data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science
Last synced: 16 Jul 2025
https://github.com/jonnor/acm-2019-dbscan
clustering data-analysis data-science health machine-learning nhanes nutrition
Last synced: 03 Apr 2025
https://github.com/truongnhatbui/automatidata
Automatidata
data data-analysis data-science data-visualization python tableau
Last synced: 08 Jul 2025
https://github.com/antrita/stroke_prediction_model
A model that combines Kaggle's Stroke Prediction Dataset with live weather/air quality data to implement FDA-compliant MLOps pipeline and shows expertise in healthcare regulations and real-time inference.
ai data-analysis deep-learning kaggle-dataset machine-learning prediction-model random-forest real-time scikit-learn streamlit weather-api xgboost
Last synced: 07 May 2026
https://github.com/dimits-ts/sport-repression-repl-study
A replication Study for the recent paper "International Sports Events and Repression in Autocracies: Evidence from the 1978 FIFA World Cup" paper.
data-analysis jupyter regression-models replication-study statistical-analysis
Last synced: 30 Jun 2026
https://github.com/rijul007/market-basket-analysis-using-r
Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.
Last synced: 02 Apr 2025
https://github.com/annnieglez/computer-vision-parking-lot
This project leverages computer vision techniques to analyze parking lot occupancy. The goal is to detect available parking spaces in real-time using image and video input.
computer-vision data-analysis data-science data-visualization google-colab image-classification image-processing machine-learning python transfer-learning
Last synced: 15 May 2026
https://github.com/sinsunsan/earth-survival-kit
Global warning data visualisation app to make everyone understand global warning and take actions that matter
angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts
Last synced: 05 May 2026
https://github.com/nathadriele/transaction_fraud_prevention_pipeline
Uma solução de detecção e prevenção de fraudes em transações financeiras, combinando Machine Learning, regras de negócio e análises estatísticas avançadas. O sistema oferece um dashboard interativo para monitoramento em tempo real, análise de dados e gestão de alertas de fraude.
data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost
Last synced: 10 Apr 2026