Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/paladitya/cn_term_project
Code for testbed
automated-testing data-analysis reliability-score tcl testbed wrapper-library
Last synced: 02 Mar 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/llnl/cap
HPC workflow that automates the tedious actions of compiling, analyzing, and parsing with bincfg
data-analysis hpc python workflows
Last synced: 17 Jun 2026
https://github.com/jjfiv/csc212spellchecking
Data Structure Analysis for Spell Checking
Last synced: 03 Mar 2026
https://github.com/abhipatel35/diabetes_ml_classification
Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.
classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn
Last synced: 20 Jan 2026
https://github.com/anas436/student-performance-analysis
In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.
data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn
Last synced: 16 Apr 2026
https://github.com/kheriberto/bedu_dc
Ejercicios del curso de "python desde 0" de la plataforma BEDU
Last synced: 18 Jun 2026
https://github.com/mugambi645/exploring-ebay-car-sales-data
Exploring ebay car sales dataset
car-sales data-analysis numpy pandas
Last synced: 16 Apr 2026
https://github.com/abhiram-kandiyana/us-bikeshare-analysis
Explorative analsis on a bike-share system (Motivate) to understand it's pain points
data-analysis data-visualization
Last synced: 26 Mar 2025
https://github.com/soumya-kushwaha/uber-analysis
data-analysis data-science data-visualization uber-analysis
Last synced: 16 Apr 2026
https://github.com/asghar-rizvi/eda_student_dataset
This repository contains the results of data analysis and exploratory data analysis (EDA) conducted on the Student_Dataset. The analysis focuses on understanding various factors affecting student grades and visualizing these relationships using Matplotlib and Seaborn.
data-analysis data-analysis-python data-science jupyter-notebook python3
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/ayushsiloiya619/online-food-orders-analysis
Data Analytics with Python
data-analysis data-visualization matplotlib pandas-dataframe python3 seaborn-python
Last synced: 08 May 2026
https://github.com/steno-aarhus/mediation-analysis-course
Modern mediation analysis for basic, clinical and epidemiological research in diabetes and endocrinology
data-analysis data-analysis-in-r diabetes diabetes-epidemiology mediation-analysis open-educational-resource
Last synced: 03 Mar 2026
https://github.com/grindelfp/logistic-regression-study
Example of logical regression data analysis and exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/deepanshkhurana/udacityproject-prediciting-boston-housing-prices
This is a Udacity Project for the Machine Learning Nanodegree. Here, we are trying to predict Boston Housing Prices using sklearn.
data-analysis data-science machine-learning python scikit-learn udacity
Last synced: 08 May 2026
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 02 Apr 2026
https://github.com/banner-19/extraction-and-analysis-of-text
The objective is to analyze text content from a list of URLs. This involves extracting article titles and text, then performing natural language processing to generate metrics like sentiment, readability, and word usage. Finally, the results are stored for further analysis or visualization.
data-analysis data-analytics data-science nlp nltk python3 text-analysis text-extraction
Last synced: 03 May 2026
https://github.com/lintangwisesa/ujian_analyticsvisualization_jcds07
Panduan Soal Ujian Data Analytics & Visualization Job Connector Data Science batch 7
data-analysis data-science data-visualisation exam
Last synced: 04 Mar 2026
https://github.com/victorlcastro-dsa/coping_struggles_prediction
Repositório para prever dificuldades de enfrentamento com base em dados de saúde mental. Inclui análise, visualização e modelagem usando aprendizado de máquina. Resultados alcançam 86.58% de acurácia com um Voting Classifier.
classification-algorithm data-analysis data-science data-visualization machine-learning-algorithms problem-solving project-based-learning python
Last synced: 19 Apr 2025
https://github.com/edanur-y/bank-customer-churn-prediction-with-classification-models
Comparing the performances of multi-layer perceptron, decision tree, random forest, gradient boosting and extreme gradient boosting classifications on customer data to predict their status of exiting the bank.
data-analysis data-transformation hyperparameter-tuning python
Last synced: 16 Apr 2026
https://github.com/abhipatel35/gym-performance-analysis
Analyzing gym performance and user engagement in Arizona using Spark SQL, PySpark, and visualization techniques on the Yelp dataset.
apache-spark asu business-insights data-analysis data-processing-at-scale data-visualization dps gym-analysis rating-patterns sql trend-analysis user-insights yelp-dataset
Last synced: 16 Apr 2026
https://github.com/kosuri-indu/allaboutolympics
All About Olympics is an interactive dashboard presenting comprehensive data and insights on Olympic Games from 1896 to 2020.
data-analysis pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/samuelson777/titanic-dataset-analysis
Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.
data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset
Last synced: 16 Apr 2026
https://github.com/ronaessi-28/sales-data-analysis-visualization-project
A comprehensive data analysis and visualization project using Python, Pandas, Matplotlib, Seaborn, and Streamlit. The project explores Superstore sales data to uncover trends, region-wise performance, product category insights, and builds an interactive dashboard.
data-analysis data-visualization eda matplotlib pandas plotly python-project sales-dashboard seaborn streamlit
Last synced: 16 Apr 2026
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 04 Mar 2026
https://github.com/katiesaund/tidy_tuesday
A weekly data project in R from the R4DS online learning community
data-analysis data-visualization datascience plot r rstats tidytuesday
Last synced: 24 Mar 2025
https://github.com/ryan-wong1/analyzing-arrest-patterns-in-chicago-data-analysis
Chicago Police Department (CPD) arrest data on offenses, locations, and demographics
data-analysis data-cleaning data-visualization exploratory-data-analysis matplotlib pandas python seaborn
Last synced: 08 May 2026
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 01 May 2026
https://github.com/johannaschmidle/netflix-subscription-analysis
Examined Netflix subscription data to understand market behaviour, predict future trends, and identify consumer preferences. [SQL, Tableau]
data-analysis data-cleaning data-trend data-visualization netflix
Last synced: 05 Mar 2026
https://github.com/ribin-baby/the-sparks-foundation-data-science-internship
This repository contains tasks and solutions assigned as part of internship program. This repository contains workbooks on data analysis and model building parts.
Last synced: 16 Apr 2026
https://github.com/hubtou/adsv
Analyze delimiter-separated values files
command-line-tool csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining learning-python pnu-project python servier shell tools unix utility
Last synced: 17 Apr 2026
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/divyanshugit/indian-judiciary-analysis
Analysis of Indian district court data across states.
Last synced: 02 Jul 2025
https://github.com/pizofreude/divvybikes-share-success
Developing data-driven marketing campaign for Divvy to convert casual riders into annual members. Divvy is a bike-share program of the Chicago Department of Transportation (CDOT).
airflow bi-analytics data-analysis data-engineering data-visualization database dbt docker etl jupyterlab python r redshift s3
Last synced: 17 Apr 2026
https://github.com/katiebuntic/research_methods
Data Science Research Methods
analysis data-analysis data-science python research-project
Last synced: 23 Jun 2026
https://github.com/preetesh21/spotme
This repository is using the web-based API provided by Spotify to retrieve data and then analyse it.
Last synced: 18 Jun 2026
https://github.com/ddsuhaimi/turkiye-student-evaluation-eda
A little bit of exploration of well-known Turkiye Student Evaluation dataset
data-analysis data-science data-visualization-project exploratory-data-analysis exploratory-data-visualizations
Last synced: 18 Jun 2026
https://github.com/rosanafss/r-journey
Diving into to wonderful see of DATA
Last synced: 19 Nov 2025
https://github.com/cnoret/ibm-data-analyst-professional
Final project & Courses Notebooks
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization ibm ibm-certificate ibm-data-analyst-professional ibm-datascience-certification pandas python
Last synced: 09 May 2026
https://github.com/0-mostafa-rezaee-0/sandwich_structures
Impact test of Sandwich Structures
composite-materials data-analysis r
Last synced: 09 Aug 2025
https://github.com/nathadriele/ifood-data-governance-pipeline
Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.
airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit
Last synced: 02 Apr 2026
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 20 Jan 2026
https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql
Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.
analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language
Last synced: 16 Oct 2025
https://github.com/nahiyanhkhan/sales-insight-dashboard_powerbi
Build a dashboard to display the sales insights of a company's sales data over the 4 years period. It includes displaying revenue, sales quantity in different regions over the years.
dashboard data-analysis data-analytics data-visualization powerbi salesdashboard
Last synced: 08 Jan 2026
https://github.com/zachbateman/easy_plot
Easy Statistical Visualization in Python
data-analysis data-visualization graphics matplotlib python seaborn
Last synced: 18 Jan 2026
https://github.com/bocchio01/skyward_recruitment_assignment
Assignment to join the PoliMi SkyWard software team
data-analysis kalman-filter model-rocket
Last synced: 15 Mar 2025
https://github.com/anuragmudgal96/data-warehouse-project
Designing and implementing a modern data warehouse on SQL Server, covering ETL pipelines, dimensional modeling, and analytical reporting.
data-analysis data-engineering data-warehouse datawarehousing etl etl-job etl-pipeline sql sql-server
Last synced: 09 Oct 2025
https://github.com/shivakumarhl/digital-music-store-analysis
Digital Music Store Data Analysis using SQL
Last synced: 10 Mar 2026
https://github.com/eliasdehondt/learn-r
Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.
data-analysis data-visualization education machine-learning programming r statistics tutorials
Last synced: 03 Apr 2026
https://github.com/nicodupont/kaggle
All my Data analysis and Competitions
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 17 Apr 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/atlassandx90/cryptocurrency-volatility-prediction
Cryptocurrency volatility prediction ML pipeline
cryptocurrency data-analysis data-science data-visualization machine-learning
Last synced: 17 Apr 2026
https://github.com/shimazadeh/ft_linear_regression
Implementing a modular linear regression from scratch to predict the price of cars using a gradient descent algorithm.
data-analysis data-science hyperparameter-tuning linear-regression predictive-modeling
Last synced: 03 Jun 2026
https://github.com/shru924/ecommerce_customer_behavior_analysis
A machine learning project that analyzes and segments e-commerce customers based on behavior patterns using Python, Random Forest, and data visualization.
customer-segmentation data-analysis jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/ridemountainpig/education-level-data-analysis
An analysis of the relationship between education levels, unemployment rates, and credit card spending in Taiwan's six major cities.
data-analysis matplotlib pandas-python
Last synced: 17 Apr 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Freecodecamp's Data Analysis with Python Projects Code
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/nathaliacosim/migration-patrim
Automação para extração, conversão e migração de dados patrimoniais para o sistema patrimônio cloud da betha sistemas. O projeto garante um fluxo estruturado e seguro de transferência de informações, utilizando C# (.NET Framework), PostgreSQL e integração via API.
conversion-tool data-analysis data-conversion data-transformation dotnet dotnet-code dotnet-console-app migration-tool
Last synced: 17 Apr 2026
https://github.com/kgotsosm/fcc-data-analysis
Notebooks created for the Data Analysis Course on freeCodeCamp
data-analysis data-visualization matplotlib pandas seaborn
Last synced: 17 Apr 2026
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Prédire si un invidu est positif au COVID19 ou non.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/royungar/sql_chicago_data_analysis_project
SQL-based data analysis project using SQLite, pandas, and Jupyter SQL magic commands. Analyzes crime, school, and census data from Chicago to explore socioeconomic patterns using filtering, joins, aggregation, and subqueries.
aggregation census-data chicago crime-data data-analysis data-engineering education-data ibm jupyter-notebook pandas sql sqlite subqueries
Last synced: 04 Jun 2026
https://github.com/mansogf/datascience_introduction
Data Science Introductions Practices
data-analysis data-science data-visualization graph
Last synced: 04 Apr 2025
https://github.com/l1ght14/customer-churn-prediction
Predict customer churn using machine learning models like Logistic Regression and Random Forest. Includes data preprocessing, model evaluation, feature importance, and insights to drive retention strategies.
churn-prediction classification customer-churn customer-churn-prediction data-analysis logistic-regression machine-learning python random-forest scikit-learn telecom
Last synced: 09 May 2026
https://github.com/curtisalexander/cramisc
Personal R functions for data analysis
Last synced: 12 Mar 2025
https://github.com/duoan/ds-nbs
Data analysis and machine learning notebook.
data-analysis data-scientists deep-learning kaggle-competition machine-learning
Last synced: 18 Jun 2026
https://github.com/royungar/automotive_sales_insights_dashboard
Data visualization project analyzing automotive sales, recalls, and customer sentiment using IBM Cognos Analytics. Features KPIs, treemaps, heatmaps, and advanced visual storytelling techniques.
automotive-industry business-intelligence cognos-analytics csv customer-sentiment dashboard data-analysis data-engineering data-visualization eda excel heatmap ibm kpi recall-analysis sales-data treemap
Last synced: 04 Jun 2026
https://github.com/cescedes/this-is-jeopardy
Writing several functions that investigate a dataset of Jeopardy! questions and answers.
codecademy data-analysis python
Last synced: 11 Apr 2026
https://github.com/vitornegromonte/eda_stroke
Exploratory data analysis in the stroke prediction dataset
data-analysis data-science exploratory-data-analysis kaggle-dataset visualization
Last synced: 17 Apr 2026
https://github.com/gabrieladados/analise-ecommerce
Análise SQL para E-commerce: Estratégias de Crescimento para Impulsionar Vendas
bigquery data-analysis ecommerce sql
Last synced: 31 Mar 2025
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 17 Apr 2026
https://github.com/jbalooshie/movies-etl
Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.
data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy
Last synced: 11 Apr 2026
https://github.com/atharvapathak/rsvp_movies_case_study
SQL queries performed on IMDb database to provide recommendations to RSVP Movies based on insights.
data-analysis data-cleaning data-science imdb-dataset rsvp-movies sql
Last synced: 28 Jan 2026
https://github.com/q-viper/blog-notebooks
This is the repo to store most of my blogs in dataqoil.com and q-viper.github.io.
data-analysis data-science machine-learning-algorithms timeseries
Last synced: 04 Apr 2026
https://github.com/drod75/burger_king_analysis
A simple analysis on a burger king dataset.
data-analysis data-visualization jupyter-notebook pandas python seaborn
Last synced: 09 May 2026
https://github.com/sanam2405/ahs
This contains the analysis of result of AHS Madhyamik Examination 2022
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/aliiahmadi/data-manipulation
Algorithms and solutions
algorithm algorithms data-analysis data-science
Last synced: 30 Jun 2025
https://github.com/mikeesto/ausvotes19
:bird: A collection of 67,284 public tweets published on the night of the 2019 Australian election
australia data-analysis data-visualization elections open-data twitter
Last synced: 06 Apr 2025
https://github.com/nicovandenhooff/kaggle-competitions
A repository that contains my Kaggle projects.
data-analysis data-visualization deep-learning exploratory-data-analysis kaggle machine-learning matplotlib modeling neural-network numpy pandas seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/pawlo77/airline-performance-data-analysis
Preprocessing of structured data - part of IAD study program, Faculty of Mathematics and Information Science, Warsaw University of Technology
data-analysis data-science visualization
Last synced: 10 May 2026
https://github.com/awanraskall/retail-demand-analysis
Data analysis of retail meal orders, fulfillment centers, and product demand using Python
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 18 Apr 2026
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded from kaggle api and then data cleaning and analysis is performed
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026
https://github.com/lit26/novel-corona-virus-2019
Data Analysis for Novel Corona Virus 2019
analysis coronavirus-case data-analysis sir-model
Last synced: 10 Jun 2025
https://github.com/vvhacker007/technocolabs
This repo contains the projects that were assigned to me during the internship.
data-analysis data-science flask heroku-deployment internship machine-learning project streamlit website
Last synced: 18 Apr 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026
https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015
This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales
Last synced: 18 Apr 2026
https://github.com/hugo-hattori/watercraft_values_ai_prediction
Data Science Project.
ai-model artificial-intelligence artificial-intelligence-algorithms data-analysis data-analytics data-science jupyter jupyter-notebook machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas pandas-dataframe pandas-python python seaborn sklearn sklearn-library sklearn-metrics
Last synced: 23 Aug 2025
https://github.com/rizkipragustono/data_analysis_spark
Exploration: Data Analysis using Spark
apache-spark data-analysis pyspark python spark-sql sql
Last synced: 09 May 2026
https://github.com/manalisbhavsar/mall-customers-clustering
K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 02 Sep 2025
https://github.com/mtimma001/clinical-trial-data-tool
Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.
bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python
Last synced: 05 Apr 2026
https://github.com/satti-hari-krishna-reddy/data-whisperer
Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. it combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.
ai data-analysis data-visualization llm python3 streamlit
Last synced: 18 Apr 2026
https://github.com/al-ghaly/prosper-loans-analysis
A statistical Analysis Project, to analyze the data of a finance company’s loans Using Python packages (pandas – NumPy – seaborn – matplotlib)
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 18 Apr 2026
https://github.com/kwokhing/visualizing-datasets-with-facets
Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative
anaconda data-analysis data-visualization facets jupyter-notebook missing-data open-source python skewness unbalanced-data visualisation visualization
Last synced: 18 Apr 2026