Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-02-07 00:07:52 UTC
- JSON Representation
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 31 Jul 2025
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 09 Nov 2025
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/grburgess/gbm_kitty
Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.
3ml catalogue data-analysis fermi-science grbs pipelines
Last synced: 29 Jun 2025
https://github.com/techytushar/india-odi-analysis
Analysis of ODI cricket matches of Indian Team
cricket data-analysis data-science pandas plotting python3
Last synced: 31 Jul 2025
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 28 Dec 2025
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 30 Aug 2025
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 07 Apr 2025
https://github.com/juliamanifolds/multivariatedataanalysis.jl
Multivariate data analysis using geometric algorithms made easy!
data-analysis geometric-algorithms julia multivariate-statistics
Last synced: 10 Jun 2025
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 17 Nov 2025
https://github.com/kenvilar/data-analysis-using-python
Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3
bs4 data-analysis jupyter pandas python python3 requests xlrd
Last synced: 04 Oct 2025
https://github.com/praju-1/pandas
The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.
Last synced: 23 Apr 2025
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 28 Jul 2025
https://github.com/mljar/enrichment
Data enrichment with AI for pandas DataFrame
data-analysis data-enrichment data-science openai pandas
Last synced: 01 Jul 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 18 Sep 2025
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 05 Jan 2026
https://github.com/kalebers/data_streams_parametric_t-sne
Research for Parametric T-SNE in high to low dimensional data stream, published in 2021 by Kalebe Rodrigues Szlachta and Andre de Macedo Wlodkovski, oriented by Jean Paul Barddal, Computer Science graduation from Pontifical Catholic University of Parana (PUCPR)
classifier data-analysis data-science data-visualization machinelearning parametric parametric-tsne python tsne-algorithm tsne-visualization
Last synced: 08 May 2025
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Jul 2025
https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes
In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization
Last synced: 05 Jan 2026
https://github.com/happymyguy/mining-data-viewer
A web application for visualising and analysing mining project data. Features an interactive data table, project-specific dashboards with location mapping, and analytics visualisation tools. Built with FastAPI, Python, and modern web technologies.
analytics dashboard data-analysis data-visualisation datatables fastapi gis-mapping leaflet mining mining-industry python web-application
Last synced: 15 Mar 2025
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 07 Apr 2025
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 15 Mar 2025
https://github.com/yusufcinarci/covid-19-data-analysis-visualization
The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.
covid-19-data-visualization data-analysis data-science data-visualization
Last synced: 22 Jul 2025
https://github.com/louislefevre/sstubs-miner
Data mining and analysis for the ManySStuBs4J dataset.
data-analysis data-mining manysstubs4j-dataset msr
Last synced: 30 Mar 2025
https://github.com/mertcandav/julenum
A high-performance library for numerical methods and scientific computing in Jule
data-analysis jule julelang math matrix scientific-computing statistics
Last synced: 29 Jul 2025
https://github.com/mainakverse/ml-algorithms-starter
List of machine learning algorithms that are needed to start with ML projects and lay a foundation into data science
data-analysis data-science jupyter-notebooks machine-learning-algorithms practice
Last synced: 19 Apr 2025
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook oneness-consciousness open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 14 Apr 2025
https://github.com/tesfamichael12/solar-farm-analysis
This repository contains code and analysis for exploring solar farm data from Benin, Sierra Leone, and Togo. It includes EDA, strategic recommendations for optimal solar farm locations, and an interactive Streamlit dashboard.
data-analysis eda ml solar-farm-analysis
Last synced: 07 Aug 2025
https://github.com/supertetelman/kaggle-public
A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.
cnn data-analysis deep-learning machine-learning matlab python
Last synced: 23 Mar 2025
https://github.com/victor-lis/regression-ai-model-practice
ai data-analysis python regression-model
Last synced: 01 Apr 2025
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 04 Mar 2025
https://github.com/nelson-gon/nelson-gon.github.io
Biologically Plausible Programming
bioinformatics blog blogdown computational-biology data-analysis data-exploration ghost ghostwriter-theme github github-pages hugo-site hugo-theme programming python3 r side-project
Last synced: 26 Jul 2025
https://github.com/cego669/datathonengopevi
Equipe: Embrapeiros. Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Obs: FOMOS CAMPEÕES!!!!!!!!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 26 Jul 2025
https://github.com/felixcharotte/ibm_datascience_capstone
In this project, we predicted if the SpaceX Falcon 9 first stage will land successfully by following the data science methodology. We also summarized the results for the business stakeholders.
analysis data-analysis data-science data-visualization databases folium jupyter-notebook machine-learning machine-learning-alrgorithms matplotlib pandas plotly plotly-dash python scikit-learn scipy seaborn sql
Last synced: 26 Jul 2025
https://github.com/billy-enrizky/kimia-farma-sales-management-database-replica-project
SQL Database Management, Then Visualizing it on Tableau!
analytics data-analysis data-visualization sql
Last synced: 27 Jul 2025
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 22 Sep 2025
https://github.com/ezzz-lui/rsm-evaluationproject
Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst
Last synced: 04 Oct 2025
https://github.com/fernandezfran/exma
A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data
computational-physics data-analysis molecular-dynamics oop python science
Last synced: 16 Jan 2026
https://github.com/gallillio/data_science-data_visualizer_tool
## About Supervised ML Helper is a Python application that streamlines exploratory data analysis (EDA) and preprocessing for supervised machine learning. Featuring a user-friendly Tkinter interface, it enables users to load CSV files, visualize data, and perform essential transformations, making data preparation accessible for all skill levels.
data-analysis data-science data-visualization matplotlib numpy pandas seaborn sklearn
Last synced: 11 Jul 2025
https://github.com/patex1987/temperature-calibration
Notebook for sensor calibration evaluation
calibration data-analysis jupyter-notebook sensor
Last synced: 20 Jun 2025
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 03 Apr 2025
https://github.com/stelios-c/gps_analysis
Analysis of GPS interruption data in public domain
data-analysis electronic-warfare gps gps-quality jamming jupyter-notebook osint pandas python spoofing web-scraping
Last synced: 12 Apr 2025
https://github.com/deep-diver/data-analysis-on-titanic
applying data analysis on titanic data sheet
Last synced: 30 Mar 2025
https://github.com/praveendecode/product_sentiment_analysis
This project employs NLTK, Prowebscraper, and Python for sentiment analysis on online product reviews. Through web scraping, EDA, and NLP, it evaluates user satisfaction by comparing actual ratings and sentiment scores
data-analysis data-visualization natural-language-processing nltk-python product-analysis python sentiment-analysis
Last synced: 04 Apr 2025
https://github.com/tameronline/tameronline
Showcasing Projects on Data Analysis, Programming, and AI — Developed Using Python and Modern Frameworks
data-analysis deep-learning flask machine-learning numpy pandas python3 sql web-development
Last synced: 11 Jun 2025
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 01 Nov 2025
https://github.com/prithivsakthiur/data-board
Data Boards - Visualization of various plots ( Analysis )
data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces
Last synced: 28 Oct 2025
https://github.com/rapidsurveys/oldr
An Implementation of the Rapid Assessment Method for Older People (RAM-OP)
assessment data-analysis odk r ram-op rapid-assessment
Last synced: 12 Apr 2025
https://github.com/elysian01/ml-eda-and-modelling-using-streamlit
Beautiful Web interface made using Streamlit for quick Exploratory Data Analysis and building classification models which are implemented from scratch.
data-analysis data-visualization eda exploratory-data-analysis knn-classification logistic-regression matplotlib ml-model-on-web ml-models naive-bayes-classifier pandas seaborn streamlit streamlit-webapp
Last synced: 12 Apr 2025
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 02 Feb 2026
https://github.com/salvatorecorvaglia/estimation-data-analysis-with-applications
Bitcoin Price Prediction using LSTM and ARIMA with Quandl API and Prophet Facebook with CoinMarketCap API
arima bitcoin blockchain coinmarketcap-api data-analysis lstm prophet-facebook quandl-api
Last synced: 04 Apr 2025
https://github.com/jesussantana/data-science-with-python-it-academy
Learn how to extract value from data by ingesting, transforming, storing, analyzing, and visualizing data
classification-model clustering-methods dash data-analysis data-mining data-science database machine-learning matplotlib mongodb numpy pandas plotly python3 regression-models seaborn sklearn sql web-scraping
Last synced: 04 Aug 2025
https://github.com/pycroscopy/scalable_analytics
Self-help guides for scalable data analytics
data-analysis deep-learning jupyter-notebook python remote virtual-machine
Last synced: 11 Apr 2025
https://github.com/manmolecular/http-response-clustering
:chart_with_downwards_trend: Clustering of HTTP responses using k-means++ and the elbow method
data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3
Last synced: 17 Jun 2025
https://github.com/vvipjain/ecommerce-sales-analysis
Ecommerce Sales Analysis
data-analysis pandas pandas-dataframe python sql sqlalchemy
Last synced: 28 Jun 2025
https://github.com/cosmoduende/r-marvel-vs-dc
DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R
comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros
Last synced: 11 Apr 2025
https://github.com/phollemans/cwutils
CoastWatch Utilities software for working with satellite data files from NOAA CoastWatch and elsewhere
cdat coastwatch-utilities data-analysis data-visualization install4j java noaa-coastwatch remote-sensing satellite-imagery
Last synced: 02 May 2025
https://github.com/ikanurfitriani/project-data-analysis-python
This repository contains the results of data analysis learning using the Python.
data-analysis data-analysis-project data-analysis-python python
Last synced: 21 Mar 2025
https://github.com/kevinyang372/san-francisco-crime-data-analysis
An ARIMA prediction model for forecasting potential crimes based on users' time and location
data-analysis machine-learning
Last synced: 29 Oct 2025
https://github.com/yukito0209/predict-podcast-listening-time
Kaggle · Playground Prediction Competition, Playground Series - Season 5, Episode 4
data-analysis ensemble-learning jupyter-notebook kaggle-competition machine-learning prediction
Last synced: 10 Apr 2025
https://github.com/mdh266/wikimedia_challenge
Analyzing click-through rates from Wikimedia
data-analysis data-challenge exploratory-data-analysis matplotlib pandas python
Last synced: 26 Mar 2025
https://github.com/alexandregazagnes/unilasalle-public-resources
UniLaSalle-Public-Ressources : This public repository contains the notebooks and the data used for both : 2nd Year - Practical Statistical Tests 4th Year - Data Analysis with Python
data data-analysis data-analytics data-cleaning data-storytelling education educational exploratory-data-analysis python python3 r r-programming rstudio statistics visualization
Last synced: 06 Apr 2025
https://github.com/openpmd/openpmd-ccd
A Python Module & LabView Bindings for Storing CCD Images with openPMD
ccd data-analysis database hdf5 open-data open-science openpmd
Last synced: 23 Feb 2025
https://github.com/CAIDA/submarine-cable-impact-analysis-public
This repository contains tools implemented for the PAM 2020 paper "Unintended consequences: Effects of submarine cable deployment on Internet routing" to collect and analyze data depicting the impact of the South-Atlantic Cable System (SACS) launch on Internet routing. This codebase can be extended to other use-cases of cable launches, failures, etc.
africa-americas africa-south-america bgp-data-analysis caida-ark-measurement-platform data-analysis historical-traceroutes impact internet-routing ripe-atlas-measurement-platform sacs-cable sail-cable submarine-cables
Last synced: 06 Apr 2025
https://github.com/kevinschoon/qviz
QViz Interactive Plotting
data-analysis data-visualization go gonum qframe yaegi
Last synced: 24 Dec 2025
https://github.com/iantomasinicola/portfoliodataanalyst
Progetto di Data analysis con Python, Microsoft Sql Server e Excel
data-analysis excel python sql
Last synced: 22 Feb 2025
https://github.com/quantumudit/demographic-data-analysis
This project focuses on analyzing and finding correlations between the three important metrics by 195 countries,i.e., birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 12 Oct 2025
https://github.com/pizofreude/data-career-navigator
An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.
codeinplace data-analysis data-visualization standford-university
Last synced: 26 Jun 2025
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 04 Mar 2025
https://github.com/andreaschandra/who-suicides-statistics
Exploratory Data Analysis for Suicides using Python
data-analysis data-science eda python
Last synced: 06 Apr 2025
https://github.com/antononcube/wl-outlieridentifiers-paclet
Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.
data-analysis hampel outlier-detection outliers
Last synced: 16 Jan 2026
https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring
A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.
data-analysis patient-monitoring
Last synced: 30 Jan 2026
https://github.com/idaraabasiudoh/vehicle-co2emission_model
Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained on sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.
data-analysis jupyter-notebook machine-learning python3 scikit-learn
Last synced: 04 Mar 2025
https://github.com/pangeo-data/foss4g-2022
Pangeo tutorial at FOSS4G 2022
data-analysis hvplot pangeo time-series xarray
Last synced: 12 Apr 2025
https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible
Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file
data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics
Last synced: 25 Jan 2026
https://github.com/milind220/hk-air-quality-analysis
My final project for a statistics and data analysis course. Whew that was a lot of graphs!
data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics
Last synced: 22 Feb 2025
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 03 Jan 2026
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 12 Sep 2025
https://github.com/aalkiyumi/senior-design-project
Web scraper for collecting product and review data from e-commerce sites using Scraping Bee, AWS, Selenium, and Pandas. Focuses on cost-effective solutions, user-friendly interfaces, and efficient data extraction and analysis.
aws cs5001 data-analysis data-extraction data-processing data-storage e-commerce-analytics e-commerce-data pandas product-reviews review-sentiment-analysis scraping-bee selenium senior-design-project uc uc2026 university-of-cincinnati web-crawlers web-scraping
Last synced: 13 Apr 2025
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 05 Apr 2025
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 02 Mar 2025
https://github.com/maciekmalachowski/crypto-charts-site
📊Application that returns financial data for selected cryptocurrency.
binance-api data-analysis jupyter-notebook matplotlib mplfinance numpy pandas python python-binance
Last synced: 02 Apr 2025
https://github.com/hariprashad-ravikumar/ai-datascience-lab
AI‑DataScience‑Lab is a web app for uploading CSV datasets, cleaning with Pandas, and running quick exploratory analyses and regression models using scikit‑learn. Its modular design supports future AI extensions, like deep learning with TensorFlow or insight generation via the OpenAI API.
ai api azure cloudcomputing data data-analysis data-science data-visualization mathplotlib numpy openai pandas python scikit-learn
Last synced: 02 Aug 2025
https://github.com/haloapping/malas-ngetik-clf
Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.
data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn
Last synced: 24 Feb 2025
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs
Last synced: 13 Apr 2025
https://github.com/chaitanyac22/customer-segmentation-using-rfm-analysis-for-online-retail
This project uses RFM (Recency, Frequency, and Monetary) segmentation to analyze customer behavior and provide insights for targeted marketing campaigns. By classifying customers based on their purchasing patterns, strategies can be tailored to improve customer retention, drive growth, and maximize the lifetime value of each customer.
customer-segmentation data-analysis data-science data-visualization exploratory-data-analysis marketing numpy pandas python python3 rfm rfm-analysis rfm-segmentation
Last synced: 11 Jun 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 18 Mar 2025
https://github.com/adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
apache-parquet csv csv-export data-analysis data-science database datavisualization dataviz duckdb duckdb-database end-of-life endoflife eol jupyter-notebook kaggle kaggle-notebook olap python release-policy release-schedule
Last synced: 30 Dec 2025
https://github.com/emelyantsev/digital-twin-data-analysis
Notebooks for R&D tasks
data-analysis data-visualization
Last synced: 21 Mar 2025
https://github.com/kaguya163/ankara_coffee_sales_analysis
"Coffee shop sales analysis in Ankara. SQL, Tableau, Python, Data Analytics"
data-analysis mysql python sql tableau
Last synced: 06 Jul 2025
https://github.com/juicedata/juicefs-deeplearning-tutorials
Deep Learning and Data Analytics Techniques with the help of JuiceFS.
data-analysis deep-learning filesystem juicefs machine-learning
Last synced: 07 Jul 2025
https://github.com/tushar2704/everyday-sql
Welcome to Everyday SQL Sheets – your go-to resource for everyday SQL cheat sheets, pro tips, interview questions, and more. Whether you're a beginner looking to learn SQL or an experienced developer seeking quick reference materials, this application has got you covered.
artificial-intelligence cheatsheet data-analysis data-science database mysql postgresql query-language sql sqlalchemy streamlit streamlit-tushar2704 tushar2704
Last synced: 30 Dec 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 24 Feb 2025
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 21 Jul 2025
https://github.com/dain55788/ibm-data-engineer-lecture-note
Lecture Notes and Practice Materials of IBM Data Engineering Course
data-analysis database dataengineering datawarehouse ibm
Last synced: 03 Apr 2025
https://github.com/saranshbansal/spam-detection-analytics-tool
This is a nice tool to read chunks of sms data from a csv and understand how different algorithms (pre-implemented) perform in identifying spam messages.
analytics data-analysis data-science data-visualization mysql spring-boot
Last synced: 24 Feb 2025
https://github.com/cworld1/da-learning
Some notes and code about CWorld learning Data Analysis
data-analysis data-science jupyter-book jupyter-notebook python r
Last synced: 17 Mar 2025