Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data
Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material
data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder
Last synced: 28 Jan 2026
https://github.com/gustavohnsv/teamwork_mqa
Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.
data-analysis group-project r team-repo
Last synced: 15 Aug 2025
https://github.com/johnsesana/eda-liquor-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization sql tableau-dashboards
Last synced: 09 Mar 2026
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 14 Oct 2025
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 29 Apr 2026
https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart
The objective of this project is to analyze supermarket sales using pivot table and chart in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore data.
dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet
Last synced: 08 Jan 2026
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 18 Jan 2026
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 27 Feb 2025
https://github.com/shlokashah/student-depression-and-suicide-rate-prediction
https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/
data-analysis data-visualization machine-learning student suicide-rate-prediction
Last synced: 19 Nov 2025
https://github.com/wanglaoshi/wanglaoshi-pypi
Useful tools for DA ML DL
data-analysis deep-learning machine-learning unitility
Last synced: 14 Jan 2026
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/martial2023/bank-performance-analysis
Analyse de données bancaires du Berka Dataset (1993-1998) pour calculer et visualiser des KPI clés
dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy
Last synced: 26 Aug 2025
https://github.com/DCS-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/virajbhutada/music-recommendation-system
This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.
data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience
Last synced: 24 Jan 2026
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 03 Jan 2026
https://github.com/wittline/data-analytics-with-r
Repository for data analytics course using R
cassandra-database cql data-analysis genetic-algorithm pentaho-data-integration r
Last synced: 07 Jul 2025
https://github.com/markmelnic/scalg
List scoring algorithm. Analyse data using a range based procentual proximity algorithm.
algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm
Last synced: 08 Oct 2025
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 05 Feb 2026
https://github.com/jpcadena/onemetric-plus
OneMetric+ project for analytical tool on demand forecast and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 10 Apr 2026
https://github.com/visionkernel/centerspoke
Centerspoke is a data management and analysis tool that allows easy access to cloud databases. Say goodbye to using excel for data management. This open-source CLI tool allows for the rapid processing and analysis of all your data, and makes it easy to upload your excel files into your cloud databases.
cli cloud-database data-aggregation data-analysis data-analysis-python data-management data-science python python3
Last synced: 24 May 2026
https://github.com/dcs-training/r-qgisintegratingspatialanalysis
This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file
data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis
Last synced: 28 Jan 2026
https://github.com/mathieu2301/pbsc-tracker
Expérience de tracking des vélos en libre service fonctionnants avec PBSC
ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker
Last synced: 23 Jun 2026
https://github.com/elfgk/ogretmenanalizantalya
OgretmenAnalizAntalya
analysis data-analysis data-science data-visualization ogretmenanaliz
Last synced: 08 Feb 2026
https://github.com/quantumudit/analyzing-goodreads-famous-quotes
This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 20 May 2026
https://github.com/emso-c/stream-analyser
A tool that analyses YouTube live streams.
cli data-analysis guessing highlights python youtube-video
Last synced: 18 Jan 2026
https://github.com/thennen/py-ivtools
A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 23 Jan 2026
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 30 Jul 2025
https://github.com/yusufcinarci/covid-19-data-analysis-visualization
The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.
covid-19-data-visualization data-analysis data-science data-visualization
Last synced: 22 Jul 2025
https://github.com/fernandezfran/exma
A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data
computational-physics data-analysis molecular-dynamics oop python science
Last synced: 16 Jan 2026
https://github.com/ekosaputro09/Data-Science-References
Some useful resources to learn about Data Science
cheatsheet data-analysis data-science data-visualization machine-learning statistical-learning
Last synced: 22 Nov 2025
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/naso7y/students-performance-analysis
A project analyzing students' academic performance to identify trends and factors affecting outcomes. Built with Python, using data visualization and statistical techniques to derive actionable insights.
data-analysis data-visualization machine-learning python
Last synced: 23 Feb 2026
https://github.com/codebypinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 12 Oct 2025
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 27 Jan 2026
https://github.com/tillbiskup/cwepr
A Python package based on the ASpecD framework for handling cwEPR data.
continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science
Last synced: 06 Sep 2025
https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment
Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.
data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis
Last synced: 09 May 2026
https://github.com/quantumudit/python-projects
Consists of various projects that are primarily powered by Python
data-analysis data-science data-visualization jupyter-notebook projects python pythonapplication pythonprojects
Last synced: 09 Apr 2025
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 05 Jan 2026
https://github.com/kyle-wannacott/DataCamp-Projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 13 Oct 2025
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 27 Jan 2026
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 09 Apr 2025
https://github.com/aravind-selvam/covid_dashboard
With Covid death and vaccine data. I have created a dashboard.
covid-19 data-analysis data-science data-visualization tableau tableau-public visualization
Last synced: 08 Mar 2026
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 06 Sep 2025
https://github.com/fenghaojiang/ethereum-etl
ETL(Extract, Transform, Load) data from Ethereum like EVM Block chain
Last synced: 14 Jan 2026
https://github.com/grburgess/polarpy
Tools for polar
3ml data-analysis grb polarization polarization-data
Last synced: 15 Jul 2025
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 29 Apr 2025
https://github.com/realkarthiknair/data-science-notes
Data science notes and programs
data-analysis data-science data-visualization
Last synced: 27 Aug 2025
https://github.com/haloapping/malas-ngetik-clf
Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.
data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn
Last synced: 12 Apr 2026
https://github.com/viper373/baidutieba
爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)
baidutieba-crawler bert data-analysis deep-learning python spider
Last synced: 30 Mar 2025
https://github.com/alieymsxxn/sql_project_data_job_analysis
This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.
data-analysis postgresql sql sqlite
Last synced: 16 Apr 2025
https://github.com/elhaban3ro/thewildtool
TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖
ai audio audio-processing data-analysis data-science dataset deeplearning python
Last synced: 03 Sep 2025
https://github.com/draym/covid19tracker
Coronavirus COVID-19 dashboard to track global cases
covid-19 covid19-tracker dashboard data-analysis
Last synced: 07 Jan 2026
https://github.com/roland045/road_quality_measurement_analysis
Novel road quality measurement system for cost effective pavement monitoring, ML-based
azure data-analysis data-engineering data-science machine-learning mlops model-deployment python sql unsupervised-learning
Last synced: 24 Jan 2026
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: 08 Jan 2026
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 24 Feb 2026
https://github.com/iamfoysal/data-analysis
This repository contains various examples and exercises to help learn data science using Python.
data-analysis data-science database jupyter-notebook python3
Last synced: 10 Feb 2026
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 13 Apr 2026
https://github.com/jpquast/icp-ms-data-explorer
A shiny app for the exploration of ICP-MS data.
data-analysis icp-ms r shiny shiny-apps
Last synced: 17 Jan 2026
https://github.com/gxelab/tutorials
Tutorials of frequently used software packages and libraries in the lab
bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization
Last synced: 18 Jan 2026
https://github.com/ernestaroozoo/memestocks.net
MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.
dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit
Last synced: 06 Mar 2026
https://github.com/is-leeroy-jenkins/badger
A data science & analysis toolkit for federal analysts with the environmental protection agency based on WPF, Net 8, and is written in C#.
ai budget budget-management data-analysis data-science data-visualization federal-government large-language-models machine-learning
Last synced: 14 Oct 2025
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 01 Jun 2026
https://github.com/jimut123/ultimate_date_finder
To find the best place for dating in your country
data-analysis data-science date-cluster dating geo location maps software
Last synced: 16 Jan 2026
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 14 Mar 2025
https://github.com/quantumudit/uk-student-accommodation-analysis
This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 27 Apr 2026
https://github.com/i4ds/ecallisto_ng
Ecallisto NG is a Python package tailored for interacting with Ecallisto data.
data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer
Last synced: 13 Oct 2025
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 09 Sep 2025
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process, and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 18 May 2026
https://github.com/souvik09-tech/walmart_sales_dataanalysis
This end-to-end data analysis project leverages Python for processing and SQL for advanced querying to extract key business insights from Walmart sales data. It's designed for data analysts to enhance skills in data manipulation, querying, and pipeline creation.
data-analysis end-to-end etl-pipeline jupyter-notebook mysql mysql-database pandas python
Last synced: 17 Feb 2026
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 02 Mar 2025
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 28 Mar 2025
https://github.com/jakebrehm/demesstify
📱Demystifies your messages and allows for easy analysis and visualization of conversations.
data-analysis data-science imessage messages messaging nlp pandas python sentiment-analysis visualization wordcloud
Last synced: 13 Apr 2025
https://github.com/beckversync/probability-and-statis_computer-parts-cpus-and-gpus-ics_
Probability and statistical analysis techniques are employed to explore data related to computer components, such as CPUs, GPUs, and Integrated Circuits (ICs). The objective is to uncover trends, identify patterns, and extract meaningful insights from real-world hardware data.
Last synced: 18 Feb 2026
https://github.com/bebofekry/i_care_graduation_project
Graduation Project
ai artificial-intelligence chatbot cnn computer-vision data-analysis data-science deep-learning ecg graduation-project healthcare machine-learning medical medical-imaging natural-language-processing neural-networks nlp pattern-recognition predictive-modeling python
Last synced: 01 May 2026
https://github.com/johntocci/nullaxe
Nullaxe is a powerful and user-friendly Python library designed for cleaning and preprocessing data. It works seamlessly with both pandas and polars DataFrames, making it a versatile tool for data scientists and developers.
data data-analysis data-science datacleaning pandas polars python
Last synced: 06 Apr 2026
https://github.com/rgalyeon/machine_learning_and_data_analysis
Machine Learning and Data Analysis specialization by Yandex and MIPT
coursera data-analysis data-science machine-learning mipt python yandex
Last synced: 03 Mar 2025
https://github.com/zachlagden/spotify-listening-analyzer
A comprehensive Python tool for analyzing your Spotify listening history data.
analytics data-analysis pandas python spotify-web-api spotipy
Last synced: 31 Jul 2025
https://github.com/elysian01/ml-eda-and-modelling-using-streamlit
Beautiful Web interface made using Streamlit for quick Exploratory Data Analysis and building classification models which are implemented from scratch.
data-analysis data-visualization eda exploratory-data-analysis knn-classification logistic-regression matplotlib ml-model-on-web ml-models naive-bayes-classifier pandas seaborn streamlit streamlit-webapp
Last synced: 12 Apr 2025
https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible
Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file
data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics
Last synced: 25 Jan 2026
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 28 Jul 2025
https://github.com/byancamatos01/powerbi2
Dia2- Tratamento de dados
data-analysis datavisualization powerbi powerquery
Last synced: 16 Feb 2026
https://github.com/stelios-c/gps_analysis
Analysis of GPS interruption data in public domain
data-analysis electronic-warfare gps gps-quality jamming jupyter-notebook osint pandas python spoofing web-scraping
Last synced: 12 Apr 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 20 May 2026
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 07 Apr 2025
https://github.com/quantumudit/demographic-data-analysis
This project focuses on analyzing and finding correlations between the three important metrics by 195 countries,i.e., birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 15 May 2026
https://github.com/poga/dat-ipynb-demo
use ipython notebook to analyze data in dat archive
dat data-analysis distributed jupyter-notebook
Last synced: 17 Aug 2025
https://github.com/kevinyang372/san-francisco-crime-data-analysis
An ARIMA prediction model for forecasting potential crimes based on users' time and location
data-analysis machine-learning
Last synced: 29 Oct 2025
https://github.com/mohd-faizy/07p_tumor-diagnosis-exploratory-data-analysis-on-breast-cancer-wisconsin-dataset
Tumor Diagnosis: Exploratory Data Analysis With Seaborn
data-analysis data-visualization eda exploratory-data-analysis knn-classification pca-analysis python random-forest random-forest-classifier statistics support-vector-machines tumor-detection visualization
Last synced: 17 May 2026
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 02 Feb 2026
https://github.com/vvipjain/ecommerce-sales-analysis
Ecommerce Sales Analysis
data-analysis pandas pandas-dataframe python sql sqlalchemy
Last synced: 16 Apr 2026
https://github.com/gabysbrain/purescript-dataframe
A data structure for row-based data and queries
Last synced: 19 Feb 2026
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 20 May 2026