Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/implicitlayer/alphaanalysis
Full library for analyzing financial data using ML and classical approaches
data-analysis data-science deep-learning finance machine-learning risk-analysis time-series-analysis trading transformers
Last synced: 12 Mar 2026
https://github.com/petulla/readroper
Read single and multi-card ASCII polling datasets in R
ascii data-analysis polling-data r
Last synced: 31 Mar 2025
https://github.com/erictleung/microbiome-analysis-resources
:microscope: Resources and notes on studying the human microbiome
bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources
Last synced: 18 Jan 2026
https://github.com/carlosvinimsouza/dataanalysiswithpython
Learn Data Analysis using Libs Python (Numpy, Pandas, Matplotlib and Seaborn)
data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn
Last synced: 11 Apr 2026
https://github.com/arnavk-09/world-population
🌏 Taipy demo to explore world population data...
data-analysis python taipy world-population
Last synced: 31 Mar 2025
https://github.com/dcs-training/summerschool2024-stream2
Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file
data-analysis data-visualisation data-wrangling geographical-data r sentiment-analysis statistics text-analysis web-scraping
Last synced: 25 Apr 2025
https://github.com/tushar2704/instagram-user-analytics
This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.
artificial-intelligence data-analysis data-science instagram project tushar2704
Last synced: 23 Jan 2026
https://github.com/prathamesh693/04_sales-and-demand-forecasting-for-retail-chains
This project aims to forecast weekly sales for retail stores using historical sales and economic data. By applying advanced time series forecasting models, we enable better inventory management, demand planning, and revenue optimization for retail chains. The project includes both traditional statistical models and deep learning techniques.
arima-forecasting data-analysis data-visualization exploratory-data-analysis jupyter-notebook prophet-facebook python spyder-python-ide streamlit-webapp
Last synced: 01 Jul 2025
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 19 Apr 2026
https://github.com/ihabbendidi/diamond-analysis
Exploratory statistical analysis of a Diamond dataset
data-analysis data-visualization exploratory-data-analysis machine-learning r
Last synced: 17 Oct 2025
https://github.com/willie-conway/meta-data-analyst-portfolio
A comprehensive 📚portfolio showcasing projects and skills developed during the Meta Data Analyst Professional Certificate 🎓course, featuring 📈data analysis, 📊visualization, and 👨🏿💻management using various ⚙️tools.
big-data business-intelligence data-analysis data-cleaning data-driven-decisions data-management data-mining data-visualization exploratory-data-analysis jupyter-notebook machine-learning pandas porfolio predictive-modeling python spreadsheet-analysis sql statistics tableau visualization-tools
Last synced: 11 Apr 2026
https://github.com/mrankitgupta/mrankitgupta
Myself Ankit Gupta, This contains a short & interesting introduction about me.
ai ankit ankit-gupta ankitgupta artificial-intelligence awesome-readme data-analysis data-science data-visualisation data-visualization github github-profile github-profile-readme machine-learning mrankitgupta profile python readme readme-profile social
Last synced: 22 Apr 2025
https://github.com/heiderjeffer/misalignment-between-ownership-and-contribution-affects-system-reliability
Research Proposals RP
archtecture data-analysis data-collection nvivo-software python qualitative-analysis quantative-analysis reliability-engineering software-engineering
Last synced: 23 Feb 2026
https://github.com/llnl/nddav
N-Dimensional Data Analysis and Visualization
data-analysis data-viz high-dimensional-data topological-data-analysis visual-analytics visualization
Last synced: 29 Apr 2025
https://github.com/touppercase78/real-racing-3-vehicles
Datasets and Analyses for All Vehicles in Real Racing 3
data-analysis data-science datasets electronic-arts firemonkey jupyter-notebook mobile-game python racing racing-games real-racing-3 vehicles
Last synced: 01 Apr 2025
https://github.com/ktmud/github-life
A data explorer for GitHub projects' life cycles
data-analysis github scraper time-series
Last synced: 16 May 2026
https://github.com/khuyentran1401/suicide-rates
data-analysis data-science kaggle machine-learning python
Last synced: 13 Apr 2026
https://github.com/julianolaurentino/sql_sample
Estudos voltados para criação de consultas em SQL. Este repositório está sendo alimentando semanalmente.
data-analysis sql sql-query sql-server
Last synced: 05 Mar 2025
https://github.com/pmsipilot/intercom2dw
Intercom2DW is an attempt at loading all the data available in an Intercom application.
Last synced: 11 Apr 2026
https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql
The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.
data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server
Last synced: 24 Mar 2025
https://github.com/bts-cm/airdrop_tool
Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.
airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc
Last synced: 17 Jan 2026
https://github.com/nmicht/food-language-variety
A map to display distribution of food names using their synonyms in different regions.
data-analysis language-distribution language-variety map
Last synced: 31 Jan 2026
https://github.com/thecoderpinar/earthquake-explorer
🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.
data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data
Last synced: 23 Aug 2025
https://github.com/dylan-profiler/tangled-up-in-unicode
Access to the Unicode Character Database (UCD)
data-analysis data-quality exploration linguistic-analysis linguistics python unicode
Last synced: 15 Apr 2025
https://github.com/splch/qbs
An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.
classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling
Last synced: 01 Apr 2025
https://github.com/datalayer/desktop
Ξ 🖥️ Datalayer Destkop.
ai data data-analysis data-science datalayer desktop electron
Last synced: 25 Oct 2025
https://github.com/Lightning-Chart/lcjs-analysis
A data analysis library.
data-analysis javascript-library lcjs lightningchart-js typescript-library
Last synced: 12 May 2025
https://github.com/dra1ex/mind-net.js
Fast and simple to use neural network implementation in pure TypeScript with GPU support!
artificial-intelligence classification data-analysis deep-learning gan generative-adversarial-network gpu machine-learning ml neural-network neural-network-engine neural-networks regression sequential-network supervised-learning unsupervised-learning vae variational-autoencoder
Last synced: 09 Apr 2025
https://github.com/yasminezaatour/heart-disease-predictions
Heart attack predictions using Python
data-analysis data-modeling data-visualization machine-learning
Last synced: 28 Nov 2025
https://github.com/singhkunall/-india-census-2011-population-demographics-dashboard
Interactive Excel dashboard visualizing India's 2011 Census Population Demographics using charts, pivot tables, and slicers.
data-analysis data-visualization excel-dashboard india-census population-data
Last synced: 04 Feb 2026
https://github.com/aadya940/numpyai
A Natural Language Interface to the Numpy Library using LLMs.
ai data-analysis data-science library llm machine-learning numpy python
Last synced: 12 Apr 2025
https://github.com/ehtisham-sadiq/final-year-project-material-
This repository contains all the material and data related to final year project.
data-analysis data-science data-visualization deep-learning implementation-of-algorithms machine-learning machine-learning-algorithms natural-language-processing python3
Last synced: 02 Jan 2026
https://github.com/nghorbani/neuraldataanalysis
Download Data from https://bit.ly/3g8RUmi
data-analysis matlab spike-detection spike-rate
Last synced: 22 Apr 2025
https://github.com/liweitianux/chandra-acis-analysis
Chandra ACIS analysis tools and documents
Last synced: 19 Apr 2025
https://github.com/agungbudiwirawan/socioeconomic_analysis
The objective of this project is to analyze the socio-economic in Chicago.
chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server
Last synced: 04 Jul 2025
https://github.com/narius2030/sakila-datawarehouse-ssis
Implement a simple data warehouse to store Saklia data - Create data pipelines for extract, transform and load data from source to warehouse - Retrieve data in warehouse to explore and do several analysis
data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis
Last synced: 10 Oct 2025
https://github.com/andreabozzo/osservatorio
Osservatorio - Open Data Processing Platform
api-rest data data-analysis data-visualization database datamodelling docker duckdb etl fastapi jwt-authentication open-source pipeline postgresql python react sqlite
Last synced: 02 Sep 2025
https://github.com/andreip/twitter-authorities
Find authorities for Twitter topics. [Licenta][Undergraduate thesis]
data-analysis mongodb python twitter-topics
Last synced: 19 Apr 2026
https://github.com/saltiola7/data-analysis-portfolio
Data engineering & analysis portfolio, which showcases my use of Python & SQL
airflow airtable-block anaconda automation back4app chatgpt csv-parser data-analysis data-engineering docker-compose gcp graphql-api jupyter-notebook nosql prefect python rest-api sql streamlit web-scraping
Last synced: 21 Jan 2026
https://github.com/markmelnic/carsen-desktop
A python dashboard app for scraping and tracking cars for sale on websites such as mobile.de
automation dashboard data-analysis interface scraping scraping-websites tkinter-python
Last synced: 08 Oct 2025
https://github.com/markmelnic/scalg
List scoring algorithm. Analyse data using a range based procentual proximity algorithm.
algorithm data-analysis pypi pypi-package score scorer scoring scoring-algorithm
Last synced: 08 Oct 2025
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 09 Apr 2025
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 27 Jan 2026
https://github.com/adirthaborgohain/community-data-analysis
Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.
Last synced: 10 May 2026
https://github.com/gustavohnsv/teamwork_mqa
Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.
data-analysis group-project r team-repo
Last synced: 15 Aug 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/walidalsafadi/titanic-disaster
In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).
data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction
Last synced: 16 Mar 2025
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 14 Oct 2025
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 24 Apr 2026
https://github.com/wanglaoshi/wanglaoshi-pypi
Useful tools for DA ML DL
data-analysis deep-learning machine-learning unitility
Last synced: 14 Jan 2026
https://github.com/alieymsxxn/sql_project_data_job_analysis
This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.
data-analysis postgresql sql sqlite
Last synced: 16 Apr 2025
https://github.com/draym/covid19tracker
Coronavirus COVID-19 dashboard to track global cases
covid-19 covid19-tracker dashboard data-analysis
Last synced: 07 Jan 2026
https://github.com/DCS-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/yahia3200/become-an-independent-data-scientist
My final project for the Applied Plotting, Charting & Data Representation in Python Course
data-analysis data-science data-visualization matplotlib
Last synced: 16 Mar 2025
https://github.com/maciekmalachowski/crypto-charts-site
📊Application that returns financial data for selected cryptocurrency.
binance-api data-analysis jupyter-notebook matplotlib mplfinance numpy pandas python python-binance
Last synced: 12 Apr 2026
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 27 Jan 2026
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 31 May 2026
https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql
This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.
data-analysis python retail sql sql-server sqlalchemy
Last synced: 07 Feb 2026
https://github.com/walidbosso/r_data_mining
Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 23 Mar 2025
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 18 Jan 2026
https://github.com/pizofreude/data-career-navigator
An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.
codeinplace data-analysis data-visualization standford-university
Last synced: 13 Mar 2026
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 29 Apr 2025
https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data
Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material
data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder
Last synced: 28 Jan 2026
https://github.com/leechristophermurray/parquetframe
Unlocking the power of Parquets
data data-analysis dataframe entity-framework etl graph interactive python rust workflow worklow zanzibar
Last synced: 28 May 2026
https://github.com/ekosaputro09/Data-Science-References
Some useful resources to learn about Data Science
cheatsheet data-analysis data-science data-visualization machine-learning statistical-learning
Last synced: 22 Nov 2025
https://github.com/goggle/dataisbeautiful
Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.
data-analysis data-visualisation jupyter-notebook notebook reddit
Last synced: 16 May 2025
https://github.com/mathieu2301/pbsc-tracker
Expérience de tracking des vélos en libre service fonctionnants avec PBSC
ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker
Last synced: 23 Jun 2026
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 22 Jan 2026
https://github.com/kenvilar/data-analysis-using-python
Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3
bs4 data-analysis jupyter pandas python python3 requests xlrd
Last synced: 04 Oct 2025
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 06 Sep 2025
https://github.com/louislefevre/sstubs-miner
Data mining and analysis for the ManySStuBs4J dataset.
data-analysis data-mining manysstubs4j-dataset msr
Last synced: 30 Mar 2025
https://github.com/briatte/asr
Applied Stats with R and RStudio (first-year social-science tutorials)
course data-analysis data-science data-visualization r statistics
Last synced: 14 Apr 2026
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 01 Jun 2026
https://github.com/camille-maslin/simulfcimage
🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.
academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis
Last synced: 05 Feb 2026
https://github.com/mustafah/dream-my-plots
Create visual plots in Python with the help of text prompting popular LLMs through langchain
ai artificial-intelligence automation data-analysis data-visualization langchain llms machine-learning plotting python
Last synced: 13 Apr 2026
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 11 Oct 2025
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 27 Feb 2025
https://github.com/codebypinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 12 Oct 2025
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 14 Apr 2025
https://github.com/thennen/py-ivtools
A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 23 Jan 2026
https://github.com/virajbhutada/music-recommendation-system
This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.
data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience
Last synced: 24 Jan 2026
https://github.com/martial2023/bank-performance-analysis
Analyse de données bancaires du Berka Dataset (1993-1998) pour calculer et visualiser des KPI clés
dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy
Last synced: 26 Aug 2025
https://github.com/kevinschoon/qviz
QViz Interactive Plotting
data-analysis data-visualization go gonum qframe yaegi
Last synced: 01 Jun 2026
https://github.com/akshat0427/spotify_history
code to find out some insights in spotify streaming data (work in progress)
data-analysis data-visualization
Last synced: 04 Feb 2026
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 24 Feb 2026
https://github.com/ernestaroozoo/memestocks.net
MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.
dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit
Last synced: 06 Mar 2026
https://github.com/kyle-wannacott/DataCamp-Projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 13 Oct 2025
https://github.com/erictleung/erictleung.github.io
:memo: Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/iamfoysal/data-analysis
This repository contains various examples and exercises to help learn data science using Python.
data-analysis data-science database jupyter-notebook python3
Last synced: 10 Feb 2026
https://github.com/scailfin/flowserv-core
Reproducible and Reusable Data Analysis Workflow Server
benchmarks data-analysis reproducibility reusability workflows
Last synced: 14 Jan 2026
https://github.com/emso-c/stream-analyser
A tool that analyses YouTube live streams.
cli data-analysis guessing highlights python youtube-video
Last synced: 18 Jan 2026
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: 08 Jan 2026
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 03 Jan 2026
https://github.com/elhaban3ro/thewildtool
TheWildTool is a tool developed with the main objective of saving time when working with audio datasets. Either to prepare them, to get them or to train a model with them. 🤖
ai audio audio-processing data-analysis data-science dataset deeplearning python
Last synced: 03 Sep 2025
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 29 Apr 2026
https://github.com/afondiel/ibm-data-science-professional-certificate-coursera
IBM Data Science Professional Certificate Coursera Notes
ai classification clustering coursera data-analysis data-engineering data-mining data-science data-science-challenges data-science-projects data-scientist data-visualization ibm ibm-certificate ibm-professional-certificate linear-algebra machine-learning python regression statistics
Last synced: 13 Oct 2025
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 01 Mar 2026