Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-13 00:08:05 UTC
- JSON Representation
https://github.com/alandefreitas/scistats
High-Performance Descriptive Statistics and Hypothesis Tests in C++20
bayesian-statistics data-analysis descriptive-statistics hypothesis-testing performance-statistics statistics
Last synced: 11 Apr 2025
https://github.com/yusufcinarci/web-scraping-projects
In these project files, I will host the web scraping examples that I will make day by day.
data-analysis data-science jupyter-notebook python web-scraping
Last synced: 01 May 2025
https://github.com/theakashshukla/r-project
🎓 A Collection of Programming Assignment for R Language
algorithms data-analysis data-science data-science-projects ml r
Last synced: 24 Jul 2025
https://github.com/danieldacosta/airbnb-analysis
Data analysis of AirBnb website history in the city of Rio de Janeiro
airbnb-analysis airbnb-website-history data-analysis
Last synced: 30 Apr 2025
https://github.com/irfanchahyadi/ml-notes
Complete personal notes for performing Data Analysis, Preprocessing, and Training ML model.
data-analysis machine-learning plotting python
Last synced: 11 Jul 2025
https://github.com/coderjolly/player-market-value-prediction
There is an intense transfer speculation that surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. Therefore, we are predicting this Market Value of a player using the data provided in csv format.
data-analysis data-visualization decision-tree-regression machine-learning xgboost-regression
Last synced: 22 Jun 2025
https://github.com/sushant1827/traffic-forecasting-using-iot-sensor-data
Demonstrates how to utilize XGBoost for traffic forecasting using data gathered from IoT sensors, highlighting its efficiency in processing complex datasets and delivering accurate predictions.
data-analysis data-visualization exploratory-data-analysis feature-engineering feature-importance feature-selection gridsearchcv hyperparameter-optimization hyperparameter-tuning iot random-search xgboost-regression
Last synced: 08 Mar 2026
https://github.com/rikard-helgegren/leverage_analysis_tool
Analyst tool for portfolio construction. How can levereged certificates be used to increase returns in a portfolio while keeping the risk as low as possible. Use the tool and find out.
cpp data-analysis investment kivy-framework python3
Last synced: 12 Apr 2025
https://github.com/searchformyusername/dark-net-websites-dataset
Dataset of Onion Websites
crawler darknet data-analysis dataset onion search-engine website
Last synced: 16 Jun 2025
https://github.com/cosmoduende/r-ufo-sightings
Are we alone in the universe? - Data Analysis and Data Visualization of UFO sightings with R. How to analyze and visualize data of UFO sightings of the last century in the USA and the rest of the world with R language.
data-analysis data-analytics data-science data-visualisation data-visualization data-visualizations dataviz ovni ovni-dataset r-code r-language r-programming r-stats ufo ufo-analysis ufo-dataset ufo-sighting ufo-sightings
Last synced: 13 May 2025
https://github.com/c0deta1ker/matbasex
MatBaseX is an all-in-one database and analytical tool for photoelectron spectroscopy (PES) analysis, focused on materials and their X-ray interactions. It offers features like a Materials Properties Database, IMFP & XPS Sensitivity Factor Calculator, and PES N-Layer Simulations & Curve Fitting utilities. Explore its powerful capabilities today!
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 01 Jul 2025
https://github.com/franpog859/top-of-the-world
🌍🔝 Proof that your country is the top of the world using GeoTIFF images and a little bit of geometry. Data mining project
data data-analysis data-mining elevation geometry geotiff image-processing matplotlib nvector rasterio
Last synced: 16 Feb 2026
https://github.com/csparpa/last.fm-stats
Exercise on Last.fm data aggregation
data-analysis exercise lastfm lastfm-api python
Last synced: 21 May 2026
https://github.com/i10mm/gpt-arxiv-fetcher
Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!
artificial data-analysis intelligence llm machine-learning
Last synced: 14 Jul 2025
https://github.com/rdpeng/analyticdesigntheory
Web site for Analytic Design Theory
analytic-design-theory data-analysis exploratory-data-analysis
Last synced: 29 Jul 2025
https://github.com/rhenkin/visxhclust
A Shiny app and functions for visual exploration of hierarchical clustering.
clustering data-analysis data-science r r-package r-shiny rstats shiny-apps
Last synced: 02 Apr 2025
https://github.com/jimbrig/lossrunAnalyzer
R Package and Shiny App to Analyze Insurance Lossruns
actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny
Last synced: 30 Jul 2025
https://github.com/pythondeveloper6/supermarket-eda-seaborn-for-beginners
learn Seaborn basics using a simple EDA
data-analysis eda numpy pandas seaborn visualization
Last synced: 11 Apr 2025
https://github.com/slgobinath/wisdom
An adaptive and self-boosting stream processor
cep complex-event-processing data-analysis distributed self-tuning stream-processing wisdom
Last synced: 14 Apr 2025
https://github.com/grburgess/ronswanson
Ron Swanson builds tables for 3ML
3ml astromodels data-analysis interpolation python ron-swanson spectral-fitting
Last synced: 15 Jul 2025
https://github.com/mch-fauzy/data-science
Repository containing portfolio of data science and machine learning projects. Presented in the form of iPython Notebooks
data-analysis data-science data-visualization ipython-notebooks machine-learning natural-language-processing portfolio
Last synced: 24 Sep 2025
https://github.com/apoorvalal/lalrutils
Misc utility functions in R for personal use.
Last synced: 07 Mar 2026
https://github.com/asifdotexe/sentimentscoringmodel
This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.
data-analysis data-visualization natural-language-processing sentiment-analysis
Last synced: 28 Oct 2025
https://github.com/thecoderpinar/earthquake_prediction_analysis_project
🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮
analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization
Last synced: 16 Aug 2025
https://github.com/relvaner/nodes4j-core
Framework for parallel processing based on Actor4j. Useful for data analysis.
actor actor-model actor4j actors batch-processing data-analysis graph-processing java java-17 message-passing parallelization reactive-system stream-processing
Last synced: 14 Jul 2025
https://github.com/quantumudit/basketball-players-analysis
The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI
data-analysis jupyter-notebook power-bi python
Last synced: 17 May 2026
https://github.com/ficaan/data-analysis-with-python-2023_2024-mooc.fi
These are all the solutions for exercises from Data Analysis with Python 2023/2024, a course offered by the University of Helsinki, Finland.
data-analysis machine-learning mooc-fi programming python
Last synced: 08 Jun 2026
https://github.com/tushar2704/everyday-sql
Welcome to Everyday SQL Sheets – your go-to resource for everyday SQL cheat sheets, pro tips, interview questions, and more. Whether you're a beginner looking to learn SQL or an experienced developer seeking quick reference materials, this application has got you covered.
artificial-intelligence cheatsheet data-analysis data-science database mysql postgresql query-language sql sqlalchemy streamlit streamlit-tushar2704 tushar2704
Last synced: 05 Apr 2026
https://github.com/scottgriv/river-charts
🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).
api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp
Last synced: 12 Aug 2025
https://github.com/astrodynamic/retailanalitycs-in-postgresql
Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.
bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views
Last synced: 06 Apr 2026
https://github.com/louis-heraut/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with parametrisation file.
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 09 Mar 2026
https://github.com/accurat/react-dataviz
⚛📊🚀 React components to build powerful interactive data visualizations
d3 data-analysis data-visualization react react-components
Last synced: 19 Jun 2025
https://github.com/pepe-god/dataprophet
Extracts the identity information citizens from MySQL, creates a family network based on TC ID No. and exports it to CSV
101m 109m adres data-analysis data-extraction database-connector family-tree genealogy gsm hsys identity mysql-database python-script pyton
Last synced: 13 Jul 2025
https://github.com/quantumudit/movie-ratings-analysis
This project focuses on analyzing and finding correlations between the audience and critic ratings for some of the popular movies released between 2009-2011 using Python & Power BI
data-analysis data-visualization jupyter-notebook power-bi python
Last synced: 19 Apr 2026
https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python
📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.
data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne
Last synced: 30 Oct 2025
https://github.com/gxjansen/user-analysis-with-r-google-analytics
Analyzing user behavior of an E-commerce website with R and (mainly) Google Analytics Data
analytics analytics-api conversion-rate-optimization data-analysis ecommerce google google-analytics r
Last synced: 27 Mar 2025
https://github.com/mukhopadhyay/data-visualization
...
data-analysis data-science data-visualization visualization
Last synced: 21 Mar 2025
https://github.com/johnsell620/sentiment-analysis-goodreads-reviews
Document-level sentiment analysis of book reviews scraped from the Goodreads website. Technologies used include TensorFlow, Spark, HDFS, Sqoop, Scrapy, and D3.js.
data-analysis data-visualization recurrent-neural-networks web-scraping
Last synced: 30 Apr 2025
https://github.com/negativenagesh/spam-ham_email_detection_machine_learning
This project focuses on classifying spam/ham emails, using machine learning algorithms like LGR, NB, RF, DT etc.. and based on the accuracy score and precision score I chose logistic regression for the classification. And I have used streamlit for frontend.
app data-analysis data-cleaning data-engineering data-science data-visualization data-visualizations jupyter-notebook logistic-regression machine-learning modeling naive-bayes-classifier nlp python
Last synced: 12 Apr 2025
https://github.com/lisa-ho/three-investigators
Respository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (the three investigators).
data-analysis data-viz datawrapper python webscraping
Last synced: 25 Oct 2025
https://github.com/ziaeemehr/cpp_workshop
Scientific programming toolbox with C++
cpp data-analysis data-science learning-by-doing programming scientific-computing telegram-channel youtube
Last synced: 15 May 2026
https://github.com/stimulsoft/stimulsoft.dashboards.php
Dashboards.PHP is a complete software package for designing and viewing dashboards. Includes the JS data analysis engine, dashboard designer and viewer. Support PHP 5, PHP 7, and PHP 8 versions.
charts dashboard-builder dashboards data-analysis data-grid data-visualization datatable dynamic-dashboard interactive-dashboards live-data mysql-data php php-bi-tools php-dashboard php-kpi php7 php8 pivot-tables sql-datasources statistics
Last synced: 14 Oct 2025
https://github.com/pythondeveloper6/udemy-courses-full-eda
simple EDA for Udemy courses
data-analysis eda matplotlib numpy pandas python seaborn
Last synced: 16 Jun 2025
https://github.com/richiejp/jdp
Automatically collect and normalise data, then run algorithms on it.
automation-framework data-analysis suse-qa
Last synced: 02 Jan 2026
https://github.com/natlee/myanimelist-comment-crawler
Crawl all reviews and infomation of Anime works on MyAnimeList. ;)
anime crawler data-analysis data-mining data-science kaggle kaggle-dataset myanimelist python requests scrapy-crawler sqlite
Last synced: 14 Apr 2025
https://github.com/louis-heraut/exstat
🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series
Last synced: 16 Jun 2025
https://github.com/srinivasrm/mutual-funds-analysis-and-prediction
In this project I have performed analysis and prediction on 1,3,and 5 year returns on 1064 mutual funds in India. I have scraped data from a website which is the most visited website for mutual fund investments.I have tested regression models linear model,SGD Regressor , Random Forest Regressor,Decision Tree Regressor,Ridge,MLP Regressor and linear model (Lasso).After which I have selected the best perorming model and performed Hyper parameter tuning and then deployed an interactive application which can generate the visualization and send an email with the visualization to the users email address.
beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication
Last synced: 27 Oct 2025
https://github.com/pythondeveloper6/store-sales-eda
simple EDA with some insights on Store Sales
data-analysis eda matplotlib numpy pandas seaborn
Last synced: 11 Apr 2025
https://github.com/davidzajac1/reptoro
A Data Visualization and Analytics Platform for the Reptile Industry
analytics data-analysis data-visualization plotly-dash python
Last synced: 15 May 2026
https://github.com/nicucalcea/raise
An R library that uses ChatGPT / GPT to generate data
chatgpt chatgpt-api chatgpt-app data-analysis gpt gpt-35-turbo openai openai-chatgpt parsing r
Last synced: 05 Mar 2025
https://github.com/sowinskibraeden/schedulegeneratorapp
The Desktop Application for my schedule-generator algorithm, allowing users to easily interact with the algorithm and its variables to generate schedules as documents for students individually as well as the master timetable
algorithm csv data-analysis dataclasses python-docx python-typing python311 xlsxwriter
Last synced: 09 Jul 2025
https://github.com/super-lou/exstat
🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series
Last synced: 13 Apr 2025
https://github.com/alexeyev/hse-spb-bigdata-python-fall2016
Материалы к курсу по программированию и инструментам анализа данных, прочитанному в петербургском филиале НИУ ВШЭ осенью 2016 года
course-materials data-analysis numpy pandas python scikit-learn sklearn
Last synced: 07 Apr 2026
https://github.com/orkunaktas/sofascore-webscraping
⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️
beautifulsoup data-analysis football-analytics football-data selenium webscraping
Last synced: 28 Oct 2025
https://github.com/yash22222/tata-data-visualisation-virtual-internship
Data Visualisation: Empowering Business with Effective Insights Gain insights into leveraging data visualisations as a tool for making informed business decisions.
basics ceo charts cmo data-analysis data-interpretation data-science data-visualization graphs machine-learning mcq microsoft-excel microsoft-power-bi microsoft-word powerpoint-presentations python tableau tata tata-data-visualisation
Last synced: 22 Jul 2025
https://github.com/jimmymugendi/email-sms-spam_classifier
Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.
data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library
Last synced: 25 Sep 2025
https://github.com/kellyjadams/spotify-data-analyze
A serverless data pipeline that logs my Spotify listening history to BigQuery using Cloud Run, then visualizes trends with Looker Studio. Built with Python, Flask, Docker, and GCP..
data-analysis data-engineering
Last synced: 07 May 2025
https://github.com/woctezuma/regression
Gaussian Process Regression vs. Relevance Vector Machine.
data-analysis data-science gaussian-process-regression machine-learning python regression relevance-vector-machine statistics
Last synced: 17 Jul 2025
https://github.com/ynikitenko/lena
Lena is an architectural framework for data analysis
analysis-framework analysis-pipeline data-analysis data-science
Last synced: 30 Apr 2025
https://github.com/saksham-joshi/sentiment_analyzer
Analyze the sentiment of a text stored in a string or file and understand the reason why your blogs and posts are not ranking up.
data-analysis data-analytics python sentiment-analyser sentiment-analysis sentiment-analysis-without-nltk
Last synced: 22 Aug 2025
https://github.com/edaaydinea/dataquest-projects
This repository is included data analyst, and data science-guided projects through Dataquest.
Last synced: 07 Feb 2026
https://github.com/happybono/avocadosmoothie
VB.NET project for running-median filtering. Users set kernel radius, border count, and pick MiddleMedian or AllMedian. Processing runs in parallel with a progress bar and smooth UI.
algorithms calibration correction data-analysis median outliers quicksort running-median runningmedian smoothing smoothing-methods statistics visual-basic
Last synced: 10 Feb 2026
https://github.com/dcs-training/digital-method-of-the-month
In this repository you are going to find the documents we produced to support the discussion in our Digital Methods of the Month. These documents will help you orienting yourself if you want to pickup the method in your research. Go to the readme file
3d-data data-analysis data-visualisation data-wrangling geographical-data gis good-practices-digital-research machine-learning network-analysis open-research preregistration statistics text-analysis
Last synced: 25 Feb 2026
https://github.com/cyyeh/duckdb-data-agent
An AI-powered data analysis agent with a built-in SQL playground. Upload data files (CSV, JSON, Parquet, Excel) and ask questions in plain English — the agent delegates to a specialized subagent for SQL queries and renders charts inline — or switch to the SQL editor for direct queries.
agent claude-code csv data-analysis duckdb excel json langfuse llm parquet python react sql typescript
Last synced: 04 Jun 2026
https://github.com/karlyndiary/restaurant-ratings-analysis
Restaurant Ratings Analysis using Microsoft Power BI
dashboard data-analysis data-analysis-powerbi power-bi power-bi-dashboard report restaurant restaurant-ratings-analysis restaurant-ratings-dashboard restaurant-ratings-data-analysis restaurant-ratings-power-bi-dashboard
Last synced: 26 Feb 2026
https://github.com/prabhupavitra/data-visualization-with-python
This repository houses data visualization with Python.
barplot data-analysis data-visualization datavisualization dotplot grouped-bar-chart heatmap matplotlib matplotlib-pyplot pandas python3 seaborn stacked-bar-chart
Last synced: 09 May 2026
https://github.com/mertcandav/julenum
A high-performance library for numerical methods and scientific computing in Jule
data-analysis jule julelang math matrix scientific-computing statistics
Last synced: 09 Feb 2026
https://github.com/depressioncenter/mden
Mobile technologies code from the University of Michigan's Mobile Data Experts Network (MDEN), featuring data cleaning automations, REDCap project templates, and links to useful external modules. [DOI: 10.6084/m9.figshare.25438714]
automation data-analysis data-cleaning fitness-tracker heart-rate-data mobile-data mobile-development mquery powerautomate powerbi powerquery python r sleep-data smartwatch-data tableau
Last synced: 25 Feb 2026
https://github.com/zmyzheng/signature-authentication-pen
Signature Authentication Pen, a cloud based IoT project which realizes identity authentication by exploiting the signature biometric features of the users. Details:
android aws data-analysis identity-authentication iot neural-network signature-authentication-pen
Last synced: 03 May 2026
https://github.com/tirendazacademy/pandasai-tutorials
Tutorials for PandasAI
ai data-analysis data-science data-visualization llms openai pandas pandasai python
Last synced: 27 Mar 2026
https://github.com/rudra496/science
🔬 Interactive science experiments and research simulations — physics, chemistry, biology with 3D visualizations and real-time data analysis
data-analysis education experiments hacktoberfest javascript python research science simulation threejs
Last synced: 09 Jun 2026
https://github.com/quantumudit/regional-sales-analysis
This project focuses on analyzing and visualizing the United States regional sales for a fictitious company in between 2018-2020 using Python & Power BI.
data-analysis data-visualization databases jupyter-notebook power-bi python sqlite
Last synced: 02 May 2026
https://github.com/quantumudit/analyzing-whiskyexchange-whisky
This project focuses on scraping data related to Japanese Whiskey from the Whiskey Exchange website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 02 May 2026
https://github.com/quantumudit/consumer-goods-sales-analysis
This project focuses on analyzing and visualizing the consumer goods sales in the United States between 2015-2016 using Python & Power BI.
data-analysis data-visualization database jupyter-notebook python sqlite
Last synced: 29 Apr 2026
https://github.com/jggautier/dataverse-curation-assistant
A small software application that provides a UI for automating things in repositories that use the Dataverse software
data-analysis dataverse hacktoberfest python
Last synced: 01 Mar 2026
https://github.com/banisterious/obsidian-oneirometrics
OneiroMetrics (Turning Dreams Into Data). A plugin for Obsidian to track and analyze dream journal metrics.
data-analysis dream-analysis dream-diary dream-journal dreams journaling metrics obsidian obsidian-plugin self-improvement tracking
Last synced: 22 Apr 2026
https://github.com/niamoto/niamoto
Niamoto is a command-line application and library focused on processing and publishing botanical data
botany cli-application data-analysis data-processing data-publication python-library
Last synced: 23 Apr 2026
https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics
Last synced: 30 Apr 2026
https://github.com/abrahamkoloboe27/dashboard-streamlit-atut
Lien de l'application
dashboard data-analysis data-visualization pandas plotly python streamlit visualization
Last synced: 05 Mar 2026
https://github.com/mscbuild/mscbuild
🏆 Сreating digital experiences that not only meet user expectations, but also drive engagement, loyalty and, ultimately, business success. Passionate developer from Latvia .
analysis best-practices coding config data-analysis data-science design developer freelance fullstack github-config latvia mscbuild profile readme seo site software-engineering web webapp
Last synced: 31 Jan 2026
https://github.com/jackfiszr/pl2xl
Nodejs-polars wrapper with `readExcel` and `writeExcel` methods.
data-analysis data-science deno excel excel-reader excel-writer nodejs polars
Last synced: 21 Jan 2026
https://github.com/itzmeanjan/indian-railway
Exploring Indian Railways time table dataset, with :heart:
data-analysis data-visualization indian-railways matplotlib python python3 railway
Last synced: 17 Oct 2025
https://github.com/bjpop/gurita
A convenient and expressive tool for data analytics and plotting on the command line
command-line data-analysis data-science pandas plotting python
Last synced: 04 Feb 2026
https://github.com/theengineeringworld/numpy-data-science
NumPy Data Science Essential Traing COurse. Part of Youtube Course Offered by TheEngineeringWorld.
data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018
Last synced: 09 Oct 2025
https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy
Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.
data-analysis data-analyst-nanodegree data-wrangling numpy pandas python
Last synced: 09 Apr 2025
https://github.com/nirantak/programming-exercises
Programming exercises / Coding problems
data-analysis image-processing intelligent-systems matlab programming python python3
Last synced: 01 May 2025
https://github.com/1ayanabil1/healthcare-machine-learning
Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.
data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python
Last synced: 28 Apr 2025
https://github.com/zcebeci/adana
A Complete Toolbox for Adaptive and Hybrid Genetic Algorithms in R
adaptive-genetic-algorithms biologically-inspired-algorithm data-analysis data-science evolutionary-algorithms genetic-algorithms global-optimization-algorithms hybrid-genetic-algorithm multi-objective-optimization nature-inspired-algorithms single-objective-optimization
Last synced: 08 Oct 2025
https://github.com/arv-anshul/campusx-project-notebooks
Capstone project by Campusx in DSMP course.
campusx campusx-dsmp data-analysis data-science eda jupyter-notebook machine-learning ml-project nlp project python3 recommender-system regression streamlit
Last synced: 22 Aug 2025
https://github.com/paezha/edashop
An open educational resource to teach a workshop on Exploratory Data Analysis in R
data-analysis exploratory-data-analysis open-educational-resources package r rstats workshop-materials
Last synced: 18 Mar 2025
https://github.com/mikebild/introduction-python
An introduction to Python, Flask, Numpy, MatPlotLib and Pandas
data-analysis flask introduction iterables json jupyter matplotlib microservice numpy pandas python python3 sqlalchemy tutorial
Last synced: 11 Apr 2026
https://github.com/waveform80/structa
A small utility for analyzing data structures (e.g. JSON files)
csv data-analysis data-visualization datajournalism datawrangling json yaml
Last synced: 06 Sep 2025
https://github.com/maksimekin/umd_data_challange_2020
Ocean Clean up data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.
cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd
Last synced: 05 Jul 2025
https://github.com/tirendazacademy/hands-on-data-science-with-gcp
Google BigQuery Tutorial
big-data big-data-analytics bigdata bigquery bigquery-ml bigqueryml cloud-computing data-analysis data-analytics data-engineering data-science dataanalysis dataengineering google-bigquery google-cloud-platform machienlearning machine-learning
Last synced: 06 Oct 2025
https://github.com/ethan-wickstrom/rrrs
Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.
analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples
Last synced: 16 May 2025
https://github.com/cheminfo/compass
Strategy for improved characterisation of human metabolic phenotypes using a COmbined Multiblock Principal components Analysis with Statistical Spectroscopy (COMPASS)
data-analysis metabolomics metabonomics multiblock nmr-spectroscopy pca population-analysis population-model
Last synced: 23 Mar 2025
https://github.com/elfgk/diabetes-data-analysis
diabetes data analysis
analysis data-analysis diabetes-data-analysis eda jupiter-notebook
Last synced: 31 Aug 2025
https://github.com/dcs-training/datavisualisationwithr
Data Visualisation with R Workshop (delivered by the Centre in December 2020). This workshop is focusing on visualising your data. Go to the readme file
data-analysis data-visualisation data-wrangling r
Last synced: 25 Apr 2025
https://github.com/shervinnd/bazar_app_store_eda
Bazar App Data analysis code to find the most downloaded category and most popular installed apps
data data-analysis data-science dataanalysis eda python
Last synced: 15 Apr 2025