Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2026-07-03 00:07:42 UTC
https://github.com/devinterview-io/tensorflow-interview-questions
🟣 TensorFlow interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions tensorflow tensorflow-interview-questions tensorflow-questions tensorflow-tech-interview
Last synced: 04 Jul 2025
https://github.com/dovolopor-research/data-science-research-toolbox
🧰 Data science research toolbox
data-science data-science-research data-science-resourses research-resources research-tool visualization
Last synced: 05 Jan 2026
https://github.com/aaaastark/top-big-data-scientist-questions-for-interview
Top Big Tech Data Science Questions
ai alibaba amazon apple computer-science computer-vision data-engineer data-science deep-learning facebook google ibm intel interview-questions machine-learning netflix nvidia orcale spacex tesla
Last synced: 04 Feb 2026
https://github.com/hoangsonww/standard-deviation-calculator
📊 This repository contains a Standard Deviation Calculator implemented in C++. It provides an efficient algorithm for calculating the statistical standard deviation of a dataset, making it a valuable tool for students, researchers, and analysts seeking a reliable method for data analysis.
algorithms cplusplus cpp data data-analysis data-analytics data-science standard-deviation standard-deviation-calculator standard-deviations
Last synced: 22 Sep 2025
https://github.com/ma7555/kerasgen
A Keras/TensorFlow-compatible image data generator for triplet loss
data-generation data-generator data-generators data-science keras keras-tensorflow tensorflow triplet triplet-loss triplet-neural-network
Last synced: 11 Mar 2025
https://github.com/ashishbamania/tutorials-on-artificial-intelligence
A collection of AI tutorials from Dr. Ashish Bamania
agentic-ai ai ai-agents artificial-intelligence crewai data-science langchain llama machine-learning ml rag retreival-augmented-generation software-engineering
Last synced: 13 May 2025
https://github.com/lungben/tableio.jl
A glue package for reading and writing tabular data. It aims to provide a uniform API for reading and writing tabular data from and to multiple sources.
arrow csv data data-science database dataframe dataframes excel jdf json-format parquet postgresql sqlite zip
Last synced: 12 Oct 2025
https://github.com/devopscorner/nifi
Production-grade NiFi and NiFi Registry. Deploy to VMs (virtual machines) with Terraform and Ansible, or to Kubernetes (EKS) with Helm and Helmfile.
ansible data-science data-structures docker docker-compose dockerhub ecr eks eks-cluster etl kubernetes machine-learning ml mlops nifi nifi-registry terraform vpn vpn-client
Last synced: 08 Sep 2025
https://github.com/takuti/anompy
A Python library for anomaly detection
anomaly-detection data-science forecasting machine-learning python
Last synced: 15 Apr 2025
https://github.com/techshot25/healthcare
Insurance cost predictor
bayesian-regression data-analysis data-science linear-regression machine-learning polynomial-regression random-forest-regression
Last synced: 24 Apr 2025
https://github.com/mindful-ai-assistants/hackapucsp-2024
🏆 HackaPUCSP 2024 - Data Science and AI Hackathon - Pontifical Catholic University of São Paulo
automation data-science design github-actions hackathon-project oneness-consciousness package-manager programming pucsp pytest python3 unittest
Last synced: 11 Jul 2025
https://github.com/anshumansinha3301/safetitude-project-finance
My work as an SDE at Safetitude
data-science data-structures dbms
Last synced: 13 Jul 2025
https://github.com/devinterview-io/supervised-learning-interview-questions
🟣 Supervised Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview supervised-learning supervised-learning-interview-questions supervised-learning-questions supervised-learning-tech-interview technical-interview-questions
Last synced: 09 Feb 2026
https://github.com/elliotwutingfeng/twitter200m
Simple analysis of the Twitter 200M Data Dump of January 2023.
200m data-science haveibeenpwned leak osint twitter
Last synced: 16 Mar 2026
https://github.com/correia-jpv/fucking-awesome-datascience
📝 An awesome Data Science repository to learn from and apply to real-world problems. With repository stars ⭐ and forks 🍴
analytics awesome awesome-list data-mining data-science data-scientists data-visualization deep-learning hacktoberfest machine-learning science
Last synced: 27 Apr 2025
https://github.com/tristanbilot/airflow-rbac-roles-cli
A tool to create Airflow RBAC roles with DAG-level permissions from the CLI.
airflow cloud-composer data-engineering data-science gcp permissions pipeline rbac-roles
Last synced: 25 Oct 2025
https://github.com/bdist/bdist-workspace
This repository provides containerized applications and microservices for the Information Systems and Databases Course @ Instituto Superior Técnico
data-engineering data-science docker jupyter jupyterlab notebook postgres postgresql python sql sqlite
Last synced: 09 Apr 2026
https://github.com/canagnos/mcp
Tools for Measuring Classification Performance for R, Python and Spark
artificial-intelligence classification data-mining data-science machine-learning machine-learning-algorithms
Last synced: 28 Apr 2025
https://github.com/opengeos/qgis-leafmap-plugin
A QGIS plugin for leafmap
data-science geospatial leafmap python qgis qgis-plugin
Last synced: 30 Jan 2026
https://github.com/eliasdabbas/dash-aggrid-scales
Color scales (continuous and categorical) and bar charts for Dash-Ag-Grid
aggrid color-scales color-scheme data-science data-visualization html plotly-dash table
Last synced: 16 Mar 2026
https://github.com/emptymalei/audiorepr
A Python package to represent data using musical notes.
audiolization data data-audiolization data-science
Last synced: 12 Oct 2025
https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab
A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage-quality frontier, and outputs a defensible abstention policy (auto-decide vs. review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.
abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting
Last synced: 10 Jun 2026
https://github.com/flbulgarelli/recursos-python
Spanish resources for learning Python
data-science education http imperative-programming object-oriented-programming python testing
Last synced: 30 Oct 2025
https://github.com/shwetajoshi601/world-bank-data-analysis
An Exploratory Data Analysis on the World Bank Dataset.
analysis data-science eda python3 world-bank-api worldbank
Last synced: 02 Aug 2025
https://github.com/bodo-ai/pydough
Analytics DSL for Python
analytics artificial-intelligence big-data data-science defog defog-ai machine-learning pandas python sql text-to-analytics text-to-sql tpch
Last synced: 22 May 2026
https://github.com/srohit0/datasciencegraphalgorithms
Selected Graph Algorithms
astar astar-algorithm astar-pathfinding astar-search cpp data-science datascience depth-first-search dfs-algorithm dijkstra-algorithm dijkstra-shortest-path graph graph-algorithms graph-theory kosaraju kruskal-algorithm prim-algorithm strongly-connected-components topological-sort transpose
Last synced: 15 Apr 2025
https://github.com/chongyasong/youml
YouML: A Machine Learning Toolkit
ai artificial-intelligence big-data data-mining data-science machine-learning matplotlib numpy pandas python scikit-learn scipy
Last synced: 11 Apr 2025
https://github.com/jose-jaen/airbnb
Airbnb price prediction using Machine Learning and Deep Learning
ai algorithms bayes bayesian-optimization bayesian-statistics data-science deep-learning deployment econometrics machine-learning python streamlit xai
Last synced: 15 Apr 2025
https://github.com/mathewroy/ynabr
Analyze and visualize your You Need A Budget (YNAB) data. YNAB meets the R programming language.
api data-analysis data-science data-visualization r ynab ynab-api
Last synced: 30 Jul 2025
https://github.com/bsomps/OpenGeoPlotter
A PyQt5 app catered to the exploration industry for visualizing geologic drill hole data with features like cross-sections, simple 3D views, strip logs, scatter plots, and downhole line plots. Includes data transformation techniques like factor analysis, desurveying, and alpha-beta conversion.
cross-sections data-science drilling exploration geology geoscience pyqt5 python strip-logs
Last synced: 05 Mar 2025
https://github.com/subugoe/scholcomm_analytics
Scholarly Communication Analytics with R Blog
bibliometrics data-science distill library rstats scholarly-communication-analytics
Last synced: 24 Feb 2026
https://github.com/inphyt/imdb_sentiment_analysis_bert
BERT Sentiment Classification on the IMDb Large Movie Review Dataset.
bert bert-model data-mining data-mining-algorithms data-mining-python data-science machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning scikit-learn sentiment-analysis sentiment-classification spacy spacy-models spacy-nlp
Last synced: 30 Apr 2025
https://github.com/qpwedev/blockchain-network-visualizer
Blockchain Network Visualizer for TON.
blockchain data-science network ton toncoin
Last synced: 14 Mar 2025
https://github.com/seandavi/machinelearningintro
Machine learning use cases for teaching
data-science machine-learning r rstats teaching-materials tutorial
Last synced: 05 Apr 2025
https://github.com/csfelix/csfelix.github.io
🌱 My Personal Portfolio 🌱
css data-science data-science-competition data-science-portfolio data-science-projects html javascript js portfolio
Last synced: 05 Aug 2025
https://github.com/h2oai/article-information-2019
Article for Special Edition of Information: Machine Learning with Python
data-science explainable-ai explainable-ml fairness-ai fairness-ml fairness-testing fatml iml interpretable-ai interpretable-machine-learning interpretable-ml machine-learning machine-learning-interpretability python xai
Last synced: 07 Apr 2025
https://github.com/matteocargnelutti/maguire-lab-seizure-detection-webapp
🧠 Maguire Lab's Deep Learning Seizure Detection WebApp.
data-science eeg-signals-processing neuroscience
Last synced: 21 Apr 2025
https://github.com/tomaztk/list_of_r_packages_for_data_scientist
List of useful R packages for data scientists
data-science r r-language r-markdown r-package r-programming statistics
Last synced: 16 May 2025
https://github.com/khadkarajesh/internship-preparation-kit
Repository consisting of the technical and behavioural questions asked by French tech companies for internships
algorithm algorithms coding-interviews codinggame data-science data-structures data-structures-and-algorithms french hacktoberfest hacktoberfest-accepted hacktoberfest2022 internship interview interview-practice interview-preparation interview-questions interview-test leetcode python software-engineering
Last synced: 15 Jul 2025
https://github.com/sjcobb/ai-duet-3d
3D music animation + machine learning (in development)
3d-animation 3d-audio 3d-game artificial-intelligence browser-game data-science data-visualization game-development generative-music javascript machine-learning music music-bot music-composition music-theory music-visualizer neural-network web-development youtube-channel
Last synced: 28 Oct 2025
https://github.com/jhrcook/tidy-tuesday
#TidyTuesday to practice data analysis in R
data-analysis data-science r regression-models rlang tidytuesday tidytuesday-challenge tidyverse
Last synced: 28 Oct 2025
https://github.com/faridrashidi/cnsplots
🎨 Toolkit for generating publication-quality plots for Cell, Nature and Science journals
data-science data-visualization plotting publication-quality python scientific-publications
Last synced: 06 Apr 2026
https://github.com/giswqs/notebook-share
A repo for sharing notebooks
data-science dataviz geospatial jupyter-notebook mapping notebook
Last synced: 07 May 2025
https://github.com/alan-turing-institute/hds-discussiongroup
Repo of the Turing's Humanities & Data Science Discussion Group
data-science digital-humanities discussion-group
Last synced: 03 Mar 2026
https://github.com/sjcobb/webxr-threejs-midi-visualizer
WebXR, augmented reality MIDI data visualization, built with Three.js and Tone.js. See video: https://youtu.be/lIecCGtbqSM
3d aframe cannonjs data-science data-visualization depth-estimation game-development hit-detection javascript midi music-theory physics three threejs tone tonejs webvr webxr
Last synced: 12 Jul 2025
https://github.com/zen-reportz/zen_dash
Simple, fast, scalable, production-grade dashboard application. The right solution for teams.
dashboard data-analytics data-science fastapi flask python3 shiny streamlit
Last synced: 13 Apr 2025
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 12 Apr 2025
https://github.com/eyadsibai/machine-learning-docker-image
Data Science/Machine Learning Docker Image for CPU
data-science docker docker-image google-cloud machine-learning
Last synced: 30 Apr 2025
https://github.com/sdpython/mlstatpy
Mathematics, Algorithms, Data Science, Teaching Materials
algorithms data-science mathematics python3 teaching-materials
Last synced: 23 Jun 2025
https://github.com/xuri/excelize-py
Excelize is a Python port of the Go Excelize library that allows you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.
calculation chart data-analysis data-science data-visualization ecma-376 excel excelize golang microsoft office ooxml pipy python spreadsheet visualization xlsm xlsx xlsxreader xlsxwriter
Last synced: 07 May 2025
https://github.com/openbridge/ob_pysh-db
pysh-db - The Data Science Toolkit (DSK)
bash data-science mysql postgres python redshift sql
Last synced: 10 Apr 2025
https://github.com/fabsta/interesting_notebooks
A collection of Data Science Jupyter notebooks (reference material)
data-science eda jupyter-notebook kaggle machine-learning python
Last synced: 03 Jul 2025
https://github.com/devinterview-io/reinforcement-learning-interview-questions
🟣 Reinforcement Learning interview questions and answers to help you prepare for your next machine learning and data science interview in 2024.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions reinforcement-learning reinforcement-learning-interview-questions reinforcement-learning-questions reinforcement-learning-tech-interview software-engineer-interview technical-interview-questions
Last synced: 15 Jun 2025
https://github.com/devinterview-io/bias-and-variance-interview-questions
🟣 Bias And Variance interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions bias-and-variance bias-and-variance-interview-questions bias-and-variance-questions bias-and-variance-tech-interview coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 08 Feb 2026
https://github.com/martincastroalvarez/html2vec
Algorithm that converts an HTML document to a vectorized object suitable for neural networks.
data-science html2vec natural-language-processing python web-scraping word2vec
Last synced: 11 Apr 2025
https://github.com/alugowski/matrepr
Format matrices and tensors to HTML, string, and LaTeX, with Jupyter integration.
data-science data-visualization data-viz graphblas jupyter numpy numpy-matrix pytorch scipy sparse sparse-data sparse-matrices sparse-matrix sparse-representations tensor tensorflow torch
Last synced: 12 Apr 2025
https://github.com/clojurecivitas/clojurecivitas.github.io
An open effort to structure learning resources with meaningful connections.
blog clay clojure data-science literate markdown notebooks
Last synced: 24 Jun 2025
https://github.com/jimbrig/lossrx
An R package, plumber API, database, and Shiny App for Actuarial Loss Development and Reserving Workflows.
actuarial-science claims-data claims-reserving data-science insurance modelling property-casualty reserving rpackage rshiny rstats workflow
Last synced: 01 Jul 2025
https://github.com/krypty/trefle
Trefle is a scikit-learn compatible estimator implementing the FuzzyCoCo algorithm that uses a cooperative coevolution algorithm to find and build interpretable fuzzy systems.
data-science deap evolutionary-algorithm fuzzy-logic interpretability machine-learning python scikit-learn
Last synced: 29 Oct 2025
https://github.com/zenml-io/template-starter
A template for a starter project for ZenML
cookiecutter copier-template data-science machine-learning mlops zenml
Last synced: 14 Apr 2025
https://github.com/dmedri/roaster
R - Fetch, build and deploy.
build-tool data-science rstats statistical-analysis statistics virtual-environments
Last synced: 30 Jul 2025
https://github.com/doarakko/kagoole
Search Kaggle competitions and solutions by data type, prediction type, evaluation metric, etc.
artificial-intelligence data-science heroku kaggle kaggle-competition kaggle-solution machine-learning webapp
Last synced: 17 Oct 2025
https://github.com/trafficgcn/osmnx_adjacency_matrix_for_graph_convolutional_networks
Creating an Adjacency Matrix Using the Dijkstra Algorithm for Graph Convolutional Networks (GCNs)
adjacency-matrix data-science dijkstra dijkstra-algorithm gcn graph graph-algorithms graph-convolutional-networks matrix metrla open-street-map optimal-route osm osmnx python traffic traffic-analysis traffic-congestion
Last synced: 27 Oct 2025
https://github.com/anshchoudhary/xgmodel
This repository contains code to predict the Expected Goals (xG) from shots in football using various machine learning models.
data-science football-analytics football-data machine-learning machine-learning-algorithms
Last synced: 10 Apr 2025
https://github.com/mmore500/teeplot
Organize data visualization output, automatically picking meaningful names based on semantic plotting variables
data-science data-visualization python python-package workflow
Last synced: 25 Feb 2026
https://github.com/urbanclimatefr/coursera-learn-sql-basics-for-data-science
This repository contains the materials for "Learn SQL Basics for Data Science", a specialization provided by the University of California, Davis through Coursera.
Last synced: 19 Feb 2026
https://github.com/dina-hosny/chaincare
ChainCare is a health information system that uses smart contracts to handle medical procedures and stores medical histories on blockchains.
api-rest bigchain blockchain blockchain-technology data-science data-storage data-visualization ethereum golang health-informatics-systems healthcare insomnia metamask postgresql postman reactjs solidity truffle web3
Last synced: 13 Apr 2026
https://github.com/fabriziomusacchio/python_neuro_practical
This is the course material for the advanced course on Python for Data Scientists.
data-analysis data-science jupyter jupyter-notebook jupyter-notebooks open-source python teaching teaching-materials
Last synced: 22 Jul 2025
https://github.com/juniortorresmtj/projeto_deupositivo
Open Data Analysis Project - SUS
alura bootcampds brazil data-science projeto python
Last synced: 29 Jul 2025
https://github.com/kennethleungty/wikipedia-scraping-with-llm-agents
Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling
artificial-intelligence data-analytics data-mining data-science deep-learning genai generative-ai langchain large-language-models llm machine-learning nlp openai openai-functions web-scraping wikipedia
Last synced: 12 Jul 2025
https://github.com/anshumansinha3301/occupational-hazard-analysis
The Occupational Hazard Analysis Using Industry Data project aims to analyze safety metrics across various industries to identify trends in reported incidents, injuries, and fatalities.
consulting-services data-science industrialisation jupyter-notebook python
Last synced: 09 Oct 2025
https://github.com/devinterview-io/linear-algebra-interview-questions
🟣 Linear Algebra interview questions and answers to help you prepare for your next machine learning and data science interview in 2025.
ai-interview-questions coding-interview-questions coding-interviews data-science data-science-interview data-science-interview-questions data-scientist-interview interview-practice interview-preparation linear-algebra linear-algebra-interview-questions linear-algebra-questions linear-algebra-tech-interview machine-learning machine-learning-and-data-science machine-learning-interview machine-learning-interview-questions software-engineer-interview technical-interview-questions
Last synced: 07 Feb 2026
https://github.com/anaclumos/heart-diagnosis-engine
2019 Korean Minjok Leadership Academy graduation project
data-science machine-learning pandas python scikit-learn
Last synced: 22 Aug 2025
https://github.com/ndxdeveloper/formation-python
Python training - From beginner to advanced | 13 modules (FastAPI, Type Hints, Data Science, SQLAlchemy, asyncio) | 75+ topics | 100% in French | MIT License
api-rest asyncio data-science developpement fastapi formation francais french learning numpy pandas poetry poo programmation pytest python python3 sqlalchemy type-hints
Last synced: 08 Apr 2026
https://github.com/dogukanayd/catch-tweet-with-keyword
Fetch tweets matching a given keyword and run keyword analysis
data-analysis data-mining data-science datascience keyword-analysis python python27 social-media social-network social-network-analysis tweet tweets twitter twitter-analysis twitter-api twitter-oauth twitter-sentiment-analysis twitterwordcloud wordcloud
Last synced: 30 Aug 2025
https://github.com/firaskahlaoui/heart-disease-analysis-r
R for data visualization and analysis of heart disease datasets.
data-science data-visualization ggplot kaggle-dataset r statistics
Last synced: 14 Apr 2025
https://github.com/hassaku/audio-plot
Python library that converts a line graph to sound and returns an object that can be played in a Jupyter notebook or Google Colab. Values are represented by pitches, and the timeline is represented by left and right pans. It was created to make data science fun for the visually impaired.
audio-plot colab data-science jupyter-notebook python visually-impaired
Last synced: 01 Nov 2025
https://github.com/networks-learning/discussion-complexity
Code for "On the Complexity of Opinions and Online Discussions", WSDM 2019
complexity data-science discussion online-discussions opinion-mining paper wsdm
Last synced: 10 Aug 2025
https://github.com/mertguvencli/keyword-extractor
This project aims to find "what are the trending techs in Data Science jobs?" using NER.
data-science machine-learning ner nlp python spacy
Last synced: 10 Sep 2025
https://github.com/rasmusrynell/predicting-nhl
The project explores the idea of using different machine learning techniques to determine different stats in NHL games.
ai algorithms data-science database machine-learning ml nhl nhl-api python scikit-learn sports sports-analytics sports-stats sportsanalytics
Last synced: 14 Apr 2025
https://github.com/fozouni/data_science
Source code for the first "Data Science Course"
artificial-intelligence data-science datascience deep-learning excel machine-learning python
Last synced: 04 Sep 2025
https://github.com/aruizeac/alexandria
The Alexandria Project is an open-source platform where people can share their knowledge through books, podcasts, docs and videos.
alexandria data-science donation ebooks go golang grpc http kafka knowledge knowledge-sharing library microservice podcasts python societies streaming videos webservice
Last synced: 11 Mar 2026
https://github.com/dhimmel/openskistats
The study of skiing where we shred open data like pow. Quantifying alpine ski areas with geospatial metrics derived from OpenStreetMap.
data-science data-visualization downhill elevation geospatial gis mapping open-data openskimap openstreetmap orientation python quarto ski-areas skiing slope snowpack solar-irradiance sunlight topography
Last synced: 21 Jul 2025
https://github.com/strazto/mandrake
📖🐉- Bring reading the manual 📖 closer to your drake 🐉 workflow 🔥
data-science drake high-performance-computing makefile pipeline r r-package reproducibility reproducible-research rstats workflow
Last synced: 13 Jul 2025
https://github.com/bradflaugher/ai-101
Notes, links, code samples, and resources for teaching yourself PyTorch and TensorFlow.
bootcamp course data-engineering data-science learn-to-code learning-by-doing learning-python machine-learning
Last synced: 10 May 2025
https://github.com/blurred-machine/data-science
This repository contains all of the minor projects I built during the learning phase of Machine Learning and Data Science. Feel free to create a PR for modifications.
algorithms-python data-science jupyter-notebook learning-by-doing machine-learning-algorithms minor-project python
Last synced: 27 Apr 2025
https://github.com/numeract/rflow
Flexible R Pipelines with Caching
cache data-science pipeline r rflow
Last synced: 28 May 2026
https://github.com/lambdaclass/data_etudes
LambdaClass statistics, machine learning and data science etudes
data-science notebook probability statistics
Last synced: 09 Apr 2025
https://github.com/koalaverse/analyticssummit19
Material for 2019 Analytics Summit Machine Learning with R Training
data-science educational-materials machine-learning r workshop-materials
Last synced: 15 May 2025
https://github.com/edaaydinea/op1-prediction-of-the-different-progressive-levels-of-alzheimer-s-disease
This is an optional model development project on a real dataset related to predicting the different progressive levels of Alzheimer’s disease (AD).
alzheimer-disease-prediction anova-test catboost-classifier chi-square-test data-science deep-neural-networks keras-neural-networks lightgbm-classifier logistic-regression machine-learning multi-layer-perceptron-classifier neural-networks random-forest-classifier tensorflow xgboost-classifier
Last synced: 11 Apr 2025
https://github.com/the-akira/datascience
Collection of resources on Data Science with Python.
data data-analysis data-science data-structures data-visualization machine-learning machine-learning-algorithms mathematics pandas pandas-dataframe portuguese-language python3 scikit-learn statistics sympy
Last synced: 07 May 2025
https://github.com/laminetourelab/tutorial
Tutorials on machine learning and artificial intelligence, both in general and in biomedical research.
artificial-intelligence bioinformatics bioinformatics-tutorials computer-vision data-science data-visualization-dashboard deep-learning graph-machine-learning image-analysis machine-learning natural-language-processing plotly-dash python pytorch scrna-seq shiny-apps tensorflow-tutorials transfer-learning tutorial-code tutorials
Last synced: 24 Oct 2025
https://github.com/alexioannides/notes-and-demos
Study notes and demos.
data-engineering data-science ml-engineering mlops python
Last synced: 29 Oct 2025
https://github.com/jeonghunyoon/machine-learning-lecture-notes
Lecture notes and code for machine learning
data-science decision-tree deep-learning lecture-notes linear-algebra linear-regression lsa machine-learning naive-bayes-classifier statistics
Last synced: 10 Apr 2025
https://github.com/hourout/linora
Simple and efficient tools for data science.
data-analysis data-mining data-science hyperparameter-optimization lightgbm machine-learning python xgboost
Last synced: 04 Apr 2025
https://github.com/alvarobartt/ea-associate-ds
Electronic Arts (EA) NLP Assignment for: Associate Data Scientist
data-science electronic-arts nlp recruitment-task
Last synced: 12 Apr 2025
https://github.com/florents-tselai/sqlite-for-data-scientists
Notebooks and supporting files for the SQLite for Data Scientists Online Live Training on the O'Reilly Learning Platform
data-science learning sql sqlite3 training-materials
Last synced: 11 Apr 2025