Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2026-06-30 00:07:43 UTC
- JSON Representation
https://github.com/vertica/verticapy
VerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
big-data data-science data-visualization machine-learning preparation python python-library vertica
Last synced: 06 Apr 2026
https://github.com/aws/amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
amazon-redshift aws-redshift data-analysis data-science
Last synced: 04 Mar 2026
https://github.com/matyushkin/ds
👨🔬 In Russian: Обновляемая структурированная подборка бесплатных ресурсов по тематикам Data Science: курсы, книги, открытые данные, блоги и готовые решения.
bookshelf cheatsheets courses data data-science machine-learning reddit roadmap russian russian-language statistics
Last synced: 04 Apr 2026
https://github.com/anki-code/xonsh-cheatsheet
Cheat sheet for xonsh shell with copy-pastable examples. The best doc for the new users.
awesome awesome-cheatsheet cheat-sheet cheat-sheets cheatsheet cheatsheets console data-science devops devops-scripts hacking shell terminal xonsh xontrib
Last synced: 31 Dec 2025
https://github.com/vatshayan/final-year-project-cryptographic-technique-for-communication-system
Top B.tech/M.tech Final Year Project "Design and Analysis of Cryptographic Technique for Communication System" with Project Code, Report, PPT, Synopsis, IEEE Research Paper and HD Video Explanation
algorithms btech-project cipher-algorithms ciphers college-project college-projects computer-science-project cryptography cryptography-algorithms cryptography-tools cse-project data-science final-year-project final-year-projects finalyearproject ieee machine-learning mtech-project python research-paper
Last synced: 04 Apr 2025
https://github.com/ocademy-ai/machine-learning
Learn AI together, for free. AI learning and teaching resources for everyone.
ai data-engineering data-science deep-learning jupyter jupyter-notebook machine-learning ml mlops python scikit-learn visualization
Last synced: 16 Apr 2025
https://github.com/ashutosh1919/truvisory
This project is meant to provide resources to users who want to access good LinkedIn posts which contains resources to learn any Technology, Design, Self-Branding, Motivation etc. You can visit project by:
career data-science design full-stack linkedin linkedin-posts linkedin-profile marketing motivation opensource react react-template reactjs self-branding stacks ui-design website-design website-template
Last synced: 03 Apr 2025
https://github.com/toUpperCase78/formula1-datasets
Datasets & Analyses for Formula 1 World Championship
analysis data-science datasets formula1 jupyter-notebook motorsports python racing
Last synced: 26 Sep 2025
https://github.com/coqui-ai/trainer
🐸 - A general purpose model trainer, as flexible as it gets
ai data-science deep-learning machine-learning pytorch
Last synced: 16 May 2025
https://github.com/datatalksclub/datatalksclub.github.io
The web page for DataTalks.Club
data-science jekyll machine-learning
Last synced: 25 Oct 2025
https://github.com/trainingbypackt/data-science-for-marketing-analytics
Achieve your marketing goals with the data analytics power of Python
data-science data-visualization matplotlib numpy pandas python seaborn
Last synced: 09 Apr 2025
https://github.com/zeno-ml/zeno
AI Data Management & Evaluation Platform
ai data-science evaluation evaluation-framework machine-learning python
Last synced: 18 Apr 2025
https://github.com/google-aai/sc17
SuperComputing 2017 Deep Learning Tutorial
data-science deep-learning google-cloud-platform machine-learning tutorial
Last synced: 19 Jul 2025
https://github.com/fastverse/fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
c cpp data-aggregation data-manipulation data-science data-transformation high-performance low-dependency panel-data r rstats statistical-computing time-series weights
Last synced: 12 Dec 2025
https://github.com/speedml/speedml
Speedml is a Python package to speed start machine learning projects.
data-science machine-learning python
Last synced: 12 Jul 2025
https://github.com/shaildeliwala/delbot
It understands your voice commands, searches news and knowledge sources, and summarizes and reads out content to you.
ai bot bots chatbot data-science flask natural-language-processing python
Last synced: 16 May 2025
https://github.com/Speedml/speedml
Speedml is a Python package to speed start machine learning projects.
data-science machine-learning python
Last synced: 09 May 2025
https://github.com/dylan-profiler/visions
Type System for Data Analysis in Python
data-analysis data-science hacktoberfest numpy pandas python spark type-inference type-system
Last synced: 15 May 2025
https://github.com/Foundations-of-Applied-Mathematics/Labs
Labs for the Foundations of Applied Mathematics curriculum.
algorithms applied-mathematics applied-mathematics-curriculum computational-mathematics curriculum data-science linear-algebra python
Last synced: 22 Jul 2025
https://github.com/activitysim/activitysim
An Open Platform for Activity-Based Travel Modeling
activitysim bsd-3-clause data-science microsimulation python travel-modeling
Last synced: 09 Sep 2025
https://github.com/A3Data/hermione
ML made simple
data-science hermione machine-learning python
Last synced: 27 Mar 2025
https://github.com/dataplane-app/dataplane
Dataplane is an Airflow inspired unified data platform with additional data mesh and RPA capability to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.
airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows
Last synced: 27 Dec 2025
https://github.com/build-on-aws/cloud-clubs-learner-library
A library for learners! Whether or not you're a part of AWS Cloud Clubs, take a look in this library for free, open, leveled content for students 18+ worldwide
ai aws containers data-analytics data-science databases iot kubernetes ml mobile-development security serverless web web-development
Last synced: 09 Apr 2025
https://github.com/saimadhu-polamuri/DataAspirant_codes
Complete machine learning model codes
data-mining data-science machine-learning python
Last synced: 13 Jul 2025
https://github.com/mvlearn/mvlearn
Python package for multi-view machine learning
data-science machine-learning multiview-learning python
Last synced: 21 Oct 2025
https://github.com/Fixy-TR/fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
acikhack2 ai artificial-intelligence bert data-science deep-learning deeplearning keras natural-language-processing neural-network neural-networks nlp python
Last synced: 03 May 2025
https://github.com/tirendazacademy/pandas-tutorial
Jupyter Notebooks and Data Sets for Pandas Library
data data-analysis data-preprocessing data-science machine-learning pandas pandas-dataframe pandas-datareader pandas-library pandas-python pandas-series pandas-tricks-for-data-manipulation pandas-tutorial python
Last synced: 06 Apr 2025
https://github.com/tmthyjames/achoo
Achoo uses a Raspberry Pi to predict if my son will need his inhaler on any given day using weather, pollen, and air quality data. If the prediction for a given day is above a specified threshold, the Pi will email his school nurse, and myself, notifying her that he may need preemptive treatment. Community-sourced health monitoring!
air-quality-data data-science diy pollen prediction python r raspberry-pi weather
Last synced: 07 May 2025
https://github.com/ahammadmejbah/fueling-ambitions-via-book-discoveries
This series uncovers the most valuable insights from groundbreaking books in AI, Machine Learning, and Data Science, helping you accelerate your learning journey. Each episode transforms complex theories into practical knowledge, making advanced topics more accessible and actionable.
data-science data-structures data-visualization deep-learning generative-ai machine-learning
Last synced: 12 Apr 2025
https://github.com/predict-idlab/powershap
A power-full Shapley feature selection method.
data-science feature-selection machine-learning shap
Last synced: 07 Jul 2025
https://github.com/george0st/qgate-sln-mlrun
MLRun/Iguazio/Nuclio quality gate solution.
artificial-intelligence data-science e2e feature-store genai iguazio machine-learning mlops mlrun mlrun-test nuclio quality-assessment quality-assurance quality-gate testing
Last synced: 06 Apr 2025
https://github.com/Ronak-59/Stock-Prediction
Smart Algorithms to predict buying and selling of stocks on the basis of Mutual Funds Analysis, Stock Trends Analysis and Prediction, Portfolio Risk Factor, Stock and Finance Market News Sentiment Analysis and Selling profit ratio. Project developed as a part of NSE-FutureTech-Hackathon 2018, Mumbai. Team : Semicolon
algorithms artificial-intelligence data-science lstm-neural-network machine-learning risk-analysis sentiment-analysis stock-prediction stock-price-prediction visualisation
Last synced: 02 Jun 2026
https://github.com/blobcity/python-for-data-science
A collection of Jupyter Notebooks for learning Python for Data Science.
data-science jupyter jupyter-notebook jupyter-notebooks learn-python python
Last synced: 13 Feb 2026
https://github.com/oracle-samples/oci-data-science-ai-samples
This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.
ai conda data-science data-science-notebooks deep-learning jupyter-notebook machine-learning oci oracle-cloud-infrastructure python
Last synced: 15 May 2025
https://github.com/netflix/metaflow-service
:rocket: Metadata tracking and UI service for Metaflow!
ai data-science machine-learning metaflow ml ml-infrastructure ml-platform productivity ui
Last synced: 01 Jul 2025
https://github.com/jgoerner/data-science-stack-cookiecutter
🐳📊🤓Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyster, Superset, Postgres, Minio, AirFlow & API Star)
airflow apistar cookiecutter data-science docker docker-image jupyter minio postgres python superset
Last synced: 29 Mar 2025
https://github.com/hukenovs/coursera_ml_da_specialization
Coursera Specialization: Machine Learning and Data Analysis (Yandex & MIPT)
certificates convolutional-neural-networks coursera data-engineering data-mining data-science data-visualization deep-learning jupyter machine-learning machine-learning-algorithms machine-learning-coursera mipt neural-network python supervised-learning unsupervised-learning yandex
Last synced: 25 Jun 2025
https://github.com/iterative/vscode-dvc
Machine learning experiment tracking and data versioning with DVC extension for VS Code
data data-science dvc machine-learning python visual-studio-code vscode vscode-extension
Last synced: 18 Jun 2025
https://github.com/benedekrozemberczki/danmf
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
autoencoder cikm clustering community-detection coordinate-descent danmf data-science deep-learning deepwalk dimensionality-reduction embedding gemsec machine-learning mnmf nmf node-embedding node2vec sklearn unsupervised-learning word2vec
Last synced: 06 May 2025
https://github.com/Laurae2/Laurae
Advanced High Performance Data Science Toolbox for R by Laurae
data-science laurae machine-learning r supervised-learning xgboost
Last synced: 20 Jul 2025
https://github.com/rhenanbartels/hrv
A Python package for heart rate variability analysis
data-science hacktoberfest hrv python signal-processing
Last synced: 05 Apr 2025
https://github.com/danaugrs/go-tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
3d data-science dimensionality-reduction go machine-learning tsne unsupervised-learning visualization
Last synced: 30 Apr 2025
https://github.com/storieswithsiva/Data-Science-Resources
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
artificial-intelligence artificial-neural-networks data data-analysis data-analytics data-mining data-science data-science-resource data-science-resources data-scientist data-scientists data-visualization data-world datascience dataset learning learning-kit machine-learning python repository
Last synced: 10 Apr 2025
https://github.com/kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
data-engineering data-pipelines data-science dataset dvcs machine-learning mlops
Last synced: 29 Dec 2025
https://github.com/ideonate/cdsdashboards
JupyterHub extension for ContainDS Dashboards
bokeh data-science jupyter jupyterhub panel plotly-dash rshiny streamlit visualization
Last synced: 04 Apr 2025
https://github.com/ahammadmejbah/machine-learning-book-collections
Machine learning is the study and development of data-driven strategies to enhance task performance. AI includes it.
data-science deep-learning machine-learning
Last synced: 05 Mar 2025
https://github.com/benedekrozemberczki/DANMF
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
autoencoder cikm clustering community-detection coordinate-descent danmf data-science deep-learning deepwalk dimensionality-reduction embedding gemsec machine-learning mnmf nmf node-embedding node2vec sklearn unsupervised-learning word2vec
Last synced: 27 Mar 2025
https://github.com/jthomasmock/gtextras
A Collection of Helper Functions for the gt Package.
data-science data-visualization datascience ggplot2 gt plots r rstats sparkline sparkline-graphs sparklines tables
Last synced: 12 Apr 2025
https://github.com/ivnvxd/pyquest
Python everything Cheatsheet and a Journey to the land of Python programming
algorithms architecture cheatsheet concurrency data-science data-structures data-types database fundamentals jupyter-notebook learn oop python standard-library tutorial web-development
Last synced: 21 Jan 2026
https://github.com/nteract/bookstore
📚 Notebook storage and publishing workflows for the masses
data-science notebook nteract scheduling storage versioned-buckets
Last synced: 07 Apr 2025
https://github.com/agilescientific/striplog
Lithology and stratigraphic logs for wells or outcrop.
data-mining data-science geology petrophysics sedimentology swung-stack
Last synced: 09 Apr 2025
https://github.com/h2oai/nitro
Create apps 10x quicker, without Javascript/HTML/CSS.
app apps data-analysis data-science developer-tools devtools graphics h2o-nitro low-code python ui ui-components user-interface web-application webapp widget-library widgets
Last synced: 08 Apr 2025
https://github.com/rapidsai/node
GPU-accelerated data science and visualization in node
cuda data-science data-visualization gpgpu gpu nodejs
Last synced: 16 May 2025
https://github.com/LGE-ARC-AdvancedAI/auptimizer
An automatic ML model optimization tool.
automated-machine-learning automl data-engineering data-science deep-learning hpo hyperparameter-optimization hyperparameter-tuning machine-learning neural-networks
Last synced: 08 May 2025
https://github.com/analysiscenter/batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
data-science machine-learning pipeline pipeline-framework python python3 workflow workflow-engine
Last synced: 25 Oct 2025
https://lge-arc-advancedai.github.io/auptimizer/
An automatic ML model optimization tool.
automated-machine-learning automl data-engineering data-science deep-learning hpo hyperparameter-optimization hyperparameter-tuning machine-learning neural-networks
Last synced: 01 Apr 2025
https://github.com/ActivitySim/activitysim
An Open Platform for Activity-Based Travel Modeling
activitysim bsd-3-clause data-science microsimulation python travel-modeling
Last synced: 15 Mar 2025
https://github.com/yogeshhk/TeachingDataScience
Course notes for Data Science related topics, prepared in LaTeX
course-materials data-science deep-learning jupyter-notebooks latex machine-learning natural-language-processing open-source python
Last synced: 27 Mar 2025
https://github.com/cyb3r-monk/rita-j
Implementation of RITA (Real Intelligence Threat Analytics) in Jupyter Notebook with improved scoring algorithm.
cybersecurity data-science dfir jupyter-notebook threat-hunting
Last synced: 09 Mar 2026
https://github.com/jthomasmock/gtExtras
A Collection of Helper Functions for the gt Package.
data-science data-visualization datascience ggplot2 gt plots r rstats sparkline sparkline-graphs sparklines tables
Last synced: 29 Jul 2025
https://github.com/google-marketing-solutions/feedgen
Optimise Shopping feeds with Generative AI
data-science generative-ai google-cloud large-language-models llm machine-learning shopping-feed vertex-ai
Last synced: 04 Apr 2025
https://github.com/machine-learning-apps/actions-ml-cicd
A Collection of GitHub Actions That Facilitate MLOps
argo data-science devops kubernetes machine-learning machine-learning-platform ml-infrastructure ml-ops mlops
Last synced: 04 Mar 2026
https://github.com/AtomScott/SoccerTrack
A python package for turning sports video into csv files
computer-vision data-science football multi-object-tracking multiobject-tracking python soccer sports sports-analytics tracking
Last synced: 10 Mar 2025
https://github.com/microsoft/finnts
Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.
business data-science feature-selection finance finnts forecasting machine-learning microsoft r r-package rstats time-series
Last synced: 15 May 2025
https://github.com/alertadengue/pysus
Library to download, clean and analyze openly available datasets from Brazilian Universal health system, SUS.
data-science geospatial health
Last synced: 18 May 2026
https://github.com/drakearch/kaggle-courses
Kaggle courses and tutorials to get you started in the Data Science world.
data-science deep-learning machine-learning pandas python
Last synced: 16 Apr 2025
https://github.com/voila-dashboards/voici
Voici turns any Jupyter Notebook into a static web application
dashboards data-science emscripten jupyter jupyterlite voila-dashboard wasm
Last synced: 19 Jun 2025
https://github.com/coqui-ai/Trainer
🐸 - A general purpose model trainer, as flexible as it gets
ai data-science deep-learning machine-learning pytorch
Last synced: 19 Jul 2025
https://github.com/d5555/tageditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 20 Aug 2025
https://github.com/robmarkcole/hass-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 16 May 2025
https://github.com/gyrdym/ml_algo
Machine learning algorithms in Dart programming language
algorithm batch-gradient-descent classifier dart dartlang data-science hyperparameters lasso-regression linear-regression logistic-regression machine-learning machine-learning-algorithms mini-batch-gradient-descent regression sgd softmax softmax-algorithm softmax-classifier softmax-regression stochastic-gradient-descent
Last synced: 06 Apr 2025
https://github.com/chanmenglin/pandasversusexcel
Python数据分析入门,数据分析师入门
charts data-analysis data-charts data-science data-science-learning data-view data-visualization excel histogram learn-pandas learn-python matplotlib pandas pandas-excel python
Last synced: 31 Jul 2025
https://github.com/launchflow/buildflow
BuildFlow, is an open source framework for building large scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and BuildFlow handles the rest. No configuration outside of the code is required.
batch data-science pipeline python streaming
Last synced: 16 Jul 2025
https://github.com/d5555/TagEditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 12 May 2025
https://github.com/saezlab/decoupler-py
Python package to perform enrichment analysis from omics data.
bioinformatics data-science enrichment enrichment-analysis numba python single-cell spatial-transcriptomics transcriptomics
Last synced: 15 May 2025
https://github.com/multimeric/PandasSchema
A validation library for Pandas data frames using user-friendly schemas
data-science pandas schema validation
Last synced: 23 Apr 2025
https://github.com/multimeric/pandasschema
A validation library for Pandas data frames using user-friendly schemas
data-science pandas schema validation
Last synced: 12 Dec 2025
https://github.com/robmarkcole/HASS-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 06 Apr 2025
https://github.com/giswqs/geebook
Earth Engine and Geemap: Geospatial Data Science with Python
data-science dataviz earth-engine geemap geopython geospatial google-earth-engine ipyleaflet ipywidgets jupyter mapping python
Last synced: 24 Apr 2025
https://github.com/nabeel-oz/qlik-py-tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
advanced-analytics advanced-analytics-integration analytics clustering data-science deep-learning facebook-prophet fbprophet forecasting hdbscan keras machine-learning predictive-analytics python qlik qlik-oss qlik-sense scikit-learn server-side-extension sklearn
Last synced: 09 Mar 2026
https://github.com/tonybeltramelli/deep-spying
Spying using Smartwatch and Deep Learning
data-science deep-learning neural-networks privacy recurrent-neural-networks security wearable-devices
Last synced: 26 Jun 2025
https://github.com/milaan9/machine_learning_algorithms_from_scratch
This repository explores the variety of techniques and algorithms commonly used in machine learning and the implementation in MATLAB and PYTHON.
data-science decision-trees dynamic-time-warping error-functions fitting-algorithm frequentist-methods gaussian-naive-bayes machine-learning machine-learning-algorithms machine-learning-matlab machine-learning-python matlab4datascience naive-bayes-classifier python4datascience random-forest singular-value-decomposition tutor-milaan9 value-iteration-algorithm
Last synced: 30 Jun 2025
https://github.com/explosion/jupyterlab-prodigy
🧬 A JupyterLab extension for annotating data with Prodigy
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science jupyter jupyterlab labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Apr 2025
https://github.com/tonybeltramelli/Deep-Spying
Spying using Smartwatch and Deep Learning
data-science deep-learning neural-networks privacy recurrent-neural-networks security wearable-devices
Last synced: 18 Jul 2025
https://github.com/plotly/dash-oil-and-gas-demo
Dash Demo App - New York Oil and Gas
dash data-science data-visualization energy plotly python technical-computing
Last synced: 13 Jul 2025
https://github.com/seg/2016-ml-contest
Machine learning contest - October 2016 TLE
contest data-science fun geophysics geoscience machine-learning
Last synced: 19 Jul 2025
https://github.com/epistasislab/tpot2
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
adsp ag066833 aiml alzheimer alzheimers automated-machine-learning automation automl data-science feature-engineering gradient-boosting hyperparameter-optimization lm010098 machine-learning model-selection nia parameter-tuning python random-forest scikit-learn
Last synced: 09 Apr 2025
https://github.com/joeymeyer/raspberryturk
The Raspberry Turk is a robot that can play chess—it's entirely open source, based on Raspberry Pi, and inspired by the 18th century chess playing machine, the Mechanical Turk.
3d-printing chess computer-vision data-science machine-learning raspberry-pi robotics
Last synced: 12 May 2025
https://github.com/swoop-inc/spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
data-engineering data-science scala spark
Last synced: 07 May 2025
https://github.com/superkogito/voice-based-gender-recognition
:sound: :boy: :girl:Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
data-science gaussian-mixture-models gender gender-classification gender-detection gender-recognition gender-recognition-by-voice gmm machine-learning mel-frequencies mfcc scikit-learn scikit-learn-python signal speaker speech vocal voice
Last synced: 04 Apr 2025
https://github.com/blue-season/pywarm
A cleaner way to build neural networks for PyTorch.
clean-code data-science deep-learning keras machine-learning neural-network neural-networks python3 pytorch
Last synced: 09 Apr 2025
https://github.com/juliuskunze/jaxnet
Concise deep learning for JAX
data-science deep-learning jax machine-learning neural-networks python
Last synced: 09 Apr 2025
https://github.com/marimo-team/learn
📚 A curated collection of marimo notebooks for education.
artificial-intelligence data-analysis data-science data-visualization education learning machine-learning notebooks python
Last synced: 27 Jun 2025
https://github.com/salvatorera/tutorial
Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)
artificial-intelligence bioinformatics biology computer-vision convolutional-neural-networks data-science deep-learning graph image machine-learning natural-language-processing nlp python r streamlit streamlit-webapp tutorial tutorials vision-transformer
Last synced: 04 Apr 2025
https://github.com/microsoft/30daysof
30 Day of Learning Resources, Samples and Curricula
azure data-science powerapps pwa serverless staticwebapp
Last synced: 14 Apr 2026
https://github.com/setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
big-data data-analysis data-engineering data-science data-transformation dataset etl etl-pipeline framework machine-learning modularization pipeline scala setl spark
Last synced: 10 Apr 2025
https://github.com/SETL-Framework/setl
A simple Spark-powered ETL framework that just works 🍺
big-data data-analysis data-engineering data-science data-transformation dataset etl etl-pipeline framework machine-learning modularization pipeline scala setl spark
Last synced: 15 Apr 2025
https://github.com/youssefhosni/my-medium-articles-friendly-links
Friendly link to all of my medium articles
data-science deep-learning machine-learning python
Last synced: 10 Mar 2026
https://github.com/curiousily/Machine-Learning-from-Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
artificial-intelligence book classification data-science machine-learning machine-learning-algorithms neural-networks notebook recommender-systems regression reinforcement-learning sentiment-analysis
Last synced: 20 Jul 2025
https://github.com/Azure/DataScienceVM
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver
Last synced: 20 Jul 2025