An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/lightning-ai/lightning-habana

Lightning support for Intel Habana accelerators.

deep-learning machine-learning python pytorch

Last synced: 11 Apr 2025

https://github.com/lab-cosmo/atomistic-cookbook

A collection of simulation recipes for the atomic-scale modeling of materials and molecules

atomistic-simulations machine-learning modeling molecular-dynamics simulation

Last synced: 03 May 2025

https://github.com/jonathanwenger/pycalib

Non-Parametric Calibration for Classification (AISTATS 2020)

aistats-2020 machine-learning neural-networks probability-calibration

Last synced: 11 Apr 2025

https://github.com/ashwinpn/containers--ml-and-cloud-computing

Everything about how to deploy projects on the cloud, run ML workloads on the HPC cluster and on the cloud and the efficient configuration and management of related collaborative platforms [e.g. container orchestration].

aws cloud cloud-computing containers deep-learning deployment docker docker-image dockerfile google google-cloud-platform k8s-cluster kubernetes kubernetes-cluster machine-learning pytorch vagrant

Last synced: 13 Apr 2025

https://github.com/yashksaini-coder/multivariate-logistic-regression---telecom-churn

Building a logistic regression model for telecom churn prediction, utilizing 21 customer-related variables to predict whether a customer will switch to another telecom provider or not.

logistic-regression machine-learning multivariate-regression supervised-learning supervised-machine-learning

Last synced: 11 Apr 2025

https://github.com/si-cim/prototorch

ProtoTorch is a PyTorch-based Python toolbox for bleeding-edge research in prototype-based machine learning algorithms.

interpretable-ai lvq machine-learning python pytorch

Last synced: 11 Apr 2025

https://github.com/src-d/models

Machine learning models for MLonCode trained using the source{d} stack

machine-learning mlosc model nlp source-code

Last synced: 17 Feb 2026

https://github.com/msusazureaccelerators/intelligent-document-processing-accelerator

Showcase Azure platform’s machine learning capability to recognize document type, extract required fields and push data to downstream applications, significantly reducing manual efforts and creating smoother customer experience. with the Microsoft Intelligent Document Processing (Document Process Automation) Accelerator.

machine-learning microsoft ml solution-accelerator

Last synced: 12 Apr 2025

https://github.com/abhaskumarsinha/minimalgpt

MinimalGPT is a concise, adaptable, and streamlined code framework that encompasses the essential components necessary for the construction, training, inference, and fine-tuning of the GPT model. This framework is implemented exclusively using Keras and TensorFlow, ensuring compatibility and coherence within the broader deep learning ecosystem.

ai artificial-intelligence fine-tuning generative-model gpt gpt-2 gpt-models keras keras-tensorflow language-model llm machine-learning neural-network nlp nlp-machine-learning tensorflow tensorflow2 training transformer transformer-architecture

Last synced: 11 Jul 2025

https://github.com/srzstephen/disavu

A disaster response solution that helps allocate resources to where they're needed.

amazon-sagemaker-lab aws disaster-response fastai2 machine-learning python rust rust-lang sagemaker-studio-lab typescript

Last synced: 11 Apr 2025

https://github.com/octalpixel/skin-extraction-from-image-and-finding-dominant-color

Project is an implementation of skin segmentation using OpenCV and dominant color extraction using SciKit-Learn

image-processing kmeans kmeans-clustering machine-learning opencv python scikit-learn

Last synced: 12 Apr 2025

https://github.com/mapbox/gabbar

Guarding OpenStreetMap from harmful edits using machine learning

banished jupyter-notebook machine-learning openstreetmap scikit-learn vandalism

Last synced: 12 Apr 2025

https://github.com/pedrohserrano/machine-learning-use-cases

Machine Learning Notebooks with Turicreate and Keras in a Docker Container

deep-learning docker machine-learning notebook

Last synced: 10 Apr 2025

https://github.com/eshikashah/skillship-internship-data-science-projects

Utilized this lockdown to do something productive. SkillShip foundation provided and internship opportunity and here's the outcome. The projects made by me in these 2 months.

classification data-science internship machine-learning regression

Last synced: 28 Jul 2025

https://github.com/microsoft/masc

Microsoft's contributions for Spark with Apache Accumulo

accumulo apache big-data machine-learning spark

Last synced: 02 Oct 2025

https://github.com/slimgroup/fno4co2

Learned coupled inversion with Fourier neural operators

carbon ccs deep-learning fno inversion julia machine-learning time-lapse

Last synced: 24 Jul 2025

https://github.com/ubccr/terf

Go library for reading/writing TensorFlow TFRecords file format

golang machine-learning tensorflow-examples

Last synced: 27 Jul 2025

https://github.com/negrinho/research_toolbox

Utilities to help manage a machine learning experimental workflow

machine-learning research-data-management utilities utility-library workflow-management

Last synced: 06 Sep 2025

https://github.com/datacte/sdxl-training-improvements

📊 Research-focused SDXL training framework exploring novel optimization approaches. Goals include enhanced image quality, training stability & comprehensive monitoring. ⭐ Performance-focused research framework.

deep-learning diffusion machine-learning stable-diffusion stable-diffusion-xl

Last synced: 21 Aug 2025

https://github.com/spidy20/rice_disease_prediction_gui

Rice Disease Prediction App using SVM Machine learning algorithm with tkinter

deep-learning machine-learning rice-diseases rise-disease-prediction sklearn sm spidy20 tkinter-gui

Last synced: 18 Aug 2025

https://github.com/pkestene/ms-hpc-ai-gpu

resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI

cuda deep-learning gpu gpu-computing machine-learning physics-informed-neural-networks pinn pinns

Last synced: 19 Aug 2025

https://github.com/heet9022/recruitai

Recruitment Assisting platform which will help recruiters filter out resumes for a particular job profile

linkedin-scraper machine-learning recruiter resume-classfier

Last synced: 23 Sep 2025

https://github.com/BiomedSciAI/DPM360

Repository for Disease Progression Modeling workbench 360 - An end-to-end deep learning model training framework in python on OMOP data

deep-learning healthcare machine-learning ohdsi omop python pytorch sklearn

Last synced: 09 Aug 2025

https://github.com/nanowell/differential-transformer-pytorch

PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model similar to large language models (LLMs). The architecture incorporates a novel Differential Attention mechanism, Multi-Head structure, RMSNorm, and SwiGLU.

differential-transformer large-language-models machine-learning pytorch

Last synced: 31 Jul 2025

https://github.com/xai-demonstrator/xai-demonstrator

The XAI Demonstrator is a modular platform that lets users interact with production-grade Explainable AI (XAI) systems.

ai docker explainable-ai fastapi huggingface-transformers keras machine-learning mobile-first pytorch tensorflow vuejs xai

Last synced: 01 Sep 2025

https://github.com/trainingbypackt/professional-azure-sql-database-administration-second-edition

Equip yourself with the skills required to manage and maintain data on the Cloud.

azure database dtu georeplication machine-learning powershell sql ssms

Last synced: 03 Jul 2025

https://github.com/ahmetfurkandemir/online-istanbul-applied-data-science-102-bootcamp

Online Istanbul Applied Data Science 102 Bootcamp (Start : 15 August, Finish : 7 November)

bootcamp data-science deep-learning kodluyoruz machine-learning

Last synced: 15 Apr 2025

https://github.com/kentonishi/jtr-cvpr-2024

[CVPR 2024] Joint-Task Regularization for Partially Labeled Multi-Task Learning

computer-vision cvpr2024 machine-learning multitask-learning

Last synced: 23 Mar 2025

https://github.com/blankeos/scoliovis

🦴 Automated Cobb Angle Measurement on Anterior-Posterior Spine X-Rays using Multi-Instance Keypoint Detection with Keypoint RCNN Thesis Package

computer-vision fastapi machine-learning pytorch react webapp

Last synced: 03 Jul 2025

https://github.com/vmc-7645/YOLOv8-retail

Detect retail products via the YOLOv8 object recognition engine.

ai computer-vision deep-learning machine-learning object-detection pytorch yolo yolov8

Last synced: 21 Apr 2025

https://github.com/alvertogit/bigdata_docker

Big Data Docker Data Science Spark Spark4 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook

big-data data-science docker jupyter-lab jupyter-notebook machine-learning python scala spark spark4

Last synced: 10 Mar 2026

https://github.com/owlbarn/owl_symbolic

Connect Owl with other accelerators and numerical frameworks with symbolic maths

algebra gpu-computing machine-learning neural-networks numerical onnx scientific-computing symbolic-math

Last synced: 22 Jun 2025

https://github.com/jan-janssen/gmailsorter

Similarity based email sorting for Google Mail using RandomForest classifiers

google-mail-api machine-learning python random-forest-classifier

Last synced: 11 Apr 2026

https://github.com/wassname/phoneme2grapheme

Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")

cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling

Last synced: 03 Jul 2025

https://github.com/phreakyphoenix/mxnet-gluoncv-aws-coursera

This repo includes my solutions to the Coursera course offered by AWS titled "AWS Computer Vision: Getting Started with GluonCV", in addition to more tutorials and in-depth handson labs. Please :star2: the repo if you like it :point_up: Create an Issue or preferably a PR for any improvement. :rocket:

aws colab-notebook computer-vision coursera coursera-assignment coursera-assignment-solution deep-learning fashion-mnist gluon gluoncv image-classification machine-learning mnist-handwriting-recognition mxnet mxnet-gluon mxnet-gluon-interface mxnet-imageclassification mxnet-neural-network mxnet-notebooks object-detection

Last synced: 12 Mar 2026

https://github.com/bpesquet/pyfit

A minimalist neural networks library built on a tiny autograd engine

autodifferentiation machine-learning neural-networks python

Last synced: 02 Apr 2026

https://github.com/idiap/zff_vad

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

audio-processing machine-learning noise-robust signal-processing speech-activity-detection voice-activity-detection

Last synced: 05 Oct 2025

https://github.com/luizgh/avc_nips_2018

Code to reproduce the attacks and defenses for the entries "JeromeR" in the NIPS 2018 Adversarial Vision Challenge

adversarial-examples machine-learning

Last synced: 10 Apr 2025

https://github.com/zakroum-hicham/football-analysis-cv

This repository contains a computer vision/machine learning football project that uses YOLO for object detection, Kmeans for pixel segmentation, and perspective transformation to analyze player movements in football videos

ai computer-vision data-science football-analytics kmeans-clustering machine-learning opencv yolov8

Last synced: 26 Mar 2025

https://github.com/hdmamin/jabberwocky

An Alexa skill providing a conversational interface to any public figure (as mimicked by GPT3). The legacy GUI is no longer maintained.

alexa alexa-skill audio chatbot gpt3 machine-learning python speech-recognition

Last synced: 14 Jan 2026

https://github.com/fffaraz/ghsom-cpp

Growing Hierarchical Self-Organizing Map (GHSOM) implementation in C++

ai clustering cpp ghsom machine-learning qt som

Last synced: 10 Apr 2025

https://github.com/future-ai-org/desci-ai-experiments-py

👾 resources and experiments for autonomous agents (e.g., language models, reinforcement learning, energy based models)

autism chatgpt deep-learning gpt-3 machine-learning markov-chain openai quantum-ai reinforcement-learning rust supervised-machine-learning

Last synced: 30 Apr 2026

https://github.com/millengustavo/epileptic-seizure-recognition

Repository with Machine Learning techniques to identify Epileptic Seizure from EEG data.

brain-activity deep-learning eeg-data eeg-signal epilepsy machine-learning

Last synced: 10 Apr 2025

https://github.com/mskcc/mimsi

Microsatellite Instability Classification using Multiple Instance Learning

cancer-genomics deep-learning genomics machine-learning multiple-instance-learning

Last synced: 30 Apr 2025

https://github.com/artitw/bert_qa

Accelerating the development of question-answering systems based on BERT and TF 2.0

artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp

Last synced: 21 Mar 2025

https://github.com/amirabbasasadi/rockyml

⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems

cplusplus cpp deep-learning deep-neural-networks distributed-computing high-performance machine-learning mpi optimization parallel-computing scientific-computing

Last synced: 15 Jun 2025

https://github.com/iitzco/deepzoo

:wolf: :tiger: :whale2: :elephant: :monkey: Deep Learning model Zoo

artificial-intelligence deep-learning machine-learning modelzoo python

Last synced: 05 May 2025

https://github.com/rasbt/b3-basic-batchsize-benchmark

Experiments for the blog post "No, We Don't Have to Choose Batch Sizes As Powers Of 2"

deep-learning deep-neural-networks machine-learning neural-networks

Last synced: 05 May 2025

https://github.com/yashksaini-coder/multi-class-prediction-of-obesity-risk

Multi-Class Obesity Risk Prediction Project | Prediction of obesity risk in individuals using various factors, which is related to cardiovascular disease.

lgbm lgbmregressor machine-learning machine-learning-algorithms pipeline prediction prediction-model supervised-learning supervised-machine-learning xgb-regressor xgboost

Last synced: 11 Apr 2025

https://github.com/fieg/knn

k-Nearest Neighbors algorithm in PHP

k-nearest-neighbours knn machine-learning php

Last synced: 07 Jul 2025

https://github.com/pourmand1376/persiancrawler

Open source crawler for Persian websites.

crawler machine-learning news python scrapy tasnim text-classification

Last synced: 28 Oct 2025

https://github.com/andreacossu/relation-network-pytorch

Implementation of Relation Network and Recurrent Relational Network using PyTorch v1.3. Original papers: (RN) https://arxiv.org/abs/1706.01427 (RRN): https://arxiv.org/abs/1711.08028

machine-learning neural-reasoning python pytorch pytorch-implementation relation-network

Last synced: 14 Apr 2025

https://github.com/alessandrocorradini/harvard-data-analysis-for-life-science-xseries

Lectures, Code and Quizzes for the Data Science for Life Science XSeries

data-analysis datascience edx harvardx machine-learning

Last synced: 02 Jan 2026

https://github.com/omarsar/machine_learning_fundamentals

:green_book: Machine Learning - A Friendly Handbook :green_book: (Open Notes)

book handbook machine-learning neural-network

Last synced: 03 Jan 2026

https://github.com/0xnaman1/american-sign-language-detection-using-computer-vision

The project is about translating American Sign Language into English language. It uses Computer Vision and Deep Learning to predict the ASL alphabet and forms sentences on the basis of prediction. It uses text to speech to convert the predicted word into speech. The project was implemented at MNNIT Hack36 Allahabad Hackathon.

computer-vision deeplearning keras machine-learning neural-network opencv tensorflow texttospeech

Last synced: 27 Jun 2025

https://github.com/mcarlomagno/cardrivingresnet

🚗 Browser game where a vehicle is driven through the camera using the ResNet model (Residual Network) to estimate the position of the hands.

css html javascript machine-learning posenet residual-network resnet-model tensorflow tensorflowjs vehicle

Last synced: 28 Jun 2025

https://github.com/fracpete/collective-classification-weka-package

Semi-Supervised Learning and Collective Classification

java machine-learning plugin semi-supervised-learning weka

Last synced: 22 Apr 2025

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 16 Sep 2025

https://github.com/trainingbypackt/machine-learning-fundamentals

Use Python and scikit-learn to get up and running with the hottest developments in machine learning

jupyter-notebook machine-learning python3 scikit-learn

Last synced: 08 Mar 2026

https://github.com/microsoft/hack-workshop-lobe

Workshop for student hackathons focused on Lobe.ai

hackathon image-classification lobe lobe-ai machine-learning ml tensorflowjs workshop

Last synced: 11 May 2026

https://github.com/ofai/hub-toolbox-python3

Hubness analysis and removal functions

data-mining high-dimensional-data hubness machine-learning

Last synced: 11 Oct 2025

https://github.com/cossio/restrictedboltzmannmachines.jl

Train and sample Restricted Boltzmann machines in Julia

julia machine-learning rbm

Last synced: 11 Oct 2025

https://github.com/san089/big_data_project

Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an Ensemble model to classify whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 13 Mar 2026

https://github.com/valohai/tensorflow-example

TensorFlow examples for Valohai platform

machine-learning tensorflow

Last synced: 20 Oct 2025

https://github.com/lanterndata/lantern_extras

Routines for generating, manipulating, parsing, importing vector embeddings into Postgres tables

ai database image-processing knn machine-learning open-source postgres postgresql rust vector ycombinator

Last synced: 07 Oct 2025

https://github.com/qiushisun/statistical-methods-and-machine-learning

2021 Spring (Statistical Methods and Machine Learning) 统计方法与机器学习

anova ecnu-dase linear-regression machine-learning mnist-handwriting-recognition statistics

Last synced: 20 Aug 2025

https://github.com/lucko515/dataset-dimensionality-reduction-python

Here I've demonstrated how and why should we use PCA, KernelPCA, LDA and t-SNE for dimensionality reduction when we work with higher dimensional datasets.

dimensionality-reduction kernel-pca lda machine-learning pca tsne

Last synced: 14 Apr 2025

https://github.com/quantiusbenignus/blahst

Input text from speech in any Linux window, the lean, fast and accurate way, using whisper.cpp offline.

accessibility ai bloat-free bloatfree cli command-line command-line-tool desktop-integration gnome kiss llm machine-learning no-nonsense speech-recognition speech-to-text whisper whisper-cpp

Last synced: 04 Oct 2025

https://github.com/yhtang/graphdot

GPU-accelerated Marginalized Graph Kernel with customizable node and edge features; Gaussian process regression.

cheminformatics cuda gpu graph-algorithms machine-learning python

Last synced: 15 Apr 2025

https://github.com/apexrl/codail

Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>

imitation-learning machine-learning multi-agent-reinforcement-learning

Last synced: 07 Apr 2025

https://github.com/qanastek/drbert

DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains

bert biomedical french learning machine machine-learning medical ml nlp nlp-machine-learning taln text

Last synced: 16 Aug 2025

https://github.com/kyosek/ngboost-experiments

Play around with NGBoost and compare with LightGBM and XGBoost

boosting-algorithms house-price-prediction lightgbm machine-learning ngboost

Last synced: 15 May 2025

https://github.com/bloomberg/mixce-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

language-model machine-learning nlp python pytorch transformer

Last synced: 07 May 2025

https://github.com/c99koder/AudioClassifier-MQTT

Use the yamnet TensorFlow model to classify live audio from a microphone and publish the predicted results to Home Assistant via MQTT

audio audio-analysis home-assistant machine-learning mqtt python3 tensorflow-lite yamnet

Last synced: 07 Apr 2025

https://github.com/robertklee/kitti-roadseg

A course project for road segmentation using a U-Net Convolutional Neural Network on the KITTI ROAD 2013 dataset

computer-vision image-segmentation kitti-dataset machine-learning neural-network road-segmentation

Last synced: 12 Sep 2025

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A new maintained "successor" to scikit-multilearn, a scikit-learn based module for multi-label et. al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 21 Aug 2025