An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://github.com/blakejakopovic/nostr-spam-detection

An experiment in building a machine learning model to label Nostr spam content for filtering and relay rejection.

machine-learning nostr proof-of-concept spam-detection spam-filtering

Last synced: 18 Jan 2026

https://github.com/smith42/xdf-gan

A GAN for the generation of mock astronomical surveys

astronomy deep-learning gan machine-learning

Last synced: 27 Oct 2025

https://github.com/primaprashant/ai-customer-support

📚 Curated collection of blogs and papers on how different companies are using machine learning in production for better customer support.

ai applied-data-science applied-machine-learning applied-ml artificial-intelligence customer-service customer-support data-science deep-learning machine-learning natural-language-processing nlp paper production tech-blog

Last synced: 10 Feb 2026

https://github.com/otonomee/streamstem

Implements ML audio separation algorithm on audio from YouTube or Spotify resulting in "stems" for download (e.g. vocals, drums, bass) in MP3, WAV or FLAC.

audio-classification deep-learning demucs fastapi machine-learning source-separation spotify-stemmer youtube-stemmer

Last synced: 26 Oct 2025

https://github.com/lucastheis/trlda

Implementations of various online inference algorithms for LDA, with Python interface.

lda machine-learning python topic-modeling variational-inference

Last synced: 16 Mar 2026

https://github.com/lanterndata/lantern_extras

Routines for generating, manipulating, parsing, importing vector embeddings into Postgres tables

ai database image-processing knn machine-learning open-source postgres postgresql rust vector ycombinator

Last synced: 07 Oct 2025

https://github.com/gasteigerjo/lcn

Locally corrected Nyström (LCN), as proposed in "Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More" (ICML 2021)

kernel-approximation machine-learning optimal-transport paper

Last synced: 28 Feb 2026

https://github.com/idiap/zff_vad

Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering

audio-processing machine-learning noise-robust signal-processing speech-activity-detection voice-activity-detection

Last synced: 05 Oct 2025

https://github.com/guildai/guildai-r

Track machine learning experiments

deep-learning keras machine-learning

Last synced: 26 Oct 2025

https://github.com/ofai/hub-toolbox-python3

Hubness analysis and removal functions

data-mining high-dimensional-data hubness machine-learning

Last synced: 11 Oct 2025

https://github.com/wassname/phoneme2grapheme

Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")

cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling

Last synced: 03 Jul 2025

https://github.com/valohai/tensorflow-example

TensorFlow examples for Valohai platform

machine-learning tensorflow

Last synced: 20 Oct 2025

https://github.com/san089/big_data_project

Fake News Detection - Feature Extraction using Vectorization such as Count Vectorizer, TFIDF Vectorizer, Hash Vectorizer,. Then used an Ensemble model to classify whether the news is fake or not.

classifiers ensemble-model fakenewsdetection machine-learning news-classification scikit-learn text-mining textclassification vectorization vectorizers

Last synced: 13 Mar 2026

https://github.com/cossio/restrictedboltzmannmachines.jl

Train and sample Restricted Boltzmann machines in Julia

julia machine-learning rbm

Last synced: 11 Oct 2025

https://github.com/0xnaman1/american-sign-language-detection-using-computer-vision

The project is about translating American Sign Language into English language. It uses Computer Vision and Deep Learning to predict the ASL alphabet and forms sentences on the basis of prediction. It uses text to speech to convert the predicted word into speech. The project was implemented at MNNIT Hack36 Allahabad Hackathon.

computer-vision deeplearning keras machine-learning neural-network opencv tensorflow texttospeech

Last synced: 27 Jun 2025

https://github.com/fbruzzesi/sklearn-smithy

Toolkit to forge scikit-learn compatible estimators

cli data-science machine-learning python scikit-learn webui

Last synced: 16 Sep 2025

https://github.com/kenethgarcia/classipygrb

This repository contains all the updates, code, and documentation related to ClassiPyGRB.

astrophysics grbs machine-learning machine-learning-algorithms unsupervised-learning unsupervised-machine-learning

Last synced: 30 Oct 2025

https://github.com/trainingbypackt/machine-learning-fundamentals

Use Python and scikit-learn to get up and running with the hottest developments in machine learning

jupyter-notebook machine-learning python3 scikit-learn

Last synced: 08 Mar 2026

https://github.com/mcarlomagno/cardrivingresnet

🚗 Browser game where a vehicle is driven through the camera using the ResNet model (Residual Network) to estimate the position of the hands.

css html javascript machine-learning posenet residual-network resnet-model tensorflow tensorflowjs vehicle

Last synced: 28 Jun 2025

https://github.com/trainingbypackt/professional-azure-sql-database-administration-second-edition

Equip yourself with the skills required to manage and maintain data on the Cloud.

azure database dtu georeplication machine-learning powershell sql ssms

Last synced: 03 Jul 2025

https://github.com/apssouza22/computer-vision

A collection of computer vision projects

computer-vision deep-learning machine-learning

Last synced: 18 Jan 2026

https://github.com/stanislavgrigoriev/easycntk

C# library for easy Deep Learning and Deep Reinforcement Learning. It is wrapper over C# CNTK API. Has implementation of layers (LSTM, Convolution etc.), optimizers, losses, shortcut-connections, sequential model, sequential multi-output model, agent teachers, policy gradients, actor-critic etc. Contains helpers for work with dataset (split, statistics, SMOTE etc). Allows train, evaluate and inference deep neural networks in style similar to Keras.

c-sharp cntk cognitive-toolkit deep-learning deep-learning-library deep-neural-networks machine-learning

Last synced: 02 Jul 2025

https://github.com/brandondocusen/cntxtjv

A discovery and compression tool for your Java codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your project #LLM #AI #Java #CodeAnalysis #ContextWindow #DeveloperTools #StaticAnalysis #CodeVisualization

architecture-insights code-documentation code-visualization codebase-analysis context-management context-window dependency-analysis dependency-mapping developer-tools knowledge-graph large-language-models llm llm-integration machine-learning module-relationships open-source python python-frameworks spring token-reduction

Last synced: 10 Apr 2025

https://github.com/cassiobotaro/sentibol

:soccer: Notebook feito para analisar o case do Sentibol

machine-learning python3 sentiment-analysis

Last synced: 26 Apr 2025

https://github.com/google-research/slip

SLIP is a sandbox environment for engineering protein sequences with synthetic fitness functions.

computational-biology machine-learning protein-design

Last synced: 24 Apr 2025

https://github.com/mad-lab-fau/tpcp

Pipeline and Dataset helpers for complex algorithm evaluation.

algorithms biosignals data-management data-science machine-learning python

Last synced: 04 Feb 2026

https://github.com/amirabbasasadi/rockyml

⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems

cplusplus cpp deep-learning deep-neural-networks distributed-computing high-performance machine-learning mpi optimization parallel-computing scientific-computing

Last synced: 15 Jun 2025

https://github.com/millengustavo/epileptic-seizure-recognition

Repository with Machine Learning techniques to identify Epileptic Seizure from EEG data.

brain-activity deep-learning eeg-data eeg-signal epilepsy machine-learning

Last synced: 10 Apr 2025

https://github.com/future-ai-org/desci-ai-experiments-py

👾 resources and experiments for autonomous agents (e.g., language models, reinforcement learning, energy based models)

autism chatgpt deep-learning gpt-3 machine-learning markov-chain openai quantum-ai reinforcement-learning rust supervised-machine-learning

Last synced: 30 Apr 2026

https://github.com/fieg/knn

k-Nearest Neighbors algorithm in PHP

k-nearest-neighbours knn machine-learning php

Last synced: 07 Jul 2025

https://github.com/pourmand1376/persiancrawler

Open source crawler for Persian websites.

crawler machine-learning news python scrapy tasnim text-classification

Last synced: 28 Oct 2025

https://github.com/mskcc/mimsi

Microsatellite Instability Classification using Multiple Instance Learning

cancer-genomics deep-learning genomics machine-learning multiple-instance-learning

Last synced: 30 Apr 2025

https://github.com/zetavg/twlm

Taiwanese Mandarin LLM Project

llm machine-learning

Last synced: 20 Aug 2025

https://github.com/abhaskumarsinha/minimalgpt

MinimalGPT is a concise, adaptable, and streamlined code framework that encompasses the essential components necessary for the construction, training, inference, and fine-tuning of the GPT model. This framework is implemented exclusively using Keras and TensorFlow, ensuring compatibility and coherence within the broader deep learning ecosystem.

ai artificial-intelligence fine-tuning generative-model gpt gpt-2 gpt-models keras keras-tensorflow language-model llm machine-learning neural-network nlp nlp-machine-learning tensorflow tensorflow2 training transformer transformer-architecture

Last synced: 11 Jul 2025

https://github.com/mapbox/gabbar

Guarding OpenStreetMap from harmful edits using machine learning

banished jupyter-notebook machine-learning openstreetmap scikit-learn vandalism

Last synced: 12 Apr 2025

https://github.com/polakowo/mlprojects

Some of my ML projects and Kaggle competitions

fastai machine-learning pytorch sklearn tensorflow

Last synced: 07 May 2025

https://github.com/tallamjr/barberbook

Solutions for David Barber's "Bayesian Reasoning and Machine Learning" Book

bayesian-methods book machine-learning

Last synced: 21 Jun 2025

https://github.com/srzstephen/disavu

A disaster response solution that helps allocate resources to where they're needed.

amazon-sagemaker-lab aws disaster-response fastai2 machine-learning python rust rust-lang sagemaker-studio-lab typescript

Last synced: 11 Apr 2025

https://github.com/pb2204/teachable-machine

This Is A Very Simple Live Teachable Machine Using TensorFlow.JS

ai deep-learning google machine-learning programming project teachable-machine

Last synced: 16 Jul 2025

https://github.com/octalpixel/skin-extraction-from-image-and-finding-dominant-color

Project is an implementation of skin segmentation using OpenCV and dominant color extraction using SciKit-Learn

image-processing kmeans kmeans-clustering machine-learning opencv python scikit-learn

Last synced: 12 Apr 2025

https://github.com/ashwinpn/containers--ml-and-cloud-computing

Everything about how to deploy projects on the cloud, run ML workloads on the HPC cluster and on the cloud and the efficient configuration and management of related collaborative platforms [e.g. container orchestration].

aws cloud cloud-computing containers deep-learning deployment docker docker-image dockerfile google google-cloud-platform k8s-cluster kubernetes kubernetes-cluster machine-learning pytorch vagrant

Last synced: 13 Apr 2025

https://github.com/src-d/models

Machine learning models for MLonCode trained using the source{d} stack

machine-learning mlosc model nlp source-code

Last synced: 17 Feb 2026

https://github.com/bbva/mercury-robust

mercury-robust is a framework to perform robust testing on ML models and datasets. It provides a collection of test that are easy to configure and helpful to guarantee robustness in your ML processes.

machine-learning python robustness testing

Last synced: 21 Jun 2025

https://github.com/ybubnov/torch_geopooling

The geospatial pooling modules for neural networks in PyTorch

deep-learning deep-learning-library geospatial machine-learning python pytorch

Last synced: 01 Sep 2025

https://github.com/sanidhyy/quill

Quill is an open-source software to make chatting to your PDF files easy.

ai artifical-intelligence css html js kinde machine-learning next nextjs quill react shadcn-ui tailwindcss typescript vercel

Last synced: 13 Apr 2025

https://github.com/apexrl/codail

Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>

imitation-learning machine-learning multi-agent-reinforcement-learning

Last synced: 07 Apr 2025

https://github.com/xai-demonstrator/xai-demonstrator

The XAI Demonstrator is a modular platform that lets users interact with production-grade Explainable AI (XAI) systems.

ai docker explainable-ai fastapi huggingface-transformers keras machine-learning mobile-first pytorch tensorflow vuejs xai

Last synced: 01 Sep 2025

https://github.com/scikit-multilearn-ng/scikit-multilearn-ng

A new maintained "successor" to scikit-multilearn, a scikit-learn based module for multi-label et. al. classification

classification clustering label-prediction machine-learning multi-label multi-label-classification partitioning scikit scikit-learn scikit-multilearn

Last synced: 21 Aug 2025

https://github.com/mohammed-majid/ml_roadmap

Comprehensive Machine Learning Roadmap

algorithms data-science deep-learning machine-learning roadmap

Last synced: 06 Mar 2025

https://github.com/changhuixu/lstm-sentiment-analysis

LSTM sentiment analysis. Please look at my another repo for SVM and Naive algorithem

dictionary l1-lstm lstm lstm-layes machine-learning review sentiment sentiment-analysis

Last synced: 20 Aug 2025

https://github.com/kentonishi/jtr-cvpr-2024

[CVPR 2024] Joint-Task Regularization for Partially Labeled Multi-Task Learning

computer-vision cvpr2024 machine-learning multitask-learning

Last synced: 23 Mar 2025

https://github.com/bloomberg/mixce-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

language-model machine-learning nlp python pytorch transformer

Last synced: 07 May 2025

https://github.com/kyosek/ngboost-experiments

Play around with NGBoost and compare with LightGBM and XGBoost

boosting-algorithms house-price-prediction lightgbm machine-learning ngboost

Last synced: 15 May 2025

https://github.com/bastian/abstractive-summarization-of-meetings

The source code for my bachelor's thesis "Abstractive Summarization of Meetings"

abstractive-summarization bachelor-thesis bert machine-learning tensorflow texar transformer

Last synced: 23 Mar 2025

https://github.com/yhtang/graphdot

GPU-accelerated Marginalized Graph Kernel with customizable node and edge features; Gaussian process regression.

cheminformatics cuda gpu graph-algorithms machine-learning python

Last synced: 15 Apr 2025

https://github.com/owlbarn/owl_symbolic

Connect Owl with other accelerators and numerical frameworks with symbolic maths

algebra gpu-computing machine-learning neural-networks numerical onnx scientific-computing symbolic-math

Last synced: 22 Jun 2025

https://github.com/c99koder/AudioClassifier-MQTT

Use the yamnet TensorFlow model to classify live audio from a microphone and publish the predicted results to Home Assistant via MQTT

audio audio-analysis home-assistant machine-learning mqtt python3 tensorflow-lite yamnet

Last synced: 07 Apr 2025

https://github.com/robertklee/kitti-roadseg

A course project for road segmentation using a U-Net Convolutional Neural Network on the KITTI ROAD 2013 dataset

computer-vision image-segmentation kitti-dataset machine-learning neural-network road-segmentation

Last synced: 12 Sep 2025

https://github.com/pkestene/ms-hpc-ai-gpu

resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI

cuda deep-learning gpu gpu-computing machine-learning physics-informed-neural-networks pinn pinns

Last synced: 19 Aug 2025

https://github.com/yashksaini-coder/codsoft

This was a simple virtual internship where i mainly created machine learning models to perform tasks like Classification & Prediction

codsoft codsoft-internship codsoftinternship machine-learning machine-learning-algorithms

Last synced: 11 Apr 2025

https://github.com/fffaraz/ghsom-cpp

Growing Hierarchical Self-Organizing Map (GHSOM) implementation in C++

ai clustering cpp ghsom machine-learning qt som

Last synced: 10 Apr 2025

https://github.com/iitzco/deepzoo

:wolf: :tiger: :whale2: :elephant: :monkey: Deep Learning model Zoo

artificial-intelligence deep-learning machine-learning modelzoo python

Last synced: 05 May 2025

https://github.com/rasbt/b3-basic-batchsize-benchmark

Experiments for the blog post "No, We Don't Have to Choose Batch Sizes As Powers Of 2"

deep-learning deep-neural-networks machine-learning neural-networks

Last synced: 05 May 2025

https://github.com/pedrohserrano/machine-learning-use-cases

Machine Learning Notebooks with Turicreate and Keras in a Docker Container

deep-learning docker machine-learning notebook

Last synced: 10 Apr 2025

https://github.com/nasaharvest/crop-maml

Learning to predict crop type from heterogeneous sparse labels using meta-learning

agriculture machine-learning meta-learning remote-sensing

Last synced: 06 Mar 2026

https://github.com/dhruvesh13/audio-genre-classification

Automatic music genre classification using Machine Learning algorithms like- Logistic Regression and K-Nearest Neighbours

audio-gene logistic-regression machine-learning mfcc python-2

Last synced: 22 Aug 2025

https://github.com/fschlatt/clubs

python poker engine for arbitrary community card poker games

blinds clubs community-cards machine-learning poker poker-game python raise-sizes reinforcement-learning

Last synced: 16 Jan 2026

https://github.com/ahmetfurkandemir/online-istanbul-applied-data-science-102-bootcamp

Online Istanbul Applied Data Science 102 Bootcamp (Start : 15 August, Finish : 7 November)

bootcamp data-science deep-learning kodluyoruz machine-learning

Last synced: 15 Apr 2025

https://github.com/cgrassin/keyboard_audio_hack

Python proof-of-concept for breaking passwords with a microphone, using machine learning.

hacking machine-learning proof-of-concept python3

Last synced: 05 May 2025

https://github.com/jan-janssen/gmailsorter

Similarity based email sorting for Google Mail using RandomForest classifiers

google-mail-api machine-learning python random-forest-classifier

Last synced: 11 Apr 2026

https://github.com/alvertogit/bigdata_docker

Big Data Docker Data Science Spark Spark4 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook

big-data data-science docker jupyter-lab jupyter-notebook machine-learning python scala spark spark4

Last synced: 10 Mar 2026