An open API service indexing awesome lists of open source software.

Machine learning

Machine learning is the practice of teaching a computer to learn. The concept uses pattern recognition, as well as other forms of predictive algorithms, to make judgments on incoming data. This field is closely related to artificial intelligence and computational statistics.

https://yomguithereal.github.io/talisman/

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 15 Nov 2025

https://github.com/iterative/mlem

🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞

cli data-science deployment developer-tools git machine-learning mlem model-registry python

Last synced: 26 Mar 2025

https://github.com/larq/larq

An Open-Source Library for Training Binarized Neural Networks

binarized-neural-networks binder deep-learning keras larq machine-learning python quantized-neural-networks tensorflow

Last synced: 15 May 2025

https://github.com/rust0258/Deeplearning.ai-Natural-Language-Processing-Specialization

This repository contains my full work and notes on Coursera's NLP Specialization (Natural Language Processing) taught by the instructor Younes Bensouda Mourri and Łukasz Kaiser offered by deeplearning.ai

attention-mechanism coursera deep-learning deeplearning-ai encoder-decoder logistic-regression machine-learning naive-bayes neural neural-networks nlp probabilistic-models sequence-models specialization

Last synced: 19 Jul 2025

https://github.com/ShawhinT/YouTube-Blog

Codes to complement YouTube videos and blog posts on Medium.

data-science example-code machine-learning medium-articles youtube

Last synced: 18 Jul 2025

https://github.com/altdeep/causalML

The open source repository for the Causal Modeling in Machine Learning Workshop at Altdeep.ai @ www.altdeep.ai/courses/causalML

causality machine-learning

Last synced: 26 Apr 2025

https://github.com/ben519/mlpb

Machine Learning Problem Bible | Problem Set Here >>

machine-learning python r

Last synced: 14 Jun 2025

https://github.com/aws/studio-lab-examples

Example notebooks for working with SageMaker Studio Lab. Sign up for an account at the link below!

amazon-sagemaker-lab aws deep-learning deploy huggingface inference machine-learning sagemaker sagemaker-studio-lab training

Last synced: 16 May 2025

https://github.com/ashishpatel26/amazing-feature-engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

data-analysis data-mining data-science data-scientists data-visualization deep-learning feature-engineering feature-extraction feature-scaling feature-selection features machine-learning scikit-learn

Last synced: 16 May 2025

https://github.com/ben519/MLPB

Machine Learning Problem Bible | Problem Set Here >>

machine-learning python r

Last synced: 20 Jul 2025

https://github.com/NVIDIA/bionemo-framework

BioNeMo Framework: For building and adapting AI models in drug discovery at scale

drug-discovery gpu machine-learning pytorch

Last synced: 08 Apr 2026

https://github.com/Freemanzxp/GBDT_Simple_Tutorial

python实现GBDT的回归、二分类以及多分类,将算法流程详情进行展示解读并可视化,庖丁解牛地理解GBDT。Gradient Boosting Decision Trees regression, dichotomy and multi-classification are realized based on python, and the details of algorithm flow are displayed, interpreted and visualized to help readers better understand Gradient Boosting Decision Trees

gbdt gnm gradient-boosting gradient-boosting-decision-trees machine-learning

Last synced: 19 Jul 2025

https://github.com/mandiant/stringsifter

A machine learning tool that ranks strings based on their relevance for malware analysis.

fireeye-data-science fireeye-flare learning-to-rank machine-learning malware-analysis reverse-engineering strings

Last synced: 15 May 2025

https://github.com/stevenygd/PointFlow

PointFlow : 3D Point Cloud Generation with Continuous Normalizing Flows

3d-point-clouds computer-vision continuous-normalizing-flows machine-learning pytorch shapes

Last synced: 15 Jul 2025

https://github.com/yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 14 Apr 2025

https://github.com/gordicaleksa/get-started-with-jax

The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well as the content I found useful while learning about the JAX ecosystem.

deep-learning flax haiku jax jupyter lax learn-jax machine-learning numpy optax python tutorial xla

Last synced: 04 Apr 2025

https://github.com/abilzerian/LLM-Prompt-Library

Advanced Code and Text Manipulation Prompts for Various LLMs. Suitable for Siri, GPT-4o, Claude, Llama3, Gemini, and other high-performance open-source LLMs.

ai apple-intelligence artificial-intelligence chatbot chatgpt chatgpt-api gpt gpt-3 gpt-4 machine-learning openai prompt prompt-engineering prompt-injection prompt-toolkit prompting prompts python siri text-analysis

Last synced: 27 Mar 2025

https://github.com/hollance/MobileNet-CoreML

The MobileNet neural network using Apple's new CoreML framework

core-ml ios machine-learning mobilenet swift

Last synced: 11 May 2025

https://github.com/hollance/mobilenet-coreml

The MobileNet neural network using Apple's new CoreML framework

core-ml ios machine-learning mobilenet swift

Last synced: 05 Apr 2025

https://github.com/google-research/rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

benchmarking evaluation-metrics google machine-learning reinforcement-learning rl

Last synced: 08 Apr 2025

https://github.com/jubatus/jubatus

Framework and Library for Distributed Online Machine Learning

c-plus-plus distributed machine-learning ml

Last synced: 04 May 2025

https://github.com/locuslab/qpth

A fast and differentiable QP solver for PyTorch.

deep-learning machine-learning optimization pytorch quadratic-programming

Last synced: 15 May 2025

https://github.com/saltudelft/ml4se

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

ai4code ai4se code datasets deep-learning llm4code machine-learning ml4code ml4se papers research software-engineering theses tools tudelft

Last synced: 01 Feb 2026

https://github.com/tony-framework/TonY

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.

deep-learning hadoop hadoop-yarn horovod machine-learning tensorflow

Last synced: 20 Apr 2025

https://github.com/tony-framework/tony

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.

deep-learning hadoop hadoop-yarn horovod machine-learning tensorflow

Last synced: 03 Jan 2026

https://github.com/kyegomez/longnet

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

artificial-intelligence attention attention-is-all-you-need attention-mechanisms chatgpt context-length gpt3 gpt4 machine-learning transformer

Last synced: 16 May 2025

https://github.com/techascent/tech.ml.dataset

A Clojure high performance data processing system

clojure csv dataframe datascience dataset etl-pipeline java machine-learning xlsx

Last synced: 15 May 2025

https://github.com/Yomguithereal/talisman

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 15 Mar 2025

https://github.com/zkywsg/daily-deeplearning

🔥机器学习/深度学习/Python/大模型/多模态/LLM/deeplearning/Python/Algorithm interview/NLP Tutorial

cv deep-learning leetcode leetcode-python leetcode-solutions llm machine-learning nlp python pytorch pytorch-nlp pytorch-tutorial pytorch-tutorials tensorflow tensorflow-examples tensorflow-tutorials

Last synced: 15 May 2025

https://github.com/aws-samples/aws-mlu-explain

Visual, Interactive Articles About Machine Learning: https://mlu-explain.github.io/

ai aws d3 datavisualization dataviz deep-learning machine-learning machinelearning mlu svelte

Last synced: 14 Apr 2025

https://github.com/stratospark/food-101-keras

Food Classification with Deep Learning in Keras / Tensorflow

ai deep-learning food-classification image-classification keras machine-learning tensorflow

Last synced: 19 Jul 2025

https://github.com/Xtra-Computing/thundergbm

ThunderGBM: Fast GBDTs and Random Forests on GPUs

cuda gbdt gpu machine-learning random-forest

Last synced: 12 Apr 2025

https://github.com/xtra-computing/thundergbm

ThunderGBM: Fast GBDTs and Random Forests on GPUs

cuda gbdt gpu machine-learning random-forest

Last synced: 14 Dec 2025

https://github.com/ashishpatel26/Amazing-Feature-Engineering

Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.

data-analysis data-mining data-science data-scientists data-visualization deep-learning feature-engineering feature-extraction feature-scaling feature-selection features machine-learning scikit-learn

Last synced: 10 Apr 2025

https://github.com/apache/submarine

Submarine is Cloud Native Machine Learning Platform.

ai deep-learning docker kubernetes machine-learning notebook

Last synced: 10 Jan 2026

https://github.com/dgarnitz/vectorflow

VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.

ai data-engineering embeddings machine-learning nlp vectors

Last synced: 14 Dec 2025

https://github.com/projects-developer/50-final-year-projects-with-source-code

Final year projects are a crucial part of a student's academic journey, particularly in the fields of engineering, computer science, and other technical disciplines.50 Final year Projects Includes Source Code, PPT, Synopsis, Report, Documents, Base Research Paper & Video tutorials

b-techprojects bcaprojects blockchain brain-tumor-detection computer-science-projects computerscienceprojects cryptography fake-news-detection fake-product-identification-system final-year-projects finalyearprojects m-techprojects machine-learning malware-detection mcaprojects python-projects web-developement

Last synced: 21 Feb 2026

https://github.com/decalogue/chat

基于自然语言理解与机器学习的聊天机器人,支持多用户并发及自定义多轮对话

algorithm chat chatbot context database graph kb machine-learning natural-language-processing natural-language-understanding neo4j nlp nlu python python3 qa question-answering sentence-similarity

Last synced: 12 Apr 2025

https://github.com/TrainingByPackt/Data-Science-Projects-with-Python

A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn

data-science machine-learning numpy pandas pandas-dataframe python scikit-learn

Last synced: 14 Apr 2025

https://github.com/aleju/mario-ai

Playing Mario with Deep Reinforcement Learning

agent deep-learning deep-reinforcement-learning machine-learning mario reward torch

Last synced: 05 Apr 2025

https://github.com/ika-rwth-aachen/Cam2BEV

TensorFlow Implementation for Computing a Semantically Segmented Bird's Eye View (BEV) Image Given the Images of Multiple Vehicle-Mounted Cameras.

autonomous-vehicles birds-eye-view computer-vision deep-learning ipm machine-learning segmentation sim2real simulation

Last synced: 20 Mar 2025

https://github.com/ibm/ffdl

Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on Kubernetes

ai artificial-intelligence caffe deep-learning deep-neural-networks deeplearning ibm-research-ai jupyter keras kubernetes-cluster machine-learning ml model python pytorch storage tensorflow watson

Last synced: 12 Apr 2025

https://github.com/evilgix/evil

Optical Character Recognition in Swift for iOS&macOS. 银行卡、身份证、门牌号光学识别

cnn-model keras machine-learning ocr swift4 vision

Last synced: 05 Apr 2025

https://github.com/evilgix/Evil

Optical Character Recognition in Swift for iOS&macOS. 银行卡、身份证、门牌号光学识别

cnn-model keras machine-learning ocr swift4 vision

Last synced: 15 May 2025

https://github.com/dformoso/sklearn-classification

Data Science Notebook on a Classification Task, using sklearn and Tensorflow.

classification-task data docker jupyter learning machine machine-learning notebook roc roc-curve science sklearn tensorflow

Last synced: 04 Apr 2025

https://github.com/trainingbypackt/data-science-projects-with-python

A Case Study Approach to Successful Data Science Projects Using Python, Pandas, and Scikit-Learn

data-science machine-learning numpy pandas pandas-dataframe python scikit-learn

Last synced: 04 Apr 2025

https://github.com/raycad/devops-roadmap

DevOps methodology & roadmap for a devops developer in 2019. Interesting books to learn new technologies.

ai big-data books deep-learning devops experience expert-system machine-learning programming

Last synced: 25 Jan 2026

https://github.com/uliontse/mlgb

MLGB is a library that includes many models of CTR Prediction & Recommender System by TensorFlow & PyTorch. 「妙计包」是一个包含50+点击率预估和推荐系统深度模型的、通过TensorFlow和PyTorch撰写的库。

autoint ctr-prediction dcn deep-learning deepfm din dsin dssm edcn esmm fibinet machine-learning masknet mind mmoe pepnet ple pnn recommender-system xdeepfm

Last synced: 15 May 2025

https://github.com/googleapis/nodejs-speech

This repository is deprecated. All of its content and history has been moved to googleapis/google-cloud-node.

machine-learning nodejs speech speech-to-text

Last synced: 14 Mar 2025

https://github.com/kyegomez/LongNet

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

artificial-intelligence attention attention-is-all-you-need attention-mechanisms chatgpt context-length gpt3 gpt4 machine-learning transformer

Last synced: 13 May 2025

https://github.com/pplonski/keras2cpp

This is a bunch of code to port Keras neural network model into pure C++.

keras machine-learning neural-network

Last synced: 15 Mar 2025

https://github.com/inspirehep/magpie

Deep neural network framework for multi-label text classification

classification deep-learning machine-learning multi-label-classification neural-network nlp prediction word2vec

Last synced: 04 Apr 2025

https://github.com/BiomedSciAI/causallib

A Python package for modular causal inference analysis and model evaluations

causal causal-inference causal-models causality data-science machine-learning ml

Last synced: 27 Mar 2025

https://github.com/ankane/eps

Machine learning for Ruby

automl machine-learning rubyml

Last synced: 17 Nov 2025

https://github.com/bytefish/opencv

OpenCV projects: Face Recognition, Machine Learning, Colormaps, Local Binary Patterns, Examples...

face-recognition machine-learning opencv

Last synced: 05 Apr 2025

https://github.com/jerryji1993/DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

deep-learning dnabert-model genome gpu kmer kmer-format machine-learning natural-language-processing nlp sequence

Last synced: 21 Jul 2025

https://github.com/huggingface/exporters

Export Hugging Face models to Core ML and TensorFlow Lite

coreml coremltools deep-learning machine-learning model-converter pytorch tensorflow tflite transformer

Last synced: 14 Oct 2025

https://github.com/koursaros-ai/nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)

cloud deep-learning docker elasticsearch helm kubernetes machine-learning microservices nboost nlp proxy python pytorch search-api search-engine semantic-search tensorflow

Last synced: 30 Mar 2025

https://github.com/tensorflow/decision-forests

A collection of state-of-the-art algorithms for the training, serving and interpretation of Decision Forest models in Keras.

decision-forest decision-trees gradient-boosting interpretability keras machine-learning ml python random-forest tensorflow

Last synced: 14 May 2025

https://github.com/cnkuangshi/LightCTR

Lightweight and Scalable framework that combines mainstream algorithms of Click-Through-Rate prediction based computational DAG, philosophy of Parameter Server and Ring-AllReduce collective communication.

computational-graphs deep-learning distributed-systems factorization-machines machine-learning model-compression parameter-server

Last synced: 15 Mar 2025

https://github.com/google-research/prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

flax jax language-model machine-learning nlp prompt-tuning

Last synced: 15 May 2025

https://github.com/googlecloudplatform/tf-estimator-tutorials

This repository includes tutorials on how to use the TensorFlow estimator APIs to perform various ML tasks, in a systematic and standardised way

machine-learning python tensorflow

Last synced: 16 May 2025

https://github.com/ai-infra-curriculum/ai-infra-engineer-learning

AI Infrastructure Engineer Learning Track - Production ML infrastructure curriculum (2-4 years experience)

ai-infrastructure curriculum gpu intermediate kubernetes learning llm machine-learning mlops production terraform

Last synced: 10 Jun 2026

https://github.com/elastic/eland

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

big-data data-analysis dataframe dataframes eland elasticsearch etl lightgbm machine-learning pandas python scikit-learn time-series-forecasting

Last synced: 14 Apr 2025