awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
https://github.com/ml13571/awesome-machine-learning
Last synced: 10 days ago
Scala
General-Purpose Machine Learning
- NDScala - N-dimensional arrays in Scala 3. Think NumPy ndarray, but with compile-time type-checking/inference over shapes, tensor/axis labels, and numeric data types.
- Scalding - A Scala API for Cascading.
- Summing Bird - Streaming MapReduce with Scalding and Storm.
- Algebird - Abstract Algebra for Scala.
- xerial - Data management utilities for Scala. **[Deprecated]**
- PredictionIO - A machine learning server for software developers and data engineers.
- BIDMat - CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.
- Spark Notebook - Interactive and Reactive Data Science using Scala and Spark.
- ONNX-Scala - An ONNX (Open Neural Network eXchange) API and backend for typeful, functional deep learning in Scala (3).
- DeepLearning.scala - Creating statically typed dynamic neural networks from object-oriented & functional programming constructs.
- Conjecture - Scalable Machine Learning in Scalding.
- brushfire - Distributed decision tree ensemble learning in Scala.
- ganitha - Scalding powered machine learning. **[Deprecated]**
- adam - A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
- bioscala - Bioinformatics for the Scala programming language.
- BIDMach - CPU and GPU-accelerated Machine Learning Library.
- H2O Sparkling Water - H2O and Spark interoperability.
- Saul - Flexible Declarative Learning-Based Programming.
- doddle-model - An in-memory machine learning library built on top of Breeze. It provides immutable objects and exposes its functionality through a scikit-learn-like API.
- TensorFlow Scala - Strongly-typed Scala API for TensorFlow.
- Figaro - A Scala library for constructing probabilistic models.
- DynaML - Scala Library/REPL for Machine Learning Research.
Scheme
General-Purpose Machine Learning
- layer - Neural network inference from the command line, implemented in [CHICKEN Scheme](https://www.call-cc.org/).
Swift
General-Purpose Machine Learning
- Bender - Fast Neural Networks framework built on top of Metal. Supports TensorFlow models.
- Swift AI - Highly optimized artificial intelligence and machine learning library written in Swift.
- Swift for TensorFlow - A next-generation platform for machine learning, incorporating the latest research across machine learning, compilers, differentiable programming, systems design, and beyond.
- BrainCore - The iOS and OS X neural network framework.
- swix - A bare bones library that includes a general matrix language and wraps some OpenCV for iOS development. **[Deprecated]**
- AIToolbox - A toolbox framework of AI modules written in Swift: Graphs/Trees, Linear Regression, Support Vector Machines, Neural Networks, PCA, KMeans, Genetic Algorithms, MDP, Mixture of Gaussians.
- MLKit - A simple Machine Learning Framework written in Swift. Currently features Simple Linear Regression, Polynomial Regression, and Ridge Regression.
- Swift Brain - The first neural network / machine learning library written in Swift. This is a project for AI algorithms in Swift for iOS and OS X development. It includes algorithms focused on Bayes' theorem, neural networks, SVMs, matrices, etc.
- Perfect TensorFlow - Swift language bindings for TensorFlow, using native TensorFlow models on both macOS and Linux.
- PredictionBuilder - A library for machine learning that builds predictions using linear regression.
- Awesome CoreML - A curated list of pretrained CoreML models.
- Awesome Core ML Models - A curated list of machine learning models in CoreML format.
TensorFlow
General-Purpose Machine Learning
- Awesome Keras - A curated list of awesome Keras projects, libraries and resources.
- Awesome TensorFlow - A list of all things related to TensorFlow.
- Golden TensorFlow - A page of content on TensorFlow, including academic papers and links to related topics.
Tools
General-Purpose Machine Learning
- Wallaroo.AI - Production AI platform for deploying, managing, and observing any model at scale across any environment from cloud to edge. Lets you go from Python notebook to inferencing in minutes.
- Infinity - The AI-native database built for LLM applications, providing incredibly fast vector and full-text search. Developed using C++20.
- Synthical - AI-powered collaborative research environment. You can use it to get recommendations of articles based on reading history, simplify papers, find out what articles are trending, search articles by meaning (not just keywords), create and share folders of articles, see lists of articles from specific companies and universities, and add highlights.
- Weaviate - Vector search engine and vector database. Weaviate uses machine learning to vectorize and store data, and to find answers to natural language queries. With Weaviate you can also bring your custom ML models to production scale.
- txtai - Build semantic search applications and workflows.
- MLReef - MLReef is an end-to-end development platform using the power of git to give structure and deep collaboration possibilities to the ML development process.
- Chroma - The AI-native open-source embedding database.
- Pinecone - Vector database for applications that require real-time, scalable vector embedding and similarity search.
- ML Workspace - All-in-one web-based IDE for machine learning and data science. The workspace is deployed as a Docker container and is preloaded with a variety of popular data science libraries (e.g., TensorFlow, PyTorch) and dev tools (e.g., Jupyter, VS Code).
- Notebooks - A starter kit for Jupyter notebooks and machine learning. Companion Docker images consist of all combinations of Python versions, machine learning frameworks (Keras, PyTorch and TensorFlow) and CPU/CUDA versions.
- DVClive - Python library for experiment metrics logging into simply formatted local files.
- Kedro - Kedro is a data and development workflow framework that implements best practices for data pipelines with an eye towards productionizing machine learning models.
- guild.ai - Tool to log, analyze, compare and "optimize" experiments. It is cross-platform and framework independent, and provides integrated visualizers such as TensorBoard.
- Sacred - Python tool to help you configure, organize, log and reproduce experiments. Like a lab notebook in chemistry or biology. The community has built multiple add-ons leveraging the proposed standard.
- MLFlow - Platform to manage the ML lifecycle, including experimentation, reproducibility and deployment. Framework and language agnostic; take a look at all the built-in integrations.
- MachineLearningWithTensorFlow2ed - A book on general-purpose machine learning techniques: regression, classification, unsupervised clustering, reinforcement learning, autoencoders, convolutional neural networks, RNNs, and LSTMs, using TensorFlow 1.14.1.
- m2cgen - A tool that allows the conversion of ML models into native code (Java, C, Python, Go, JavaScript, Visual Basic, C#, R, PowerShell, PHP, Dart) with zero dependencies.
- CML - A library for doing continuous integration with ML projects. Use GitHub Actions and GitLab CI to train and evaluate models in production-like environments and automatically generate visual reports with metrics and graphs in pull/merge requests. Framework and language agnostic.
- Pythonizr - An online tool to generate boilerplate machine learning code that uses scikit-learn.
- Flyte - Flyte makes it easy to create concurrent, scalable, and maintainable workflows for machine learning and data processing.
- MLEM - Version and deploy your ML models following GitOps principles.
- DockerDL - Ready-to-use deep learning Docker images.
- Ambrosia - Ambrosia helps you clean up your LLM datasets using _other_ LLMs.
- milvus - Vector database for production AI, written in Go and C++, scalable and blazing fast for billions of embedding vectors.
- VDP - open source visual data ETL to streamline the end-to-end visual data processing pipeline: extract unstructured visual data from pre-built data sources, transform it into analysable structured insights by Vision AI models imported from various ML platforms, and load the insights into warehouses or applications.
- DVC - Data Science Version Control is an open-source version control system for machine learning projects with pipelines support. It makes ML projects reproducible and shareable.
- Chaos Genius - ML powered analytics engine for outlier/anomaly detection and root cause analysis.
- Qdrant
- CatalyzeX - Browser extension ([Chrome](https://chrome.google.com/webstore/detail/code-finder-for-research/aikkeehnlfpamidigaffhfmgbkdeheil) and [Firefox](https://addons.mozilla.org/en-US/firefox/addon/code-finder-catalyzex/)) that automatically finds and shows code implementations for machine learning papers anywhere: Google, Twitter, arXiv, Scholar, etc.
Programming Languages
Categories
- Python: 384
- R: 87
- JavaScript: 72
- Julia: 59
- C++: 57
- Java: 53
- Go: 45
- Lua: 44
- Clojure: 44
- Tools: 29
- Scala: 28
- Ruby: 25
- Matlab: 18
- Rust: 14
- .NET: 13
- Swift: 12
- SAS: 11
- C: 8
- Objective C: 8
- Common Lisp: 7
- Books: 7
- Perl: 6
- Haskell: 6
- PHP: 5
- OCaml: 4
- Elixir: 4
- Perl 6: 3
- TensorFlow: 3
- Crystal: 2
- Credits: 2
- Fortran: 2
- OpenCV: 1
- Scheme: 1
- Erlang: 1
- Kotlin: 1
- APL: 1
Keywords
- machine-learning: 221
- python: 125
- deep-learning: 98
- data-science: 73
- pytorch: 47
- nlp: 35
- neural-network: 35
- tensorflow: 30
- scikit-learn: 29
- artificial-intelligence: 26
- natural-language-processing: 25
- ml: 21
- julia: 20
- ai: 19
- clojure: 19
- neural-networks: 18
- statistics: 18
- java: 17
- reinforcement-learning: 15
- hyperparameter-optimization: 15
- gpu: 15
- computer-vision: 15
- go: 14
- visualization: 14
- regression: 14
- machine-learning-algorithms: 13
- random-forest: 12
- automl: 12
- data-mining: 12
- numpy: 12
- c-plus-plus: 12
- keras: 12
- data-analysis: 12
- javascript: 12
- deep-neural-networks: 12
- jupyter-notebook: 11
- golang: 11
- xgboost: 11
- big-data: 11
- classification: 10
- cuda: 10
- spark: 10
- scala: 10
- llm: 10
- jupyter: 10
- r: 9
- rust: 9
- machine-learning-library: 8
- feature-engineering: 8
- named-entity-recognition: 8