https://github.com/OryxProject/oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
https://github.com/OryxProject/oryx
apache-kafka apache-spark cloudera java kafka lambda-architecture machine-learning oryx
Last synced: 8 months ago
JSON representation
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
- Host: GitHub
- URL: https://github.com/OryxProject/oryx
- Owner: OryxProject
- License: apache-2.0
- Archived: true
- Created: 2014-07-25T20:08:44.000Z (over 11 years ago)
- Default Branch: master
- Last Pushed: 2021-08-16T11:16:27.000Z (over 4 years ago)
- Last Synced: 2024-07-30T21:39:27.752Z (over 1 year ago)
- Topics: apache-kafka, apache-spark, cloudera, java, kafka, lambda-architecture, machine-learning, oryx
- Language: Java
- Homepage: http://oryx.io
- Size: 7.12 MB
- Stars: 1,787
- Watchers: 208
- Forks: 405
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-java-zh - Oryx 2 - 用于构建实时,大规模机器学习应用程序的框架。包括用于协同过滤、分类、回归和聚类的端到端应用程序。 (项目 / 机器学习)
- useful-java-links - Oryx 2 - Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning. License: [Apache 2](http://www.apache.org/licenses/LICENSE-2.0) , [](https://github.com/OryxProject/oryx).  (II. Databases, search engines, big data and machine learning / 7. Big data)
- awesome-bigdata - Oryx - Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning. (Machine Learning)
- fucking-awesome-bigdata - Oryx - Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning. (Machine Learning)
- fucking-awesome-java - Oryx 2 - Framework for building real-time, large-scale machine learning applications. Includes end-to-end applications for collaborative filtering, classification, regression, and clustering. (Projects / Machine Learning)
- awesome-hadoop - Oryx 2 - Lambda architecture on Spark, Kafka for real-time large scale machine learning (Machine learning and Big Data analytics)
- awesome-java - Oryx 2 - Framework for building real-time, large-scale machine learning applications. Includes end-to-end applications for collaborative filtering, classification, regression, and clustering. (Projects / Machine Learning)
README

Oryx 2 is a realization of the lambda architecture built on [Apache Spark](http://spark.apache.org)
and [Apache Kafka](http://kafka.apache.org), but with specialization for real-time large scale machine
learning. It is a framework for building applications, but also includes packaged, end-to-end
applications for collaborative filtering, classification, regression and clustering.
Proceed to the [Oryx 2 site](http://oryx.io/) for full documentation.
Just looking to deploy a ready-made, end-to-end application for collaborative filtering, clustering or classification? Easy.
Proceed directly to:
- Prepare your Hadoop cluster with [Cluster Setup](http://oryx.io/docs/admin.html)
- Get a [Release](https://github.com/OryxProject/oryx/releases)
- Prepare a config file from the [Configuration Reference](http://oryx.io/docs/endusers.html#Configuration)
- Run the binaries with [Running Oryx](http://oryx.io/docs/endusers.html#Running)
- Learn about the REST API endpoints you can call in the [API Endpoint Reference](http://oryx.io/docs/endusers.html#API_Endpoint_Reference)
Developers can consume Oryx 2 as a framework for building custom applications as well.
Following the architecture overview below, proceed to
[Making an Oryx App](http://oryx.io/docs/developer.html#Making_an_Oryx_App)
to learn how to create a new application. You can review a [module diagram](https://sourcespy.com/github/oryx/)
as well to understand the project structure.

------
[](https://travis-ci.org/OryxProject/oryx)
[](https://scan.coverity.com/projects/2697)
[](https://codecov.io/github/OryxProject/oryx?branch=master)