https://github.com/shink/spark-ml-algorithm-docker
Spark ML algorithms on docker
https://github.com/shink/spark-ml-algorithm-docker
docker-image spark spark-ml
Last synced: about 2 months ago
JSON representation
Spark ML algorithms on docker
- Host: GitHub
- URL: https://github.com/shink/spark-ml-algorithm-docker
- Owner: shink
- License: mit
- Created: 2021-11-24T08:08:29.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-02-03T03:53:43.000Z (over 4 years ago)
- Last Synced: 2025-03-26T19:46:09.329Z (over 1 year ago)
- Topics: docker-image, spark, spark-ml
- Language: Scala
- Homepage:
- Size: 203 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Spark ML Algorithms on Docker
中文文档 •
Docker Hub •
GitHub Packages
## Algorithms
- [KMeans](kmeans)
- [Latent Dirichlet Allocation](lda)
- [Gaussian Mixture Model](gmm)
- [Binomial Logistic Regression](binomial-logistic-regression)
- [Multinomial Logistic Regression](multinomial-logistic-regression)
- [Decision Tree Classification](decision-tree-classification)
- [Random Forest Classification](random-forest-classification)
- [Gradient-boosted Tree Classification](gradient-boosted-tree-classification)
- [Isotonic Regression](isotonic-regression)
- [Factorization Machines Regression](factorization-machines-regression)
- [Naive Bayes](naive-bayes)
- [Linear Regression](linear-regression)
## Development
Requirements:
- JDK 8+
- Maven 3+
- Docker 19+
- Hadoop 2+
- Spark 3+
Compile and build:
```shell
mvn clean package -DskipTests
```
Build docker image:
```shell
mvn clean package -DskipTests -Pdocker
```
## References
[v3.1.2 ml-guide](https://spark.apache.org/docs/3.1.2/ml-guide.html)
[examples on GitHub](https://github.com/apache/spark/tree/master/examples/src/main/scala/org/apache/spark/examples/ml)
## License
[MIT](LICENSE)