{"id":17293701,"url":"https://github.com/shink/spark-ml-algorithm-docker","last_synced_at":"2026-05-08T18:32:19.223Z","repository":{"id":108150469,"uuid":"431397996","full_name":"shink/spark-ml-algorithm-docker","owner":"shink","description":"Spark ML algorithms on docker","archived":false,"fork":false,"pushed_at":"2022-02-03T03:53:43.000Z","size":208,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-26T19:46:09.329Z","etag":null,"topics":["docker-image","spark","spark-ml"],"latest_commit_sha":null,"homepage":"","language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/shink.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-11-24T08:08:29.000Z","updated_at":"2022-02-01T11:27:27.000Z","dependencies_parsed_at":"2023-03-13T14:28:57.764Z","dependency_job_id":null,"html_url":"https://github.com/shink/spark-ml-algorithm-docker","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/shink/spark-ml-algorithm-docker","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shink%2Fspark-ml-algorithm-docker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shink%2Fspark-ml-algorithm-docker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shink%2Fspark-ml-algorithm-docker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shink%2Fspark-ml-algorithm-docker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/shink","download_url":"https://codeload.github.com/shink/spark-ml-algorithm-docker/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shink%2Fspark-ml-algorithm-docker/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32792048,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-08T08:22:46.396Z","status":"ssl_error","status_checked_at":"2026-05-08T08:22:45.650Z","response_time":54,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["docker-image","spark","spark-ml"],"created_at":"2024-10-15T10:48:43.424Z","updated_at":"2026-05-08T18:32:19.160Z","avatar_url":"https://github.com/shink.png","language":"Scala","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Spark ML Algorithms on Docker\n\n\u003cp align=\"center\"\u003e\n    \u003ca href=\"https://github.com/shink/spark-ml-algorithm-docker/actions/workflows/release.yml\"\u003e\u003cimg src=\"https://github.com/shink/spark-ml-algorithm-docker/workflows/Release/badge.svg\" /\u003e\u003c/a\u003e\n    \u003ca href=\"LICENSE\"\u003e\u003cimg src=\"https://img.shields.io/github/license/shink/spark-ml-algorithm-docker.svg\" /\u003e\u003c/a\u003e\n    \u003cimg src=\"https://img.shields.io/badge/language-scala-C22D40.svg\" /\u003e\n    \u003cimg src=\"https://img.shields.io/github/v/release/shink/spark-ml-algorithm-docker\" /\u003e\n\u003c/p\u003e\n\n\u003cp align=\"center\"\u003e\n    \u003ca href=\"docs/README_zh.md\"\u003e\u003cb\u003e中文文档\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://hub.docker.com/u/tsund\"\u003e\u003cb\u003eDocker Hub\u003c/b\u003e\u003c/a\u003e •\n    \u003ca href=\"https://github.com/shink?tab=packages\"\u003e\u003cb\u003eGitHub Packages\u003c/b\u003e\u003c/a\u003e\n\u003c/p\u003e\n\n## Algorithms\n\n- [KMeans](kmeans)\n- [Latent Dirichlet Allocation](lda)\n- [Gaussian Mixture Model](gmm)\n- [Binomial Logistic Regression](binomial-logistic-regression)\n- [Multinomial Logistic Regression](multinomial-logistic-regression)\n- [Decision Tree Classification](decision-tree-classification)\n- [Random Forest Classification](random-forest-classification)\n- [Gradient-boosted Tree Classification](gradient-boosted-tree-classification)\n- [Isotonic Regression](isotonic-regression)\n- [Factorization Machines Regression](factorization-machines-regression)\n- [Naive Bayes](naive-bayes)\n- [Linear Regression](linear-regression)\n\n## Development\n\nRequirements:\n\n- JDK 8+\n- Maven 3+\n- Docker 19+\n- Hadoop 2+\n- Spark 3+\n\nCompile and build:\n\n```shell\nmvn clean package -DskipTests\n```\n\nBuild docker image:\n\n```shell\nmvn clean package -DskipTests -Pdocker\n```\n\n## References\n\n[v3.1.2 ml-guide](https://spark.apache.org/docs/3.1.2/ml-guide.html)\n\n[examples on GitHub](https://github.com/apache/spark/tree/master/examples/src/main/scala/org/apache/spark/examples/ml)\n\n## License\n\n[MIT](LICENSE)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshink%2Fspark-ml-algorithm-docker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fshink%2Fspark-ml-algorithm-docker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshink%2Fspark-ml-algorithm-docker/lists"}