{"id":14965903,"url":"https://github.com/shifuml/guagua","last_synced_at":"2025-08-24T16:43:22.756Z","repository":{"id":16765555,"uuid":"19523554","full_name":"ShifuML/guagua","owner":"ShifuML","description":"An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.","archived":false,"fork":false,"pushed_at":"2022-05-20T20:52:22.000Z","size":967,"stargazers_count":71,"open_issues_count":57,"forks_count":40,"subscribers_count":22,"default_branch":"master","last_synced_at":"2025-01-31T07:11:49.636Z","etag":null,"topics":["hadoop","in-memory","iterative","machine-learning","yarn"],"latest_commit_sha":null,"homepage":"https://github.com/ShifuML/guagua/wiki","language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ShifuML.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGES.txt","contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-05-07T06:23:40.000Z","updated_at":"2024-02-29T04:48:07.000Z","dependencies_parsed_at":"2022-08-24T13:48:53.531Z","dependency_job_id":null,"html_url":"https://github.com/ShifuML/guagua","commit_stats":null,"previous_names":[],"tags_count":14,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ShifuML%2Fguagua","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ShifuML%2Fguagua/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ShifuML%2Fguagua/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ShifuML%2Fguagua/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ShifuML","download_url":"https://codeload.github.com/ShifuML/guagua/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":238147558,"owners_count":19424284,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hadoop","in-memory","iterative","machine-learning","yarn"],"created_at":"2024-09-24T13:35:32.724Z","updated_at":"2025-02-10T16:30:59.193Z","avatar_url":"https://github.com/ShifuML.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"[\u003cimg src=\"images/guagua_duck_50.png\" alt=\"Guagua\" align=\"left\"\u003e](http://shifu.ml)\u003cdiv align=\"right\"\u003e\u003cdiv\u003e[![Build Status](https://travis-ci.org/ShifuML/guagua.svg)](https://travis-ci.org/ShifuML/shifu)\u003c/div\u003e\u003cdiv\u003e[![Maven Central](https://maven-badges.herokuapp.com/maven-central/ml.shifu/guagua/badge.svg)](https://maven-badges.herokuapp.com/maven-central/ml.shifu/guagua)\u003c/div\u003e\u003c/div\u003e\n\n## Guagua\n\nAn iterative computing framework on both Hadoop MapReduce and Hadoop YARN.\n\n## News\n\nGuagua 0.7.7 is released with a lot of improvements. Check our [changes](https://github.com/ShifuML/guagua/blob/master/CHANGES.txt#L19)\n\n## Conference\n\n[QCON Shanghai 2014](http://2014.qconshanghai.com/node/474) [Slides](http://www.slideshare.net/pengshanzhang/guagua-an-iterative-computing-framework-on-hadoop)\n\n## Getting Started\n\nPlease visit [Guagua wiki site](https://github.com/ShifuML/guagua/wiki) for tutorials.\n\n## What is Guagua?\n**Guagua**, a sub-project of Shifu, is a distributed, pluggable and scalable iterative computing framework based on Hadoop MapReduce and YARN.\n\nThis graph shows the iterative computing process for **Guagua**.\n\n![Guagua Process](images/guagua-process.png)\n\nTypical use cases for **Guagua** are distributed machine learning model training based on Hadoop. By using **Guagua**, we implement distributed neural network algorithm which can reduce model training time from days to hours on 1TB data sets. Distributed neural network algorithm is based on [Encog](http://www.heatonresearch.com/encog) and **Guagua**. Any details please check our example [source code](https://github.com/ShifuML/guagua/tree/master/guagua-mapreduce-examples/src/main/java/ml/shifu/guagua/mapreduce/example/nn).\n\n## Google Group\n\nPlease join [Guagua group](https://groups.google.com/forum/#!forum/shifu-guagua) if questions, bugs or anything else.\n\n## Copyright and License\n\nCopyright 2013-2017, PayPal Software Foundation under the [Apache License V2.0](LICENSE.txt).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshifuml%2Fguagua","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fshifuml%2Fguagua","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshifuml%2Fguagua/lists"}