https://github.com/apache/samza-hello-samza
Mirror of Apache Samza
https://github.com/apache/samza-hello-samza
big-data samza scala
Last synced: 9 months ago
JSON representation
Mirror of Apache Samza
- Host: GitHub
- URL: https://github.com/apache/samza-hello-samza
- Owner: apache
- License: apache-2.0
- Created: 2014-03-13T07:00:06.000Z (almost 12 years ago)
- Default Branch: master
- Last Pushed: 2024-09-25T14:35:08.000Z (about 1 year ago)
- Last Synced: 2025-03-30T02:05:50.223Z (9 months ago)
- Topics: big-data, samza, scala
- Language: Java
- Size: 5.13 MB
- Stars: 111
- Watchers: 20
- Forks: 134
- Open Issues: 17
-
Metadata Files:
- Readme: README-gradle.md
- License: LICENSE
Awesome Lists containing this project
README
To use gradle to build/run the hello-samza project:
1) the project is configured to download and use gradle version 2.3 - on first task execution, it will download the required gradle jars.
2) download/install yarn/kafka/zookeeper:
$ ./gradlew installGrid
3) build hello-samza job package:
$ ./gradlew distTar
4) deploy hello-samza project to grid:
$ ./gradlew deployHelloSamza
5) start the grid (starts up yarn/kafka/zookeeper):
$ ./gradlew startGrid
6) run the various Samza tasks that are part of hello-samza project:
$ ./gradlew runWikiFeed
$ ./gradlew runWikiParser
$ ./gradlew runWikiStats
7) view all the current Kafka topics:
$ ./gradlew listKafkaTopics
8) view the Kafka topics output by the various Samza tasks:
$ ./gradlew dumpWikiRaw
( output of Kafka topic scrolls by)
CTRL-c
$ ./gradlew dumpWikiEdits
( output of Kafka topic scrolls by)
CTRL-c
$ ./gradlew dumpWikiStats
( output of Kafka topic scrolls by)
CTRL-c
9) stop all the components:
$ ./gradlew stopGrid
Shortcut: using the 'runWiki*' tasks directly will do steps 3-6 automatically.