https://github.com/geektimus/spark-training
Project containing all the exercises related to the spark, some of them with explanations.
https://github.com/geektimus/spark-training
Last synced: 25 days ago
JSON representation
Project containing all the exercises related to the spark, some of them with explanations.
- Host: GitHub
- URL: https://github.com/geektimus/spark-training
- Owner: geektimus
- Created: 2017-01-11T20:20:59.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2025-01-08T17:54:33.000Z (4 months ago)
- Last Synced: 2025-01-08T18:38:01.605Z (4 months ago)
- Language: Scala
- Size: 70.3 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Spark Certification Preparation
## Exercises
**TextAnalyzer:** Find the top n most common words in a text file
```shell
java -jar target/spark-certification-1.0-SNAPSHOT-dev.jar text-file.txt number-of-words
```Notes:
* We need to improve the parameter handling in the main class.
## Tests
To run the tests we just run.
```sbt test```