https://github.com/tupol/spark-learning
Various test/learning bits of code
- Host: GitHub
- URL: https://github.com/tupol/spark-learning
- Owner: tupol
- License: apache-2.0
- Created: 2015-11-01T18:42:28.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2015-11-29T22:29:48.000Z (almost 9 years ago)
- Last Synced: 2023-03-09T02:01:32.686Z (over 1 year ago)
- Language: Scala
- Size: 473 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# spark-learning
Various test/learning bits of code.

## sparcass
Learning about the Spark-Cassandra integration.

### Prerequisites
Install a local Cassandra DB in the simplest way possible: unpack the `tar.gz` archive in your environment.

### Importing GitHub logs data through Spark into Cassandra
- `run-sparcass-import.sh /full/path/to/json/github/log/file.json`: an all-around trivial script.

Inside the script one can configure a few parameters that are sent to the Spark driver.

### Doing a simple Cassandra query through Spark
- `run-sparcass-query.sh`
Inside the script one can configure a few parameters that are sent to the Spark driver.

## spark
Learning about Spark... forgot what it was all about.

Command line run: `run-spark.sh`
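
The `run-*.sh` scripts in this README are described as passing a few configurable parameters to the Spark driver. As a rough sketch of what such a wrapper could look like, the snippet below builds a `spark-submit` command line from a handful of variables. The variable names, default values, jar path and main class are hypothetical and not taken from the repository's scripts; `spark.cassandra.connection.host` is the host setting of the Spark Cassandra Connector, assumed here to be the library in use.

```shell
#!/bin/sh
# Hypothetical wrapper sketch; every name and default below is an assumption
# made for illustration, not the content of the repository's scripts.

# Parameters that can be edited here or overridden from the environment.
SPARK_MASTER="${SPARK_MASTER:-local[*]}"
DRIVER_MEMORY="${DRIVER_MEMORY:-1g}"
CASSANDRA_HOST="${CASSANDRA_HOST:-127.0.0.1}"
APP_JAR="${APP_JAR:-target/app.jar}"           # hypothetical jar path
MAIN_CLASS="${MAIN_CLASS:-example.ImportJob}"  # hypothetical main class

# Build the spark-submit command; extra arguments (for example the path to
# the JSON log file) are appended as application arguments.
submit() {
  set -- spark-submit \
    --master "$SPARK_MASTER" \
    --driver-memory "$DRIVER_MEMORY" \
    --conf "spark.cassandra.connection.host=$CASSANDRA_HOST" \
    --class "$MAIN_CLASS" \
    "$APP_JAR" "$@"

  if [ "${DRY_RUN:-1}" = "1" ]; then
    # Dry run (the default in this sketch): only print the command.
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

submit "$@"
```

With the default `DRY_RUN=1` the script only prints the command it would run; setting `DRY_RUN=0` hands the same arguments to `spark-submit`, which then has to be available on the `PATH`.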