https://github.com/wcygan/spark-minio-k8s
Spark + MinIO on k8s
- Host: GitHub
- URL: https://github.com/wcygan/spark-minio-k8s
- Owner: wcygan
- Created: 2024-02-14T05:21:05.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-02-20T04:11:36.000Z (9 months ago)
- Last Synced: 2024-02-20T18:06:25.174Z (9 months ago)
- Language: Scala
- Homepage:
- Size: 87.9 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
# spark-minio-k8s
This project uses Spark and MinIO running on Kubernetes.
## Prerequisites
Make sure you have the following tools installed:
- [kubectl](https://kubernetes.io/docs/tasks/tools/install-kubectl/)
- [helm](https://helm.sh/docs/intro/install/)
- [skaffold](https://skaffold.dev/docs/install/)

Further, you will need a cluster to deploy to. You can use [minikube](https://minikube.sigs.k8s.io/docs/start/) for local development.

## Quickstart
This project uses [Skaffold](https://skaffold.dev/) to build and deploy.
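
Before deploying, it can help to confirm that the prerequisite tools are on your `PATH`. The following is a minimal sketch and not part of this repository; the `check_tools` function name is hypothetical, and it only checks that the commands exist, not their versions.

```shell
#!/bin/sh
# check_tools is a hypothetical helper (not part of this repo).
# It reports whether each named command is on PATH and
# returns non-zero if any of them is missing.
check_tools() {
  missing=0
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "found: $tool"
    else
      echo "missing: $tool"
      missing=1
    fi
  done
  return "$missing"
}

# The three tools listed under Prerequisites.
check_tools kubectl helm skaffold || echo "Install the missing tools before running skaffold."
```

If every tool is reported as found, the deploy command in this section should at least be able to start.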
Deploy the project:
```
skaffold run
```

Delete the deployment:
```
skaffold delete
```

## FAQ
Q: Why didn't my Spark application start immediately?

A: It runs on a cron schedule. For example, `schedule: "@every 1m"` means a run starts every minute, so the first run begins after the first interval elapses rather than as soon as you deploy.

## Notes
I'm recording some notes here for future reference.
- [Spark Setup Notes](documentation/spark-setup-notes.md)
- [MinIO Setup Notes](documentation/minio-setup-notes.md)

## References
- [Running Spark on Kubernetes](https://spark.apache.org/docs/latest/running-on-kubernetes.html)
- [Spark, MinIO and Kubernetes](https://blog.min.io/spark-minio-kubernetes/)
- [Skaffold Documentation](https://skaffold.dev/docs/)