An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with apache-spark-framework

A curated list of projects in awesome lists tagged with apache-spark-framework .

https://github.com/bayudwiyansatria/library-java-apache-spark

Apache Spark Libraries. Apache Spark has as its architectural foundation the resilient distributed dataset (RDD), a read-only multiset of data items distributed over a cluster of machines, that is maintained in a fault-tolerant way. The Dataframe API was released as an abstraction on top of the RDD, followed by the Dataset API. In Spark 1.x, the RDD was the primary application programming interface (API), but as of Spark 2.x use of the Dataset API is encouraged even though the RDD API is not deprecated. The RDD technology still underlies the Dataset API.

apache-spark apache-spark-framework apache-spark-library bayudwiyansatria java-library libraries library

Last synced: 28 Feb 2025