Fast Data Processing with Spark

The current planned chapters are
  • Installing Spark & Setting up your cluster
  • Using the Spark Shell
  • Building & running a spark job (with and without maven/sbt)
  • Creating a spark context
  • Creating & Saving an RDD
  • Manipulating your RDD
  • Using Spark with Hive
  • Testing
  • Tips and Tricks