Plan 04 - Getting started with Spark, develop word count program and Spark execution life cycle


By this time you should be comfortable with basic Scala programming and Scala collections.

Here are the topics you will learn over the next couple of weeks:

  • Getting started with Spark
  • Understand transformations and actions in Spark
  • Develop a word count program using the Spark shell or the sbt console
  • Develop the program using IntelliJ with Scala and SBT, or Eclipse with Scala IDE and SBT
  • Build a jar file and ship it to the cluster
  • Run the job on the cluster
  • Understand the execution life cycle
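
To make the topics above concrete, here is a minimal sketch of a word count program in Scala. It is only an illustration, not the exact code from the videos: the object name WordCount, the helper countWords, and the sample sentences are assumptions made for this example, and the master is hardcoded to local[*] so it can run on a laptop. It assumes Spark core is available on the classpath (for example as an sbt dependency).

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical object name, chosen for this sketch only.
object WordCount {

  // Counts words in the given lines using RDD operations.
  // flatMap, filter, map and reduceByKey are transformations: they are lazy
  // and only describe the computation. collect is an action: it triggers
  // the job and brings the results back to the driver.
  def countWords(sc: SparkContext, lines: Seq[String]): Map[String, Int] = {
    val counts = sc.parallelize(lines)
      .flatMap(line => line.split("\\s+")) // split each line into words
      .filter(word => word.nonEmpty)       // drop empty tokens from leading whitespace
      .map(word => (word, 1))              // pair each word with a count of 1
      .reduceByKey(_ + _)                  // add up the counts per word
    counts.collect().toMap
  }

  def main(args: Array[String]): Unit = {
    // Assumption: local mode for experimentation. For a cluster run you would
    // normally leave out setMaster and pass the master when submitting the job.
    val conf = new SparkConf().setAppName("WordCount").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Sample input made up for this sketch; a real job would typically
      // read a file with sc.textFile instead.
      val result = countWords(sc, Seq("to be or not to be", "that is the question"))
      result.toSeq.sortBy(_._1).foreach { case (word, count) =>
        println(s"$word\t$count")
      }
    } finally {
      sc.stop()
    }
  }
}
```

In the Spark shell a SparkContext is already available as sc, so you can paste just the chain of transformations and the action and skip the SparkConf setup. Keeping the logic in a separate function is a design choice for this sketch: it lets you try the same code from the shell, the sbt console, or a packaged jar.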

Here is the playlist and you should watch videos 5 and 6:

Next Session: Understand HDFS commands and develop programs using Spark APIs.