HDPCD - Transformations and Actions Overview


Originally published at: http://www.itversity.com/topic/hdpcd-transformation-and-actions-overview-scala/

Introduction

Transformations and actions form the Spark Core API. Here are the details:

- Reading and writing from HDFS
- Joining disparate data sets
- Aggregating data sets
- Filtering data
- Sorting and ranking

As part of this, you will learn the Spark APIs mentioned below:

- map, flatMap
- filter
- join
- count, reduce
- countByKey, groupByKey, reduceByKey, aggregateByKey
- and a few more

Lifecycle of an…
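
To make the APIs listed above a little more concrete, here is a minimal sketch in Scala. It is not taken from the original topic: the object name, the `Summary` case class, the sample records, and the `local[2]` master are all assumptions chosen for illustration, and it assumes Spark 1.3 or later is on the classpath so that pair RDD operations such as `join` and `reduceByKey` are available without extra imports. Transformations (`map`, `flatMap`, `filter`, `join`, `reduceByKey`) only describe the computation; the actions (`count`, `reduce`, `countByKey`) are what trigger execution.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical container for the results of the actions below.
case class Summary(
  tokenCount: Long,
  completedCount: Long,
  totalRevenue: Double,
  itemsPerOrder: scala.collection.Map[Int, Long])

object TransformationsAndActionsSketch {

  def summarize(sc: SparkContext): Summary = {
    // Made-up in-memory records standing in for files read from HDFS.
    // orders: "orderId,status"; orderItems: "orderId,amount"
    val orders = sc.parallelize(Seq("1,COMPLETE", "2,PENDING", "3,COMPLETE"))
    val orderItems = sc.parallelize(Seq("1,100.0", "1,50.0", "2,20.0", "3,75.0"))

    // Transformations: lazy, nothing runs yet.
    val tokens = orders.flatMap(_.split(","))            // one element per field
    val completed = orders
      .filter(_.split(",")(1) == "COMPLETE")             // keep completed orders
      .map { rec =>
        val f = rec.split(",")
        (f(0).toInt, f(1))                               // (orderId, status)
      }
    val amounts = orderItems.map { rec =>
      val f = rec.split(",")
      (f(0).toInt, f(1).toDouble)                        // (orderId, amount)
    }
    val revenuePerOrder = amounts.reduceByKey(_ + _)     // sum amounts per order
    val joined = completed.join(revenuePerOrder)         // (orderId, (status, revenue))

    // Actions: each of these triggers a Spark job.
    Summary(
      tokenCount = tokens.count(),
      completedCount = completed.count(),
      totalRevenue = joined.map(_._2._2).reduce(_ + _),
      itemsPerOrder = amounts.countByKey())
  }

  def main(args: Array[String]): Unit = {
    // App name and local master are assumptions for running on a single machine.
    val conf = new SparkConf()
      .setAppName("transformations-actions-sketch")
      .setMaster("local[2]")
    val sc = new SparkContext(conf)
    try println(summarize(sc))
    finally sc.stop()
  }
}
```

In a real job the two `parallelize` calls would typically be replaced by reads from HDFS (for example with `sc.textFile` on a path of your own), and results would be written back rather than printed.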