Apache Spark 1.6 - Transform, Stage and Store - Row Level Transformations – map

map

map is used for

  • Perform row level transformations where one record transforms into another record
  • Number of records in input RDD and output RDD will be equal
  • map is typically followed by other APIs used for
    • joining the data
    • performing aggregations
    • sorting etc

Learn Spark 1.6.x or Spark 2.x on our state of the art big data labs

  • Click here for access to state of the art 13 node Hadoop and Spark Cluster