Joining data sets - join, cogroup and cartesian

Originally published at: http://www.itversity.com/topic/joining-data-sets-join-cogroup-and-cartesian/

It is quite common to join multiple data sets. The following transformations can be used to join data:

- join
  - join (inner join)
  - rightOuterJoin
  - leftOuterJoin
  - fullOuterJoin
- cogroup
- cartesian

- All of these transformations require 2 data sets.
- Each data set needs to consist of key-value pairs (except for cartesian).
- The join or grouping happens based on the key of both…
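To make the behaviour of the transformations listed above concrete, here is a minimal sketch in plain Python that imitates their semantics on small lists of key-value tuples. It does not use Spark: the helper functions (`cogroup`, `join`, `left_outer_join`, `right_outer_join`, `full_outer_join`, `cartesian`) and the sample data are hypothetical, written only for illustration, and `cogroup` returns a dict here rather than a distributed collection of `(key, (values, values))` records.

```python
from collections import defaultdict
from itertools import product


def _group_by_key(pairs):
    """Collect the values of each key into a list (illustrative helper)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def cogroup(left, right):
    """For every key in either data set, pair up the values from both sides."""
    l, r = _group_by_key(left), _group_by_key(right)
    return {k: (l.get(k, []), r.get(k, [])) for k in sorted(set(l) | set(r))}


def join(left, right):
    """Inner join: only keys present in both data sets survive."""
    return [(k, (a, b))
            for k, (ls, rs) in cogroup(left, right).items()
            for a in ls for b in rs]


def left_outer_join(left, right):
    """Keep every left record; a missing right value becomes None."""
    return [(k, (a, b))
            for k, (ls, rs) in cogroup(left, right).items()
            for a in ls for b in (rs or [None])]


def right_outer_join(left, right):
    """Keep every right record; a missing left value becomes None."""
    return [(k, (a, b))
            for k, (ls, rs) in cogroup(left, right).items()
            for a in (ls or [None]) for b in rs]


def full_outer_join(left, right):
    """Keep records from both sides, filling gaps with None."""
    return [(k, (a, b))
            for k, (ls, rs) in cogroup(left, right).items()
            for a in (ls or [None]) for b in (rs or [None])]


def cartesian(left, right):
    """Pair every left record with every right record; keys play no role."""
    return list(product(left, right))


# Hypothetical sample data: (customer_id, order) and (customer_id, name).
orders = [(1, "o1"), (1, "o2"), (3, "o3")]
customers = [(1, "Ann"), (2, "Bob")]

inner = join(orders, customers)
left = left_outer_join(orders, customers)
right = right_outer_join(orders, customers)
full = full_outer_join(orders, customers)
grouped = cogroup(orders, customers)
crossed = cartesian(orders, customers)
```

In this sketch the inner join keeps only customer 1, each outer join adds the unmatched records from its side with `None` as the missing value, and `cartesian` produces all 3 x 2 combinations regardless of key.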