I have to compare data from csv and a hive table in pyspark. What step I should follow



I have data in sas and then I have exported that data to Hive and now I want to load that data in spark and want to compare both data which I have exported in hive and same data i am loading in spark . How to achieve this ?


Learn Spark 1.6.x or Spark 2.x on our state of the art big data labs

  • Click here for access to state of the art 13 node Hadoop and Spark Cluster