Creating RDD from HDFS



Originally published at: https://kaizen.itversity.com/topic/cca175-creating-rdd-from-hdfs-scala/

As part of this topic, we will see how to create RDDs from data in different sources:

- Read text data from HDFS using sc.textFile
- Preview data using actions such as count, first, and take
- Read data in other file formats from HDFS using sqlContext.read or load

Reading data using SparkContext

Previewing data using actions

Following…
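
The steps listed in this topic can be sketched in Scala roughly as follows. This is a minimal illustration rather than the original article's code: it assumes the Spark 1.x-style API (SparkContext and SQLContext), and the object name, helper name, and both HDFS paths are hypothetical placeholders that you would replace with your own.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object CreateRddFromHdfs {

  // Hypothetical helper: reads a text file into an RDD[String] and previews it
  // with three actions. Returns (line count, first line, first n lines).
  def previewText(sc: SparkContext, path: String, n: Int): (Long, String, Array[String]) = {
    val lines = sc.textFile(path) // each line of the file becomes one RDD element
    (lines.count(), lines.first(), lines.take(n))
  }

  def main(args: Array[String]): Unit = {
    // Assumed paths, for illustration only; they must exist in your HDFS.
    val textPath = "/user/example/orders"
    val jsonPath = "/user/example/orders_json"

    // The master URL is expected to come from spark-submit.
    val conf = new SparkConf().setAppName("CreateRddFromHdfs")
    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)

    // Text data: sc.textFile plus the count, first, and take actions.
    val (count, first, firstTen) = previewText(sc, textPath, 10)
    println(s"count = $count")
    println(s"first = $first")
    firstTen.foreach(println)

    // Other file formats: sqlContext.read returns a DataFrame.
    val ordersDf = sqlContext.read.json(jsonPath)
    // Equivalent form that names the format explicitly and then calls load.
    val ordersDf2 = sqlContext.read.format("json").load(jsonPath)
    ordersDf.show(10)
    ordersDf2.printSchema()

    // A DataFrame can be converted to an RDD of Row objects when needed.
    val ordersRdd = ordersDf.rdd
    ordersRdd.take(5).foreach(println)

    sc.stop()
  }
}
```

In spark-shell, sc and sqlContext are already defined, so only the calls to sc.textFile, the actions, and sqlContext.read are needed there. Note that count scans the whole dataset, while first and take read only as much data as they need, which makes them cheaper for a quick preview.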