Resilient Distributed Datasets - from files

Originally published at: http://www.itversity.com/topic/resilient-distributed-datasets-from-files/

Let us see how we can create RDDs by reading files. We can read data from the local file system in local execution mode. We can read data from HDFS, AWS S3, Azure Blob, etc. in any of the 4 modes (local, standalone, YARN and Mesos).

On your computer
If you run spark-shell, spark shell…
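
As a rough illustration of creating an RDD from a file, here is a minimal sketch meant to be pasted into spark-shell, where the SparkContext is already available as `sc`. It writes a small sample file to a temporary location (the file name and contents are made up for this example) and reads it back with `sc.textFile`. The HDFS and S3 paths in the comments are hypothetical placeholders, and reading from S3 additionally assumes the hadoop-aws connector and credentials are configured.

```scala
// Paste into spark-shell; `sc` (the SparkContext) is provided by the shell.
import java.nio.file.Files

// Create a small sample file on the local file system (hypothetical content).
val samplePath = Files.createTempFile("rdd-sample", ".txt")
Files.write(samplePath, "first line\nsecond line\n".getBytes("UTF-8"))

// The file:// scheme tells Spark to read from the local file system.
// This is suitable for local execution mode, where driver and executors
// run on the same machine and see the same files.
val localLines = sc.textFile(samplePath.toUri.toString)

// Nothing is read until an action such as take or count is called.
localLines.take(2).foreach(println)
println(s"Number of lines: ${localLines.count()}")

// The same call works with other file systems in any execution mode.
// The paths below are placeholders, not real locations:
// val hdfsLines = sc.textFile("hdfs://namenode:8020/path/to/data.txt")
// val s3Lines   = sc.textFile("s3a://my-bucket/path/to/data.txt")
```

Only the URI scheme changes between the local file system, HDFS, and S3; the resulting RDD of lines is used the same way in each case.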