Setup Data Sets


#1

Originally published at: https://kaizen.itversity.com/topic/cca175-setup-data-sets-scala/

As part of this topic, We will setup datasets Go to git Clone or Download on to Virtual Machines created using Cloudera Quickstart or Hortonworks Sandbox You can setup locally for practising for Spark, but it is highly recommended to use HDFS which comes out of the box with Cloudera Quickstart or Hortonworks or our…