As it is currently the CCA exam uses Cloudera’s 5.15 version, which supports Python 3.X and Spark 2.X. THATS GREAT BUT
the quick start vm we use to practice is still 5.13, uses python 2.7 and spark 1.6, also because of this we run into issues when practicing with data-sets that need utf-8 encoding/decoding. When will cloudera fix this?
Learn Spark 1.6.x or Spark 2.x on our state of the art big data labs
- Click here for access to state of the art 13 node Hadoop and Spark Cluster