Should I go for the HDPCD Spark using Python or the CCA175 Spark using Python exam?


#1

Hi,

I am undergoing training on big data technologies and planning to get certified.
However, I am not sure which certification to go for: HDPCD or CCA175.
Can you please advise?
So far I have no experience in big data and have not worked on any live project.
Which one would be easier for a beginner?

Also, please let me know whether either exam requires a computer with 8 GB of RAM or more.

Thanks


#2

I have this subscription, but when I try to import the Databricks library to read an Avro file, the import fails. What steps should I follow?
scala> import com.databricks.spark.avro._
<console>:25: error: object databricks is not a member of package com
import com.databricks.spark.avro._


#3

Hi @Shantanil_C

These two steps should solve your problem:

Launch spark-shell with the spark-avro package, using the command below:
spark-shell --master yarn --packages com.databricks:spark-avro_2.10:2.0.1 --conf spark.ui.port=12567

Once you are in the Spark shell, import the Avro package as shown below:
`import com.databricks.spark.avro._`
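
After the import succeeds, reading an Avro file could look roughly like the sketch below. It assumes a Spark 1.6 shell started with the `--packages` option above, so that `sqlContext` is already defined, and the path is a hypothetical placeholder that you would replace with your own HDFS location.

```scala
// Run inside spark-shell started with --packages com.databricks:spark-avro_2.10:2.0.1
import com.databricks.spark.avro._

// Hypothetical path: replace with the location of your own Avro data
val avroPath = "/user/your_user/data/sample.avro"

// The import above adds an avro() method to the DataFrame reader
val df = sqlContext.read.avro(avroPath)

// Inspect the schema and a few rows to confirm the file was read
df.printSchema()
df.show(5)
```

If the `avro` method is not found, `sqlContext.read.format("com.databricks.spark.avro").load(avroPath)` should be an equivalent way to load the same file.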