I constantly refer to the page https://arun-teaches-u-tech.blogspot.sg/p/file-formats.html for various file formats and it’s quite helpful.
However, while practicing, with “pyspark”, the import command " import com.databricks.spark.avro._;" never works. It was mentioned in one of the older posts that pyspark should be invoked using below to deal with avro files
pyspark --packages com.databricks:spark-avro_2.10:2.0.1
Is this the only way to do it to deal with avro files? Even in the CCA175 test, if there is any question to read from (or) write to avro files, should that session of pyspark be initated like above?