Sqoop Import Industry Standard


My organization has started Hadoop development, and our first task is to import RDBMS data into HDFS. Which file format is the industry standard for this, Avro or Parquet? Also, how should we secure the data while it moves from the RDBMS to HDFS? For example, do we need to apply a compression codec? Please suggest the industry-standard approach.

@srinivas.akshinthala, I am currently a student but have a few inputs on this.

From the numerous talks I have seen, many in the industry use the Parquet format for data stored in HDFS. Because it is a columnar format, it offers a performance benefit when querying with Hive or Impala.
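
To make that concrete, here is a rough sketch (not a definitive setup) of how a Sqoop 1 import into Parquet could be put together. It is written as a small Python helper that only builds and prints the command line. The helper name `build_sqoop_import`, the JDBC URL, user name, table, and paths are all hypothetical placeholders, and choosing Snappy as the codec is my assumption, not a requirement. The flags themselves (`--as-parquetfile`, `--compress`, `--compression-codec`, `--password-file`, `--num-mappers`) are standard Sqoop 1 import options. Note that a compression codec reduces storage size rather than securing the data; `--password-file` at least keeps the database password off the command line.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table, target_dir,
                       num_mappers=4):
    """Return the argv list for a Sqoop 1 import that writes compressed Parquet.

    Hypothetical helper for illustration; it only assembles the command.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        # Read the password from a protected file instead of passing it inline.
        "--password-file", password_file,
        "--table", table,
        "--target-dir", target_dir,
        # Store the imported rows as Parquet files.
        "--as-parquetfile",
        # Enable compression; Snappy is an assumed choice here.
        "--compress",
        "--compression-codec", "org.apache.hadoop.io.compress.SnappyCodec",
        "--num-mappers", str(num_mappers),
    ]


# All values below are placeholders, not real hosts or paths.
cmd = build_sqoop_import(
    jdbc_url="jdbc:mysql://db.example.com:3306/retail_db",
    username="etl_user",
    password_file="/user/etl_user/.db_password",
    table="orders",
    target_dir="/data/raw/orders",
)

# Print a shell-safe command line that could be pasted into a terminal.
print(" ".join(shlex.quote(arg) for arg in cmd))
```

If you prefer Avro, swapping `--as-parquetfile` for `--as-avrodatafile` should be the only change needed in this sketch.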

Regarding security, @itversity and @venkatreddy-amalla should be able to provide pointers.