Parquet File Import Error

Hi All,
I am trying to import data from MySQL using Sqoop. The file format I am using is Parquet. For the import I am using --query (with --target-dir and --split-by defined). I am getting this error: "ERROR sqoop.Sqoop: Got exception running Sqoop: org.kitesdk.data.ValidationException: Dataset name 01101847000000982_14799_quickstart.cloudera_departments is not alphanumeric (plus '_')". I am not sure what the error means. Please suggest a remedy.

Regards

@Vineet_Anand - Can you share the query you used?

@Vineet_Anand

Please paste the complete sqoop import command.

Here is the command:
sqoop import \
--connect jdbc:mysql://quickstart.cloudera:3306/retail_db \
--username retail_dba \
--password cloudera \
--table departments \
--as-parquetfile \
--warehouse-dir /user/hive/warehouse/sqoop_import.db \
--fields-terminated-by '\t' \
--lines-terminated-by '\n' \
--where 'department_id > 10' -m 3

I would like to share one more point. If --target-dir / --warehouse-dir is set to "/user/cloudera/warehouse/dept_parqt", the import runs without error, whereas if --target-dir or --warehouse-dir is set to "/user/cloudera/warehouse/dept_parqt.db", I get the error. I am not sure what goes wrong when the '.db' extension is added.

Regards

Can you run hadoop fs -ls /user/hive/warehouse and paste the output here?

It looks like Sqoop has a JIRA ticket for this problem, and it is not yet resolved:

https://issues.apache.org/jira/browse/SQOOP-2874
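
The pattern reported above (the import works with "dept_parqt" but fails with "dept_parqt.db") fits the rule quoted in the error message: only letters, digits, and '_' are accepted. Below is a minimal sketch of a pre-check you could run before the import. It assumes the name being validated comes from the last component of the target path, which this thread suggests but does not confirm. The function names are made up for illustration and are not part of Sqoop or Kite.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the name is non-empty and contains
# nothing but letters, digits, and underscores (the rule quoted in the error).
is_safe_dataset_name() {
  case "$1" in
    '' | *[!A-Za-z0-9_]*) return 1 ;;  # empty, or contains some other character
    *) return 0 ;;
  esac
}

# Hypothetical helper: applies the check to the last component of a path.
# Assumption: this is the part of the path that ends up in the dataset name.
check_target_dir() {
  name=$(basename "$1")
  if is_safe_dataset_name "$name"; then
    echo "ok: $name"
  else
    echo "rejected: $name"
  fi
}

check_target_dir /user/cloudera/warehouse/dept_parqt
check_target_dir /user/cloudera/warehouse/dept_parqt.db
```

The second path is rejected because of the '.' in "dept_parqt.db". The same applies to the dataset name in the original error, which contains "quickstart.cloudera".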