Unable to access the database

I tried to do exercises demonstrated on YouTube class for Spark and used cd /data/retail-db using my username and password on Itversity.com. It gave an error that there is no such directory. Can you please help so that I can finish the exercises?

Common issues we encounter on our state of the art Big Data Cluster with Hadoop, Spark and many others - https://labs.itversity.com

This is to simplify our support process so that we can answer technical issues as well.

Its cd /data/retail_db and not cd /data/retail-db

underscore and not dash

1 Like

i am facing issues to read the data.

My command is to read a file and assigning to an RDD.

Please find the below issues:
File “/usr/hdp/current/spark-client/python/lib/py4j-0.9-src.zip/py4j/protocol.py”, line 308, in get_return_value
py4j.protocol.Py4JJavaError: An error occurred while calling o44.partitions.
: org.apache.hadoop.mapred.InvalidInputException: Input path does not exist: hdfs://nn01.itversity.com:8020/user/revanth0110/data/retail_db/orders
** at org.apache.hadoop.mapred.FileInputFormat.singleThreadedListStatus(FileInputFormat.java:287)**