Hi please help me with pandas udfs in pyspark.
I am getting the error ‘no module named pyarrow’.
How can we install pyarrow in a cluster
Learn Spark 1.6.x or Spark 2.x on our state of the art big data labs
- Click here for access to state of the art 13 node Hadoop and Spark Cluster