i’m using below command to read & join the data from hive tables
spark.sql(“select o.order_id,sum(oi.order_item_subtotal) from orders o join orders_items oi on o.order_id = oi.order_item_order_id where o.order_status = ‘COMPLETE’ group by o.order_id limit 10”);
can you please suggest me is it correct way to read & to join hive tables of Spark 2.3.0.
Learn Spark 1.6.x or Spark 2.x on our state of the art big data labs
- Click here for access to state of the art 13 node Hadoop and Spark Cluster