Hive optimization

Dear Durga,

        I have a big query in hive with multiple joins. Could you please advise me what is best solution for hive big query optimization. 

Thanks
Ramkumar V

Hi @ramkumar,

I can advice you to use these two tips which can help a lot:

  1. Use stream table for joins (choose your main table to stream): https://www.linkedin.com/pulse/20141002060036-90038370-hive-join-optimization-stream-table-in-joins

  2. Be sure you have your tables and columns with statistics computed: https://www.cloudera.com/documentation/enterprise/5-9-x/topics/cm_mc_hive_table_stats.html

Best of luck,
David