Apache NiFi Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. Apache Kafka This category is to discuss all about Kafka Apache Flume This is the topic to track the issues with respect to Flume. Apache Hadoop This is to discuss all the topics around Hadoop core components such as Apache HBase This category is to discuss more about HBase. Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Administration This is to discuss all about Big Data administration Apache Spark This subcategory of big data is all about discussing Apache Spark Workshop Exercises This category is to create Exercises who are part of live training sessions. Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Hive Let us start discussing all topics with respect to Apache Hive.
Spark cluster configuration- Daily data size -- spark job submit configuration parameters (--num-executors, --executor-memory, --driver-memory) [Apache Spark] (1)
Unable to create foreachwriter to load data from kafka to hbase using structured streaming [Apache Spark] (1)
Hive SCD 1,2 (Slowly changing dimension) implementation- considering Hive is being a Datawarehouse solution [Big Data] (4)