Big Data


Apache NiFi Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. Apache Kafka This category is to discuss all about Kafka Apache Flume This is the topic to track the issues with respect to Flume. Apache Hadoop This is to discuss all the topics around Hadoop core components such as Apache HBase This category is to discuss more about HBase. Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Administration This is to discuss all about Big Data administration Apache Spark This subcategory of big data is all about discussing Apache Spark Workshop Exercises This category is to create Exercises who are part of live training sessions. Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Hive Let us start discussing all topics with respect to Apache Hive.
About the Big Data category [Big Data] (1)
Spark cluster configuration- Daily data size -- spark job submit configuration parameters (--num-executors, --executor-memory, --driver-memory) [Apache Spark] (1)
Cleared CCA175 - Next step? [Big Data] (1)
Hive : sort data based on a string field that has date in (dd/mm/yy) format [Apache Hive] (1)
Unable to create foreachwriter to load data from kafka to hbase using structured streaming [Apache Spark] (1)
Saving Text File in snappy compression [Apache Spark] (14)
Are we allowed to run sqoop help during cca175 certification? [Big Data] (3)
Conversion of File Format in Spark 2.0 [Apache Spark] (2)
Value reduceByKey is not a member of org.apache.spark.rdd.RDD[String] [Apache Spark] (4)
Not able to open Spark Web UI [Apache Spark] (2)
Unable to perform listTables on spark.catalog class [Apache Spark] (1)
Unable to updating an array inside list's map function. Scala Programming [Apache Spark] (1)
Data migration to new cluster and adding up of nodes [Big Data] (1)
Hive SCD 1,2 (Slowly changing dimension) implementation- considering Hive is being a Datawarehouse solution [Big Data] (4)
Load .DAT file into HDFS [Apache Hadoop] (5)
Hive sorting using cluster by [Apache Hive] (1)
Sqoop , hive arguments [Apache Hive] (1)
Tab Autocomplete does not work in pyspark shell [Apache Spark] (5)
Not able to launch pyspark shell [Apache Spark] (1)
How to create hive meta store table in parquet with snappy? [Apache Hive] (4)
analysis of the spark [Apache Spark] (1)
Regular expression for dates using pyspark(CCA175) [Apache Spark] (2)
Pyspark is not working from Nov 14 2018 [Apache Spark] (2)
Unable to enter spark shell [Apache Spark] (3)
Not Able to Start Spark Shell in console [Big Data] (2)
Sqoop export exception [Apache Sqoop] (2)
Cannot read parquet file in dataframe [Apache Spark] (1)
Sqoop hive import [Apache Sqoop] (4)
aggregateBykey() for String using pyspark(CCA175) [Apache Spark] (1)
Unable to connect flume - Permission issue [Apache Flume] (3)