Big Data


Apache HBase This category is to discuss more about HBase. Apache NiFi Administration This is to discuss all about Big Data administration Apache Hadoop This is to discuss all the topics around Hadoop core components such as Workshop Exercises This category is to create Exercises who are part of live training sessions. Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. CCA 131 - Cloudera Certified Associate - Admin Apache Spark This subcategory of big data is all about discussing Apache Spark Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Flume This is the topic to track the issues with respect to Flume. cca131 Apache Hive Let us start discussing all topics with respect to Apache Hive. Apache Kafka This category is to discuss all about Kafka
Topic Replies Activity
About the Big Data category 1 November 6, 2016
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Analytics Functions or Windowing Functions 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Development Life Cycle 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Sorting data 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Grouping data and performing aggregations 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Joining multiple Data Frames 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Selection or Projection of Data in Data Frames 2 November 29, 2020
Apache Spark 2.x - Processing Data using Data Frames - Basic Transformations - Filtering Data from Data Frames 2 November 29, 2020
Apache Spark 1.6 - Transform, Stage and Store - Aggregations – groupByKey – Get revenue for each order id 3 November 29, 2020
I have a doubt on this topic. when we apply groupbykey on following dataset, groupby is performed 1 November 29, 2020
How to start spark 2 in lab? 4 November 28, 2020
Regarding history command in pyspark 5 November 24, 2020
Unable to read files in folders to create rdd, Can anyone explain me the issue 3 November 24, 2020
Cca-175-spark-and-hadoop-developer-certification-scala tutorial has missing file in section 10 1 November 23, 2020
Unable to insert into --> wordCount.saveAsTextFile("/user/training/bootcamp/wordcount") 2 November 22, 2020
Accessing Spark UI on the Cluster 10 November 20, 2020
Install CM and CDH - Setup CM, Install CDH and Setup Cloudera Management Service - Install CM and CDH on all nodes 6 November 19, 2020
Issue with writing DF to textFile - Text Data supports only single column 2 November 18, 2020
Sqoop Export - Using update-mode - allow-insert 2 November 18, 2020
ORA-01562: failed to extend rollback segment number 270 1 November 18, 2020
Spark issue - some admin commands are getting executed automatically in every 10-12 seconds 2 November 16, 2020
Sqoop Import error - Yashwanth 232 2 November 16, 2020
Hive is not connecting 8 November 13, 2020
Set High Priority while executing spark-submit 1 November 9, 2020
Can't create Hive database 2 November 9, 2020
Bash: kafka-topics.sh: command not found 11 November 9, 2020
Unable to create Objects in Hive 6 November 9, 2020
Unable to run Sqoop Jobs 2 November 9, 2020
Could not find any available broker. Check your StreamsConfig setting 'bootstrap.servers' 2 November 6, 2020
Avro Format Not working 3 November 6, 2020