Big Data


Apache HBase This category is to discuss more about HBase. Apache NiFi Apache Hadoop This is to discuss all the topics around Hadoop core components such as Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. Workshop Exercises This category is to create Exercises who are part of live training sessions. Apache Spark This subcategory of big data is all about discussing Apache Spark CCA 131 - Cloudera Certified Associate - Admin Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Flume This is the topic to track the issues with respect to Flume. Apache Kafka This category is to discuss all about Kafka Apache Hive Let us start discussing all topics with respect to Apache Hive. Administration This is to discuss all about Big Data administration cca131
Topic Replies Activity
About the Big Data category 1 November 6, 2016
Adding Sentry to Kerberized Hadoop Cluster 1 September 17, 2019
PySpark RDD issue 1 September 18, 2019
Validating avro Files using avro-tools 3 September 17, 2019
Fields-terminated-by and lines-terminated-by not working as expected 1 September 11, 2019
Getting Hadoop configuration details with python 1 September 16, 2019
Incremental import 1 September 16, 2019
Hadoop mini cluster 1 September 16, 2019
Flume exec source 1 September 15, 2019
Spark CCA 175 Lecture 239 issue Streaming not working 1 September 15, 2019
Sqoop import 'sort by' and 'group by ' issue 4 September 13, 2019
Develop Application using IDE - Externalize Properties 1 September 13, 2019
Add Dependencies to the Project 1 September 13, 2019
Apache Spark 2.x - Data Frames and Pre-Defined Functions - Create Data Frames using JDBC 3 September 12, 2019
org.apache.hadoop.ipc.RemoteException(org.apache.hadoop.security.AccessControlException): Permission denied: user=user@xxx.xxxxx.net, access=WRITE, inode="/user":hdfs:hadoop:drwxr-xr-x 1 September 12, 2019
Flume error...writing to hive 2 September 11, 2019
Writing file into HDFS as tab delimited file from spark 1 September 11, 2019
Not able to create directories in hadoop using mkdir 1 September 10, 2019
Enable Kerberos on Hadoop and Spark Cluster using Cloudera Manager 1 September 6, 2019
Kerberos Essentials or Core Concepts 1 September 6, 2019
Need Help, Very TOugh Task 1 September 5, 2019
Not able to read a topic in Kafka using Pyspark 4 September 5, 2019
Generic question on compression 2 September 4, 2019
Issue while running spark-sql commands in case of empty string 4 September 3, 2019
How to connect hive from Pyspark 2 September 3, 2019
Getting error in using vi editor 1 September 2, 2019
Spark getting hung up: needs tuning 3 September 2, 2019
Develop Application - Get the monthly revenue for each customer (Exercise) 1 August 31, 2019
Need help on reading a topic using pyspark 6 August 31, 2019
Save output to a table using Spark SQL 1 August 30, 2019