Big Data


Apache HBase This category is to discuss more about HBase. Apache NiFi Workshop Exercises This category is to create Exercises who are part of live training sessions. Administration This is to discuss all about Big Data administration Apache Hadoop This is to discuss all the topics around Hadoop core components such as Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Apache Spark This subcategory of big data is all about discussing Apache Spark Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. CCA 131 - Cloudera Certified Associate - Admin Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Flume This is the topic to track the issues with respect to Flume. Apache Hive Let us start discussing all topics with respect to Apache Hive. cca131 Apache Kafka This category is to discuss all about Kafka
Topic Replies Activity
About the Big Data category 1 November 6, 2016
Introduction - CCA 175 Spark and Hadoop Developer – Curriculum(Old syllabus) 1 September 26, 2019
Cannot connect to hive database from beeline 2 July 12, 2020
Getting Started - Setup Environment – using Cloudera Quickstart VM 2 July 10, 2020
Not possible to download cloudera vm anymore 1 July 10, 2020
DataFrame - Register Temp Table - Table not found issue 3 July 8, 2020
Load JSON file in HIVE 5 July 10, 2020
No data in /public path 4 July 5, 2020
Packages to read/write avro files - General Question 2 July 8, 2020
Thread 84 spilling sort data of 334.0 MiB to disk (3 times 1 July 7, 2020
Apache Spark 1.6 - Transform, Stage and Store -Create RDD from HDFS files 2 July 8, 2020
Could you please providie us sample order and order item data for hands-on practise 3 July 8, 2020
Pyspark connect to hive remotely 1 July 7, 2020
Json file loading to Hive 11 July 2, 2020
Https://www.udemy.com/course/cca-175-spark-and-hadoop-developer-python-pyspark/learn/lecture/14286636#announcements ordersDF = spark.read.csv('Users/itversity/Research/data/retail_db/orders') the query is not executing 4 July 2, 2020
Not able to run spark in yarn mode 2 July 2, 2020
Hadoop Userspace Setup 2 July 1, 2020
User Spaces or Home Directories in HDFS 2 June 30, 2020
Access issue for user senjoydeep_cca175 to /apps/hive/warehouse/ while writing file 2 June 29, 2020
How to access the hdfs structure of table 1 June 26, 2020
Can't create a partition in hive 10 June 28, 2020
During Pyspark execution throw some Error 2 June 27, 2020
Pyspark not working on Windows cmd 2 June 24, 2020
Flume Spark streaming 1 June 8, 2020
Why do we use WHERE $CONDITIONS in Sqoop -EXPLAINED an ANSWERED 2 June 20, 2020
Not able to see hadoop filesystem /user/mschoudhary1 2 June 19, 2020
Python Program is not working 1 June 16, 2020
Preserving order while saving DF to a file 1 June 16, 2020
MySQL - Access denied for all - hr_export/retail_export/h1b_export/hr_user/retail_user/h1b_user 2 June 15, 2020
Error: value textfile is not a member of org.apache.spark.SparkContext 2 June 14, 2020