Big Data


Apache NiFi Apache Flume This is the topic to track the issues with respect to Flume. Apache Pig This is all about Apache Pig - a data flow language used to process both structured and unstructured data. Apache Hadoop This is to discuss all the topics around Hadoop core components such as Administration This is to discuss all about Big Data administration Apache Kafka This category is to discuss all about Kafka Apache HBase This category is to discuss more about HBase. Apache Spark This subcategory of big data is all about discussing Apache Spark Virtual Machines This category is to discuss about all the issues related to virtual machine images for big data Workshop Exercises This category is to create Exercises who are part of live training sessions. Apache Sqoop This is to discuss all about Apache Sqoop which is used to export and import data between relational databases and Hadoop Apache Hive Let us start discussing all topics with respect to Apache Hive.
About the Big Data category [Big Data] (1)
Spark latest versions [Apache Spark] (3)
Sqoop job showing error [Apache Sqoop] (2)
Pyspark in jupyter: not working [Big Data] (2)
Overview of NoSQL Technologies and HBase [Apache HBase] (1)
Getting Started – Apache Kafka [Apache Kafka] (1)
SPARK ERROR - cannot find the file specified [Apache Spark] (1)
java.lang.NumberFormatException: while doing df.show() [Apache Spark] (1)
AggregateByKey issue [Apache Spark] (3)
Require Assistance over remote [Apache Spark] (2)
Prerequisites for learning Hadoop & big data? [Apache Hadoop] (5)
Data Frame Operations - Analytic Functions [Apache Spark] (7)
HDPCA - Wierd Error while installing ambari-metrics - related to kerbores library [Administration] (4)
Sqoop mysql connection issues, [Apache Sqoop] (4)
Unable to configure pyspark on my system [Apache Spark] (8)
Getting error while running pyspark in pycharm : Unable to load native-hadoop library for your platform... using builtin-java classes where applicable [Apache Spark] (2)
How to connect to Cassandra in itversity [Big Data] (15)
Spark Streaming - end to end [Apache Spark] (1)
Exercise 03 (regex to split ) [Apache Spark] (2)
Gw01.itversity.com Lab is too slow to run any commands [Apache Spark] (1)
Do we need to format datanode with HDFS file system to store data or just we need to format namenode [Big Data] (1)
sparkSQL : creating external hive table from ORC and parquet file [Apache Hive] (1)
Query is not working [Big Data] (2)
Object spark is not a member of package org.apache [Apache Spark] (4)
How to load multiple tables from rdbms into Hive without using All table, and i want to write sqoop job which will have mappers information for big tables also [Apache Sqoop] (1)
Saving dataframe result in exam [Apache Spark] (2)
Laod data in hbase [Apache Sqoop] (1)
Oozie wf to extract query output (hive) into csv format [Apache Hive] (1)
Exception while running spark-submit [Apache Spark] (6)
Cannot access mySQL retail_export or retail_import dbs from spark but able to access retail_db [Apache Spark] (7)