Streaming Analytics using Flume, Kafka and Spark Streaming


Originally published at:

While we can perform data ingestion from databases into HDFS using Sqoop, at times we need to get the data from web server logs into HDFS or some other target. Also we might have to process data before loading data into target like HDFS. To achieve this we need to understand getting data from web…