Data Ingest using Flume and HDFS


Originally published at:

Apache Flume is an open source tool that captures data such as web server logs in real time and delivers it to the Hadoop ecosystem. It is distributed and reliable, and it can collect, aggregate, and move large amounts of log data. It is robust, fault tolerant, and tunable, and it uses a simple, extensible data model that allows for online analytic applications. Flume…
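
To make the log-to-Hadoop flow concrete, here is a minimal sketch of a Flume agent configuration that tails a web server log and writes the events to HDFS. The agent name (agent1), the component names (weblog, mem, hdfs-out), the log path, and the NameNode address are all assumptions chosen for illustration and do not come from the original article. Adjust them to match your own cluster.

```properties
# Hypothetical agent "agent1": one source, one channel, one sink.
agent1.sources = weblog
agent1.channels = mem
agent1.sinks = hdfs-out

# Source: follow a web server access log (the path is an assumption).
agent1.sources.weblog.type = exec
agent1.sources.weblog.command = tail -F /var/log/httpd/access_log
agent1.sources.weblog.channels = mem

# Channel: buffer events in memory between the source and the sink.
agent1.channels.mem.type = memory
agent1.channels.mem.capacity = 10000
agent1.channels.mem.transactionCapacity = 1000

# Sink: write events to HDFS, one directory per day (NameNode host and port are assumptions).
agent1.sinks.hdfs-out.type = hdfs
agent1.sinks.hdfs-out.channel = mem
agent1.sinks.hdfs-out.hdfs.path = hdfs://namenode:8020/flume/weblogs/%Y-%m-%d
agent1.sinks.hdfs-out.hdfs.fileType = DataStream
# Use the agent's local clock to resolve the date escapes in hdfs.path.
agent1.sinks.hdfs-out.hdfs.useLocalTimeStamp = true
# Roll to a new file every 300 seconds; disable size-based and count-based rolling.
agent1.sinks.hdfs-out.hdfs.rollInterval = 300
agent1.sinks.hdfs-out.hdfs.rollSize = 0
agent1.sinks.hdfs-out.hdfs.rollCount = 0
```

Assuming the file is saved as weblog.conf, an agent along these lines could be started with `flume-ng agent --name agent1 --conf conf --conf-file weblog.conf`. The memory channel is fast but loses buffered events if the agent stops, so a file channel may be a better fit where durability matters.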