Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application.

Flume Study

  1. flume-installation
  2. flume-collecting-twitter-data
  3. flume-netcat-agent-configuration
  4. apache-flume-moving-tomcat-logs-to-hdfs
  5. flume-seqgen-agent-configuration