Flume HDFS Sink generates lots of tiny files on HDFS

北恋 2021-01-24 18:14

I have a toy setup sending log4j messages to HDFS using Flume. I'm not able to configure the HDFS sink to avoid many small files. I thought I could configure the HDFS sink to

3 answers
  •  清歌不尽
    2021-01-24 18:30

    This may be caused by the memory channel and its capacity. My guess is that it dumps data to HDFS as soon as the channel fills up. Have you tried using a file channel instead of a memory channel?
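
    As a rough sketch of the suggestion above, switching to a file channel might look like the fragment below. The agent, channel, and sink names (a1, c1, k1) and the directory paths are placeholders assumed for illustration, not values from the original setup.

    ```
    # Hypothetical names: agent "a1", channel "c1", sink "k1".
    # The directories are placeholders; point them at local disk paths.
    a1.channels = c1
    a1.channels.c1.type = file
    a1.channels.c1.checkpointDir = /var/flume/checkpoint
    a1.channels.c1.dataDirs = /var/flume/data

    # Attach the existing HDFS sink to the file channel.
    a1.sinks.k1.type = hdfs
    a1.sinks.k1.channel = c1
    ```

    Separately, the HDFS sink's own roll settings (hdfs.rollInterval, hdfs.rollSize, hdfs.rollCount) control when a file is closed, so they may be worth checking as well.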
