The project I\'m working on is indexing a certain number of data (with long texts) and comparing them with list of words per interval (about 15 to 30 minutes).
Use compound index to reduce file count. When this flag is set, lucene will write a segment as single .cfs file instead of multiple files. This will reduce the number of files significantly.