I\'m ussing the Appache Jackrabbit JCA 2.7.5, the problem is that files .docx and .xlsx is not indexed.
My steps :
Ref: http://jackrabbit.510166.n4.nabble.com/Office-2007-documents-not-being-indexed-in-Jackrabbit-2-4-3-td4657380.html
On the same line, I have observed commons-compress-1.5.jar is required by Tika parser in case of OOXML types of documents (i.e. office 2007 documents).
Now, I am able to index & search most of types of documents (office 2007 - docx, pptx, xlsx , office 2003 - doc, ppt, xls, PDF) using below 2 steps:
(1) Updated repository.xml & added Further details can be found at https://issues.apache.org/jira/browse/JCR-3287
(2) Added commons-compress-1.5.jar classpath while running jackrabbit-standalone-2.6.2.jar