I have the following in my code that tries to implement the Tika library
ContentHandler handler = basicHandlerFactory.getNewContenthandler(); parser.parse(stream,hand