I have configured Solr 7.7.3 to detect English and Japanese documents. It can work normally with text based files like docx, x