As far as I know, Tesseract 3.x comes with 6 English (correct me if I\'m wrong) fonts. I need to train Tesseract for more 5 types of fonts. I need only capital letters and digit
Should use a different name, e.g., eng1.traineddata
. That way you can use the new data with the original one by specifying the language option -l eng+eng1
.
If you have new trained data with different font, I think you don't have dictionary correction for your new font.
To add new trained data you can do this (I'm using PHP code here)
// as you new trained data, it must be 3 letter prefix
// what ever 3 letter you want
$languange = "eng+deu";
$settingLanguage = $tesseract -> setLanguage($language) ;
By seeing the tesseract.php function setLanguage()
, you can set the language by that function.