I have to analyze informal English text with lots of shorthand and local lingo, so I was thinking of creating my own model for the Stanford tagger.
How do I create my
To train the POS tagger, see this mailing list post, which is also included in the Javadocs for the MaxentTagger class.
The Javadocs for the edu.stanford.nlp.tagger.maxent.Train class specify the training format:
The training file should be in the following format: one word and one tag per line separated by a space or a tab. Each sentence should end in an EOS word-tag pair. (Actually, I'm not entirely sure that is still the case, but it probably won't hurt. -wmorgan)
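As a rough illustration of the format quoted above, here is a minimal sketch that writes tagged sentences out as one word and one tag per line, separated by a tab. It does not call the Stanford tagger at all. The class name TaggerTrainingFormat, the sample tokens, and their tags are made up for the example, and writing the end-of-sentence marker as the literal pair "EOS EOS" is my assumption, since the quoted Javadoc does not spell out the exact tokens (and, as noted, the marker may no longer be required).

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: renders hand-tagged sentences in the
// "one word and one tag per line" layout described in the quoted Javadoc.
class TaggerTrainingFormat {
    // Assumption: the end-of-sentence pair is written as "EOS<TAB>EOS".
    static final String EOS_WORD = "EOS";
    static final String EOS_TAG = "EOS";

    // Each sentence is a list of {word, tag} pairs.
    static String toTrainingFormat(List<List<String[]>> sentences) {
        StringBuilder sb = new StringBuilder();
        for (List<String[]> sentence : sentences) {
            for (String[] pair : sentence) {
                // word and tag separated by a tab, one pair per line
                sb.append(pair[0]).append('\t').append(pair[1]).append('\n');
            }
            // close the sentence with the (assumed) EOS word-tag pair
            sb.append(EOS_WORD).append('\t').append(EOS_TAG).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Illustrative informal tokens with Penn Treebank-style tags;
        // the tag choices are examples only, not a recommended annotation.
        List<String[]> first = Arrays.asList(
                new String[] {"c", "VB"},
                new String[] {"u", "PRP"},
                new String[] {"l8r", "RB"});
        List<String[]> second = Arrays.asList(
                new String[] {"thx", "UH"});
        System.out.print(toTrainingFormat(Arrays.asList(first, second)));
    }
}
```

You would redirect the output to a file and hand that file to the trainer as described in the mailing list post. If your version of the tagger turns out not to want the EOS lines, dropping the one append call that writes them is the only change needed.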