How do I create my own training corpus for the Stanford tagger?

你的背包 2021-02-05 13:20

I have to analyze informal English text with lots of shorthand and local lingo, so I was thinking of creating my own model for the Stanford tagger.

How do I create my own training corpus for it?

4 answers
  •  面向向阳花
    2021-02-05 13:55

    To train the POS tagger, see this mailing list post, which is also included in the Javadocs for the MaxentTagger class.

    The Javadocs for the edu.stanford.nlp.tagger.maxent.Train class specify the training format:

    The training file should be in the following format: one word and one tag per line separated by a space or a tab. Each sentence should end in an EOS word-tag pair. (Actually, I'm not entirely sure that is still the case, but it probably won't hurt. -wmorgan)
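
As a rough illustration of the format quoted above, here is a minimal Python sketch that writes hand-tagged sentences as one word and one tag per line, separated by a tab, with an end-of-sentence pair after each sentence. The helper name `format_training_data`, the sample sentences and their tags, and the literal `EOS EOS` pair are all assumptions made for illustration; the Javadoc quote does not spell out the exact tag to use for EOS, so check it against the tagger version you have.

```python
from typing import Iterable, List, Tuple

TaggedSentence = List[Tuple[str, str]]

# Assumption: the end-of-sentence marker is the literal word "EOS" tagged "EOS".
EOS_PAIR = ("EOS", "EOS")


def format_training_data(sentences: Iterable[TaggedSentence], sep: str = "\t") -> str:
    """Render tagged sentences as one 'word<sep>tag' pair per line.

    An EOS pair is appended after every sentence, following the quoted format.
    """
    lines = []
    for sentence in sentences:
        for word, tag in list(sentence) + [EOS_PAIR]:
            # Whitespace inside a token would be confused with the separator.
            if not word or not tag or any(c.isspace() for c in word + tag):
                raise ValueError(f"invalid word/tag pair: {word!r}/{tag!r}")
            lines.append(f"{word}{sep}{tag}")
    return "".join(line + "\n" for line in lines)


# Hypothetical hand-tagged informal text; the tags are illustrative only.
corpus = [
    [("gonna", "VBG"), ("c", "VB"), ("u", "PRP"), ("2moro", "NN")],
    [("lol", "UH")],
]

training_text = format_training_data(corpus)
print(training_text, end="")
```

The resulting text can be saved to a file and used as the training file, provided the tagger version you use still expects this layout.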
