How do I create my own training corpus for the Stanford tagger?

你的背包 2021-02-05 13:20

I have to analyze informal English text with lots of shorthand and local lingo, so I was thinking of training my own model for the Stanford tagger.

How do I create my own training corpus?

4 answers
  •  小鲜肉 (OP)
    2021-02-05 13:59

    For the Stanford Parser, the training data must be in Penn Treebank format; see Stanford's FAQ for the exact commands to use. The JavaDocs for the LexicalizedParser class also give the appropriate commands, in particular:

    java -mx1500m edu.stanford.nlp.parser.lexparser.LexicalizedParser [-v] \
       -train trainFilesPath fileRange \
       -saveToSerializedFile serializedGrammarFilename
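
To make the Penn Treebank format mentioned above more concrete, here is a minimal Python sketch that reads one bracketed tree and lists its (word, tag) pairs. The sample sentence, its annotation, and the helper name `tagged_tokens` are hypothetical and only for illustration; a real corpus would need consistent hand annotation of the shorthand, and this sketch is not part of the Stanford tools.

```python
import re

# Hypothetical hand-annotated tree for an informal sentence ("u r gr8 !").
# The labels follow Penn Treebank conventions; the annotation itself is an assumption.
SAMPLE_TREE = "(ROOT (S (NP (PRP u)) (VP (VBP r) (ADJP (JJ gr8))) (. !)))"


def tagged_tokens(tree: str) -> list[tuple[str, str]]:
    """Return (word, tag) pairs for the leaves of one bracketed tree."""
    # Cheap sanity check: every opening bracket needs a closing one.
    if tree.count("(") != tree.count(")"):
        raise ValueError("unbalanced brackets in tree")
    # A leaf looks like "(TAG word)" with no nested brackets inside it.
    leaves = re.findall(r"\(([^\s()]+)\s+([^\s()]+)\)", tree)
    return [(word, tag) for tag, word in leaves]


if __name__ == "__main__":
    for word, tag in tagged_tokens(SAMPLE_TREE):
        print(f"{word}\t{tag}")
```

A check like this can be handy for spotting unbalanced brackets or unexpected tags in hand-written trees before passing the files to `-train`.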
    
