What does the dependency-parse output of TurboParser mean?

Posted by 风格不统一 on 2019-11-30 21:18:23

I don't know TurboParser, but my guess is that the first number indicates the id of the token and that the second number indicates the id of its governor. That is, for your example:

solved(
 I,
 problem(the),
 with(statistics),
 .
)
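
A minimal sketch of that reading, assuming the sentence is "I solved the problem with statistics ." and that the head ids are the ones implied by the tree above. The triples are reconstructed for illustration, not copied from real TurboParser output, and `render` is a hypothetical helper name.

```python
from collections import defaultdict

# (token id, token, id of its governor); a governor id of 0 marks the root.
# Reconstructed from the bracketed tree above, not real parser output.
TOKENS = [
    (1, "I", 2),
    (2, "solved", 0),
    (3, "the", 4),
    (4, "problem", 2),
    (5, "with", 2),
    (6, "statistics", 5),
    (7, ".", 2),
]


def render(tokens):
    """Return the tree as a bracketed string: governor(dependent, ...)."""
    forms = {tid: form for tid, form, _ in tokens}
    children = defaultdict(list)
    for tid, _, head in tokens:
        children[head].append(tid)

    def walk(tid):
        kids = children.get(tid, [])
        if not kids:
            return forms[tid]
        return forms[tid] + "(" + ", ".join(walk(k) for k in kids) + ")"

    # Start from every token whose governor is 0 (normally exactly one).
    return " ".join(walk(root) for root in children[0])


print(render(TOKENS))  # prints: solved(I, problem(the), with(statistics), .)
```

Grouping tokens by governor id and recursing from the token whose governor is 0 is enough to recover the nesting shown above.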

Actually, that's CoNLL-X format. You can get more information here: http://ilk.uvt.nl/conll/#dataformat

Here is the meaning of each of the columns TurboParser outputs:

  1. id of the token, i.e. its one-based index in the sentence
  2. original token as it was in the original text
  3. lemma, the lemmatized form of the token (empty here, because no lemmatizer has been set)
  4. tag (coarse-grained part-of-speech tag)
  5. tag (fine-grained part-of-speech tag, which is the same as 4. with TurboParser)
  6. morphological features (empty here)
  7. head of the token, represented by its index (the root token has a head value of 0)
  8. relation of the current token with its head
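
The column layout above can be read with a few lines of Python. This is only a sketch: it assumes tab-separated columns and `_` for empty fields (as in CoNLL-X), and the two sample lines, including their tags and relation labels, are made up for illustration rather than taken from real TurboParser output. `Token` and `parse_line` are hypothetical names.

```python
from typing import NamedTuple


class Token(NamedTuple):
    id: int        # 1. one-based index in the sentence
    form: str      # 2. original token
    lemma: str     # 3. lemma ("_" when empty)
    cpostag: str   # 4. coarse-grained part-of-speech tag
    postag: str    # 5. fine-grained part-of-speech tag
    feats: str     # 6. morphological features ("_" when empty)
    head: int      # 7. index of the head (0 for the root)
    deprel: str    # 8. relation to the head


def parse_line(line):
    """Parse one token line; any columns after the eighth are ignored."""
    cols = line.rstrip("\n").split("\t")
    tid, form, lemma, cpos, pos, feats, head, deprel = cols[:8]
    return Token(int(tid), form, lemma, cpos, pos, feats, int(head), deprel)


# Illustrative input only: tags and labels here are assumptions.
SAMPLE = (
    "1\tI\t_\tPRP\tPRP\t_\t2\tSUB\n"
    "2\tsolved\t_\tVBD\tVBD\t_\t0\tROOT\n"
)

tokens = [parse_line(line) for line in SAMPLE.splitlines() if line.strip()]
for tok in tokens:
    print(tok.id, tok.form, "->", tok.head, tok.deprel)
```

Blank lines, which separate sentences in this format, are skipped by the `if line.strip()` filter.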

The generated output you gave can be represented as a dependency-based parse tree:

[Figure: dependency parse tree of the example sentence]

For further information on the CoNLL-X format, see the data-format description linked above.
