Named entity recognition with a small data set (corpus)

﹥>﹥吖頭↗ submitted on 2019-12-07 07:55:31

Yes, with your help. It is a work in progress, written in JavaScript, and "No training ..."

Please see https://github.com/redaktor/nlp_compromise/

It is a fork in which I have worked on NER over the last few days, and it will be optimized for use with different languages.

It combines a dictionary of words, a dictionary of rules, and a build tool. It would be awesome to work on Persian support (I am working on German). NER support is planned for:

  • 'CARDINAL' -> [ready]
  • 'DATE' -> calendar based [Gregorian calendar is ready]
  • 'DURATION' -> see above [date ranges are ready]
  • 'MEASURE' -> systems based [metric system and SI units ready, 80+ categories]
  • 'MONEY' -> currencies based [ready in a few days]
  • 'PERSON' -> word/rules based [English/European names are ready]
  • 'ORGANIZATION'
  • 'LOCATION'
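
To make the "dictionary for words + dictionary for rules" idea concrete, here is a minimal sketch of how such a training-free tagger could look. It is not the fork's actual API: `wordDictionary`, `ruleDictionary`, `lookupLabel` and `tagEntities` are hypothetical names, and the entries and patterns are made-up examples rather than data from the repository.

```javascript
// Hypothetical sketch: none of these names or entries come from nlp_compromise.

// Word dictionary: exact (lowercased) tokens mapped to an entity label.
const wordDictionary = new Map([
  ["three", "CARDINAL"],
  ["euro", "MONEY"],
  ["euros", "MONEY"],
  ["kilometers", "MEASURE"],
]);

// Rule dictionary: ordered patterns tried only when the word lookup misses.
const ruleDictionary = [
  { pattern: /^\d{4}-\d{2}-\d{2}$/, label: "DATE" },
  { pattern: /^\d+(\.\d+)?(km|kg|cm|m)$/i, label: "MEASURE" },
  { pattern: /^\d+([.,]\d+)?$/, label: "CARDINAL" },
];

// Return the label for one token, or null if neither dictionary knows it.
function lookupLabel(token) {
  const fromWords = wordDictionary.get(token.toLowerCase());
  if (fromWords !== undefined) return fromWords;
  const rule = ruleDictionary.find((r) => r.pattern.test(token));
  return rule ? rule.label : null;
}

// Split on whitespace, trim punctuation at both ends of each token,
// and keep only the tokens that received a label.
function tagEntities(text) {
  return text
    .split(/\s+/)
    .map((raw) => raw.replace(/^[^\p{L}\p{N}]+|[^\p{L}\p{N}]+$/gu, ""))
    .filter((token) => token.length > 0)
    .map((token) => ({ token, label: lookupLabel(token) }))
    .filter((entry) => entry.label !== null);
}

console.log(tagEntities("She paid three euros on 2019-12-07 for 5km."));
```

Because nothing is learned from a corpus, supporting another language in a design like this would mostly mean supplying another pair of dictionaries, which is why a small data set is less of an obstacle here.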

I think it could be a starting point. I have not yet found the time to document the new features, so feel free to open issues on GitHub.
