How can I detect a user's input language using Ruby without using an online service?

星月不相逢 2021-01-15 06:11

I'm looking for a library or technique to detect the input language of blocks of text provided by users. Online lookups (like Google translate) won't work for this task as

4 answers
  • 2021-01-15 06:19

    Here are two more n-gram-based gems you might want to try. They work offline.

    • https://github.com/echen/unsupervised-language-identification, optimized for separating English from other languages (has a live demo)
    • https://github.com/feedbackmine/language_detector, less specialized and able to detect more languages. Some languages may need extra training; I found it not precise enough for German text.
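
    To give an idea of what "n-gram-based" means here, below is a minimal pure-Ruby sketch of the general technique: build a character-trigram profile per language and pick the profile most similar to the input. This is not the code of either gem above. The sample sentences, the SAMPLES and PROFILES constants, and the trigrams, cosine and detect_language helpers are all made up for illustration, and cosine similarity is just one possible scoring choice.

```ruby
# Illustrative only: tiny hand-written training samples (an assumption).
# Real detectors are trained on far larger corpora.
SAMPLES = {
  english: "the quick brown fox jumps over the lazy dog and this is what " \
           "the people say when they want to know which language it is",
  german:  "der schnelle braune fuchs springt über den faulen hund und das ist " \
           "was die leute sagen wenn sie wissen wollen welche sprache es ist",
  french:  "le renard brun rapide saute par-dessus le chien paresseux et c'est " \
           "ce que les gens disent quand ils veulent savoir quelle langue c'est"
}.freeze

# Count character trigrams after lowercasing and collapsing every run of
# non-letters into a single space. The text is padded with spaces so that
# word boundaries show up in the trigrams too.
def trigrams(text)
  normalized = " #{text.downcase.gsub(/[^[:alpha:]]+/, ' ').strip} "
  counts = Hash.new(0)
  normalized.chars.each_cons(3) { |gram| counts[gram.join] += 1 }
  counts
end

# Cosine similarity between two trigram-count hashes (0.0 if either is empty).
def cosine(a, b)
  dot = a.sum { |gram, n| n * b.fetch(gram, 0) }
  norm = ->(h) { Math.sqrt(h.values.sum { |n| n * n }) }
  denom = norm.call(a) * norm.call(b)
  denom.zero? ? 0.0 : dot / denom
end

# One trigram profile per language, built once from the samples.
PROFILES = SAMPLES.transform_values { |text| trigrams(text) }.freeze

# Hypothetical helper: returns the key of the most similar profile.
def detect_language(text)
  input = trigrams(text)
  PROFILES.max_by { |_lang, profile| cosine(input, profile) }.first
end
```

    With samples this small it only separates input that resembles the training text, which is also why extra training data tends to matter for the gems above.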
  • 2021-01-15 06:38

    A quick demo of WhatLanguage in Ruby:

    http://www.youtube.com/watch?v=lNqZ2cqOReo&list=UUJ_3fstMOH-g4yBxtvgAWkw&index=0&feature=plcp

  • 2021-01-15 06:39

    For anyone interested, I've found http://rubygems.org/gems/kenwaln-whatlanguage, which performs excellently.

  • 2021-01-15 06:41

    I'm using CLD, which I really like: it's succinct and easy to use. Give it a try.
