I've done some experimenting with Apache OpenNLP with jRuby. Its quite nice and solid but at the time I write this poorly documented. If you try OpenNLP I suggest you read following articles:
- Getting started with OpenNLP (Natural Language Processing)
- Mining Wikipedia with Hadoop and Pig for Natural Language Processing
- OpenNLP Tutorial
- An UIMA Sentence Annotator using OpenNLP
Documentation for OpenNLP can be found here.
This is code from my project where I do named entity recognition with OpenNLP. Its written in jRuby. OpenNLP models are stored in database because the code runs on Heroku and you can't write on file system there.
- Politiki Named Entity Recognition API for w/ OpenNLP, jRuby and Grape