wordnet | 易学教程

Arabic WordNet with not-formatted words

阅读更多关于 Arabic WordNet with not-formatted words

问题 Is it necessary for the word input to WordNet to be formatted like "التُّفَّاحْ" and can't expect "التفاح"... is there any library or service taking not-formatted Arabic word returning a list of all its possible synonyms. 回答1: From التُّفَّاحْ to التفاح , you simply want to remove the diacritics then you need a lexical normalization tool. Try Tashaphyne, download and install then use the normalize module http://pythonhosted.org/Tashaphyne/Tashaphyne.normalize-module.html : from Tashaphyne

Mapping Wordnet Senses to Verbnet

阅读更多关于 Mapping Wordnet Senses to Verbnet

问题 http://digital.library.unt.edu/ark:/67531/metadc30973/m2/1/high_res_d/Mihalcea-2005-Putting_Pieces_Together-Combining_FrameNet.pdf In the link above on the sixth page, the paper mentions that a mapping was made. "The process of mapping VerbNet to WordNet is thus semi-automatic. We first manually link all semantic constraints defined in VerbNet (there are 36 such constraints) to one or more nodes in the WordNet semantic hierarchy." I am trying to use this mapping on NLTK Python with Verbnet

lemmatize plural nouns using nltk and wordnet

阅读更多关于 lemmatize plural nouns using nltk and wordnet

问题 I want to lemmatize using from nltk import word_tokenize, sent_tokenize, pos_tag from nltk.stem.wordnet import WordNetLemmatizer from nltk.corpus import wordnet lmtzr = WordNetLemmatizer() POS = pos_tag(text) def get_wordnet_pos(treebank_tag): #maps pos tag so lemmatizer understands from nltk.corpus import wordnet if treebank_tag.startswith('J'): return wordnet.ADJ elif treebank_tag.startswith('V'): return wordnet.VERB elif treebank_tag.startswith('N'): return wordnet.NOUN elif treebank_tag

WordNet(JWI MIT) : How to find High Frequency words list?

阅读更多关于 WordNet(JWI MIT) : How to find High Frequency words list?

问题 Using JWI MIT interface libraries http://projects.csail.mit.edu/jwi/ , how can I find the list of most frequently used English words in daily life from WordNet api (http://wordnet.princeton.edu/)? Is there any way I can accomplish this if API initially does not provide this? Because initially API does not filter words on a level. 回答1: WordNet comes with usage word counts, but the man page describes them as unreliable and not updated since 2001: http://wordnet.princeton.edu/wordnet/man/cntlist

Find synonyms using JAWS in netbeans

阅读更多关于 Find synonyms using JAWS in netbeans

问题 import edu.smu.tspell.wordnet.NounSynset; import edu.smu.tspell.wordnet.Synset; import edu.smu.tspell.wordnet.SynsetType; import edu.smu.tspell.wordnet.WordNetDatabase; import javax.swing.JApplet; import javax.swing.JFrame; public class JavaApplication4 { String a[]=new String[2]; public static void main(String a[]) { String ar[]={"faith"}; int j=0; while(j<2) { System.setProperty("wordnet.database.dir", "C:\\ProgramFiles(x86)\\WordNet\\2.1\\dict"); NounSynset nounSynset; NounSynset[]

WordnetSynonymParser in Lucene

阅读更多关于 WordnetSynonymParser in Lucene

问题 I am new to Lucene and I'm trying to use WordnetSynonymParser to expand queries using the wordnet synonyms prolog. Here is what I have till now: public class CustomAnalyzer extends Analyzer { @Override protected TokenStreamComponents createComponents(String fieldName, Reader reader){ // TODO Auto-generated method stub Tokenizer source = new ClassicTokenizer(Version.LUCENE_47, reader); TokenStream filter = new StandardFilter(Version.LUCENE_47, source); filter = new LowerCaseFilter(Version

Get synonyms from synset returns error - Python

阅读更多关于 Get synonyms from synset returns error - Python

问题 I'm trying to get synonyms of a given word using Wordnet. The problem is that despite I'm doing the same as is written here: here, it returns error. Here is my code: from nltk.corpus import wordnet as wn import nltk dog = wn.synset('dog.n.01') print dog.lemma_names >>> <bound method Synset.lemma_names of Synset('dog.n.01')> for i,j in enumerate(wn.synsets('small')): print "Synonyms:", ", ".join(j.lemma_names) >>> Synonyms: Traceback (most recent call last): File "C:/Users/Python

How do I print out just the word itself in a WordNet synset using Python NLTK?

阅读更多关于 How do I print out just the word itself in a WordNet synset using Python NLTK?

问题 Is there a way in Python 2.7 using NLTK to just get the word and not the extra formatting that includes "synset" and the parentheses and the "n.01" etc? For instance if I do wn.synsets('dog') My results look like: [Synset('dog.n.01'), Synset('frump.n.01'), Synset('dog.n.03'), Synset('cad.n.01'), Synset('frank.n.02'), Synset('pawl.n.01'), Synset('andiron.n.01'), Synset('chase.v.01')] How can I instead get a list like this? dog frump cad frank pawl andiron chase Is there a way to do this using

How do I print out just the word itself in a WordNet synset using Python NLTK?

阅读更多关于 How do I print out just the word itself in a WordNet synset using Python NLTK?

What is the connection or difference between lemma and synset in wordnet?

阅读更多关于 What is the connection or difference between lemma and synset in wordnet?

问题 I am a complete beginner to NLP and NLTK. I was not able to understand the exact difference between lemmas and synsets in wordnet , because both are producing nearly the same output. for example for the word cake it produce this output. lemmas : [Lemma('cake.n.01.cake'), Lemma('patty.n.01.cake'), Lemma('cake.n.03.cake'), Lemma('coat.v.03.cake')] synsets : [Synset('cake.n.01'), Synset('patty.n.01'), Synset('cake.n.03'), Synset('coat.v.03')] please help me to understand this concept. Thank you.