WordNet

java.lang.Object
- com.articulate.sigma.wordNet.WordNet

All Implemented Interfaces:

java.io.Serializable
```
public class WordNet
extends java.lang.Object
implements java.io.Serializable
```
This program finds and displays SUMO terms that are related in meaning to the English expressions that are entered as input. Note that this program uses four WordNet data files, "NOUN.EXC", "VERB.EXC" etc, as well as four WordNet to SUMO mappings files called "WordNetMappings-nouns.txt", "WordNetMappings-verbs.txt" etc The main part of the program prompts the user for an English term and then returns associated SUMO concepts. The two primary public methods are initOnce() and page().

See Also:

Serialized Form

Field Summary

Fields
Modifier and Type	Field and Description
`static int`	`ADJECTIVE`
`static int`	`ADJECTIVE_SATELLITE`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`adjectiveDocumentationHash`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`adjectiveSUMOHash`
`java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>>`	`adjectiveSynsetHash`
`static int`	`ADVERB`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`adverbDocumentationHash`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`adverbSUMOHash`
`java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>>`	`adverbSynsetHash`
`static java.lang.String`	`baseDir`
`static java.io.File`	`baseDirFile`
`java.util.HashMap<java.lang.String,java.lang.String>`	`caseMap`
`static boolean`	`debug`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`exceptionNounHash` list of irregular plural forms where the key is the plural, singular is the value.
`java.util.Hashtable<java.lang.String,java.lang.String>`	`exceptionVerbHash`
`static boolean`	`initNeeded`
`java.lang.String`	`maxNounSynsetID`
`java.lang.String`	`maxVerbSynsetID`
`MultiWords`	`multiWords`
`static int`	`NOUN`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`nounDocumentationHash`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`nounSUMOHash`
`java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>>`	`nounSynsetHash`
`java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.String>>`	`OMW` A HashMap with language name keys and HashMap values.
`java.lang.String`	`origMaxNounSynsetID`
`java.lang.String`	`origMaxVerbSynsetID`
`java.util.Hashtable<java.lang.String,java.util.ArrayList<AVPair>>`	`relations` Keys are POS-prefixed synsets, values are ArrayList(s) of AVPair(s) in which the attribute is a pointer type according to http://wordnet.princeton.edu/man/wninput.5WN.html#sect3 and the value is a POS-prefixed synset @see WordNetUtilities.convertWordNetPointer
`java.util.HashMap<java.lang.String,java.lang.String>`	`reverseSenseIndex` A HashMap where the keys are 9 digit POS prefixed WordNet synset byte offsets, and the values are of the form word_POS_sensenum (alpha POS like "VB").
`java.util.HashMap<java.lang.String,java.lang.Integer>`	`senseFrequencies` a HashMap where the key is a 9-digit POS-prefixed sense and the value is a the number of times that sense occurs in the Brown corpus.
`java.util.HashMap<java.lang.String,java.lang.String>`	`senseIndex` A HashMap where the keys are of the form word_POS_sensenum (alpha POS like "VB") and values are 8 digit WordNet synset byte offsets.
`java.util.ArrayList<java.lang.String>`	`stopwords` English "stop words" such as "a", "at", "them", which have no or little inherent meaning when taken alone.
`java.util.Hashtable<java.lang.String,java.util.ArrayList<java.lang.String>>`	`SUMOHash` Keys are SUMO terms, values are ArrayLists(s) of POS-prefixed 9-digit synset String(s) meaning that the part of speech code is prepended to the synset number.
`java.util.Hashtable<java.lang.String,java.util.ArrayList<java.lang.String>>`	`synsetsToWords` Keys are String POS-prefixed synsets.
`static int`	`VERB`
`java.util.Hashtable<java.lang.String,java.lang.String>`	`verbDocumentationHash`
`java.util.HashMap<java.lang.String,java.util.ArrayList<java.lang.String>>`	`verbFrames` A HashMap where keys are 8 digit WordNet synset byte offsets or synsets appended with a dash and a specific word such as "12345678-foo".
`java.util.Hashtable<java.lang.String,java.lang.String>`	`verbSUMOHash`
`java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>>`	`verbSynsetHash`
`static WordNet`	`wn`
`static java.util.HashMap<java.lang.String,WordNet>`	`wns`
`java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>>`	`wordCoFrequencies` a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet.
`protected java.util.HashMap<java.lang.String,java.util.TreeSet<AVPair>>`	`wordFrequencies` a HashMap of HashMaps where the key is a word and the value is a HashMap of 9-digit POS-prefixed senses which is the value of the AVPair, and the number of times that sense occurs in the Brown corpus, which is the key of the AVPair
`java.util.HashMap<java.lang.String,java.util.ArrayList<java.lang.String>>`	`wordsToSenseKeys` A HashMap with words as keys and ArrayList as values.

Constructor Summary

Constructors
Constructor and Description

WordNet()

Constructors
Constructor and Description
`WordNet()`

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`void`	`addToWordFreq(java.lang.String word, AVPair avp)` Add an entry to the wordFrequencies list, checking whether it has a valid count and synset pair.
`static void`	`checkWordsToSenses()`
`java.util.HashMap<java.lang.String,java.lang.Integer>`	`collectCountedWordSenses(java.lang.String sentence)` Collect all the synsets that represent the best guess at meanings for all the words in a sentence.
`boolean`	`containsWord(java.lang.String word)` Does WordNet contain the given word.
`boolean`	`containsWord(java.lang.String word, int pos)` Does WordNet contain the given word.
`java.lang.String`	`displayByKey(java.lang.String sumokbname, java.lang.String key, java.lang.String params)`
`java.lang.String`	`displaySynset(java.lang.String sumokbname, java.lang.String synset, java.lang.String params)`
`java.lang.String`	`generateNounSynsetID()` Generate a new eight digit noun synset ID that doesn't have an existing hash
`java.lang.String`	`generateSynsetID(java.lang.String l)` Generate a new 8 digit synset ID that doesn't have an existing hash
`java.lang.String`	`generateVerbSynsetID()` Generate a new eight digit verb synset ID that doesn't have an existing hash
`java.lang.String`	`getDocumentation(java.lang.String synset)`
`static void`	`getEntailments()`
`MultiWords`	`getMultiWords()`
`java.util.TreeMap<java.lang.String,java.util.ArrayList<java.lang.String>>`	`getSenseKeysFromWord(java.lang.String word)` Get all the synsets for a given word.
`java.lang.String`	`getSUMOMapping(java.lang.String synset)` Get the SUMO mapping for a POS-prefixed synset
`java.io.File`	`getWnFile(java.lang.String key, java.lang.String override)` Returns the WordNet File object corresponding to key.
`java.util.ArrayList<java.lang.String>`	`getWordsFromSynset(java.lang.String synset)`
`java.util.TreeMap<java.lang.String,java.lang.String>`	`getWordsFromTerm(java.lang.String SUMOterm)` Get the words and synsets corresponding to a SUMO term.
`static void`	`initOnce()` Read the WordNet files only on initialization of the class.
`boolean`	`isFile(java.lang.String s)`
`boolean`	`isHyponym(java.lang.String synset, java.lang.String hypo)`
`boolean`	`isHyponymRecurse(java.lang.String synset, java.lang.String hypo, java.util.ArrayList<java.lang.String> visited)`
`boolean`	`isStopWord(java.lang.String word)` Check whether the word is a stop word
`static void`	`loadSerialized()` Load the most recently save serialized version.
`static void`	`main(java.lang.String[] args)` A main method, used only for testing.
`void`	`mergeWordCoFrequencies(java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>> senses)` Merge a new set of word co-occurrence statistics into the existing set.
`java.lang.String`	`nounRootForm(java.lang.String mixedCase, java.lang.String input)` Return the root form of the noun, or null if it's not in the lexicon.
`java.lang.String`	`nounSynsetFromTermFormat(java.lang.String tf, java.lang.String SUMOterm, KB kb)` Generate a new noun synset from a termFormat
`java.lang.String`	`page(java.lang.String inp, int pos, java.lang.String kbname, java.lang.String synset, java.lang.String params)` This is the regular point of entry for this class.
`protected boolean`	`processNounLine(java.lang.String line)`
`java.lang.String`	`processPrologString(java.lang.String doc)` Double any single quotes that appear.
`void`	`readSenseCount()` Read word sense frequencies into a HashMap of PriorityQueues containing AVPairs where the value is a word and the attribute (on which PriorityQueue is sorted) is an 8 digit String representation of an integer count.
`void`	`readSenseIndex(java.lang.String filename)` Note that WordNet forces all these words to lowercase in the index.xxx files
`void`	`readStopWords()`
`void`	`readWordCoFrequencies()` Return a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet.
`java.util.ArrayList<java.lang.String>`	`removeStopWords(java.util.ArrayList<java.lang.String> sentence)` Remove stop words from a sentence.
`java.lang.String`	`removeStopWords(java.lang.String sentence)` Remove stop words from a sentence.
`java.lang.String`	`senseKeyPOS(java.lang.String senseKey)`
`static void`	`serialize()` save serialized version.
`static boolean`	`serializedOld()` Check whether sources are newer than serialized version.
`protected void`	`setMaxNounSynsetID(java.lang.String synset)`
`protected void`	`setMaxVerbSynsetID(java.lang.String synset)`
`static java.util.ArrayList<java.lang.String>`	`splitToArrayList(java.lang.String st)` Return an ArrayList of the string split by spaces.
`static java.util.ArrayList<java.lang.String>`	`splitToArrayListSentence(java.lang.String st)` Return an ArrayList of the string split by periods.
`java.lang.String`	`sumoFileDisplay(java.lang.String pathname, java.lang.String counter, java.lang.String params)` A routine which takes a full pathname as input and returns a sentence by sentence display of sense and sentiment analysis
`java.lang.String`	`sumoSentenceDisplay(java.lang.String input, java.lang.String context, java.lang.String params)` A routine which looks up a given list of words in the hashtables to find the relevant word definitions and SUMO mappings.
`java.lang.String`	`sumoSentimentDisplay(java.lang.String sentence)` A routine that uses computeSentiment in DB.java to display a sentiment score for a single sentence as well as the individual scores of scored descriptors.
`void`	`synsetFromTermFormat(Formula form, java.lang.String tf, java.lang.String SUMOterm, KB kb)` Generate a new synset from a termFormat statement
`void`	`termFormatsToSynsets(KB kb)` Generate a new synset from a termFormat
`static void`	`testProcessPointers()` A method used only for testing.
`static void`	`testWordFreq()` A method used only for testing.
`java.lang.String`	`verbRootForm(java.lang.String mixedCase, java.lang.String input)` Return the present tense singular form of the verb, or null if it's not in the lexicon.
`java.lang.String`	`verbSynsetFromTermFormat(java.lang.String tf, java.lang.String SUMOterm, KB kb)` Generate a new verb synset from a termFormat
`void`	`writeProlog(KB kb)`
`static void`	`writeWordCoFrequencies(java.lang.String fname, java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>> senses)` Write a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet.
`void`	`writeWordNetG()`
`void`	`writeWordNetHyp()`
`void`	`writeWordNetProlog()`
`void`	`writeWordNetS()` Write WordNet data to a prolog file with a single kind of clause in the following format: s(Synset_ID, Word_No_in_the_Synset, Word, SS_Type, Synset_Rank_By_the_Word,Tag_Count)
`void`	`writeXML()`

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

- Field Detail
  - debug
```
public static boolean debug
```
  - wn
```
public static WordNet wn
```
  - wns
```
public static java.util.HashMap<java.lang.String,WordNet> wns
```
  - baseDir
```
public static java.lang.String baseDir
```
  - baseDirFile
```
public static java.io.File baseDirFile
```
  - initNeeded
```
public static boolean initNeeded
```
  - nounSynsetHash
```
public java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>> nounSynsetHash
```
  - verbSynsetHash
```
public java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>> verbSynsetHash
```
  - adjectiveSynsetHash
```
public java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>> adjectiveSynsetHash
```
  - adverbSynsetHash
```
public java.util.HashMap<java.lang.String,java.util.HashSet<java.lang.String>> adverbSynsetHash
```
  - verbDocumentationHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> verbDocumentationHash
```
  - adjectiveDocumentationHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> adjectiveDocumentationHash
```
  - adverbDocumentationHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> adverbDocumentationHash
```
  - nounDocumentationHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> nounDocumentationHash
```
  - nounSUMOHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> nounSUMOHash
```
  - verbSUMOHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> verbSUMOHash
```
  - adjectiveSUMOHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> adjectiveSUMOHash
```
  - adverbSUMOHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> adverbSUMOHash
```
  - maxNounSynsetID
```
public java.lang.String maxNounSynsetID
```
  - maxVerbSynsetID
```
public java.lang.String maxVerbSynsetID
```
  - origMaxNounSynsetID
```
public java.lang.String origMaxNounSynsetID
```
  - origMaxVerbSynsetID
```
public java.lang.String origMaxVerbSynsetID
```
  - SUMOHash
```
public java.util.Hashtable<java.lang.String,java.util.ArrayList<java.lang.String>> SUMOHash
```
    Keys are SUMO terms, values are ArrayLists(s) of POS-prefixed 9-digit synset String(s) meaning that the part of speech code is prepended to the synset number.
  - synsetsToWords
```
public java.util.Hashtable<java.lang.String,java.util.ArrayList<java.lang.String>> synsetsToWords
```
    Keys are String POS-prefixed synsets. Values are ArrayList(s) of String(s) which are words. Note that the order of words in the file is preserved.
  - exceptionNounHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> exceptionNounHash
```
    list of irregular plural forms where the key is the plural, singular is the value.
  - exceptionVerbHash
```
public java.util.Hashtable<java.lang.String,java.lang.String> exceptionVerbHash
```
  - relations
```
public java.util.Hashtable<java.lang.String,java.util.ArrayList<AVPair>> relations
```
    Keys are POS-prefixed synsets, values are ArrayList(s) of AVPair(s) in which the attribute is a pointer type according to http://wordnet.princeton.edu/man/wninput.5WN.html#sect3 and the value is a POS-prefixed synset @see WordNetUtilities.convertWordNetPointer
  - wordCoFrequencies
```
public java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>> wordCoFrequencies
```
    a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet. The value is a HashMap of words and the number of times that word cooccurs in sentences with the word sense given in the key.
  - wordFrequencies
```
protected java.util.HashMap<java.lang.String,java.util.TreeSet<AVPair>> wordFrequencies
```
    a HashMap of HashMaps where the key is a word and the value is a HashMap of 9-digit POS-prefixed senses which is the value of the AVPair, and the number of times that sense occurs in the Brown corpus, which is the key of the AVPair
  - caseMap
```
public java.util.HashMap<java.lang.String,java.lang.String> caseMap
```
  - senseFrequencies
```
public java.util.HashMap<java.lang.String,java.lang.Integer> senseFrequencies
```
    a HashMap where the key is a 9-digit POS-prefixed sense and the value is a the number of times that sense occurs in the Brown corpus.
  - stopwords
```
public java.util.ArrayList<java.lang.String> stopwords
```
    English "stop words" such as "a", "at", "them", which have no or little inherent meaning when taken alone.
  - senseIndex
```
public java.util.HashMap<java.lang.String,java.lang.String> senseIndex
```
    A HashMap where the keys are of the form word_POS_sensenum (alpha POS like "VB") and values are 8 digit WordNet synset byte offsets. Note that all words are from index.sense, which reduces all words to lower case
  - reverseSenseIndex
```
public java.util.HashMap<java.lang.String,java.lang.String> reverseSenseIndex
```
    A HashMap where the keys are 9 digit POS prefixed WordNet synset byte offsets, and the values are of the form word_POS_sensenum (alpha POS like "VB"). Note that all words are from index.sense, which reduces all words to lower case
  - verbFrames
```
public java.util.HashMap<java.lang.String,java.util.ArrayList<java.lang.String>> verbFrames
```
    A HashMap where keys are 8 digit WordNet synset byte offsets or synsets appended with a dash and a specific word such as "12345678-foo". Values are ArrayList(s) of String verb frame numbers.
  - wordsToSenseKeys
```
public java.util.HashMap<java.lang.String,java.util.ArrayList<java.lang.String>> wordsToSenseKeys
```
    A HashMap with words as keys and ArrayList as values. The ArrayList contains word senses which are Strings of the form word_POS_num (alpha POS like "VB") signifying the word, part of speech and number of the sense in WordNet. Note that all words are from index.sense, which reduces all words to lower case
  - multiWords
```
public MultiWords multiWords
```
  - NOUN
```
public static final int NOUN
```
    See Also:
    
    Constant Field Values
  - VERB
```
public static final int VERB
```
    See Also:
    
    Constant Field Values
  - ADJECTIVE
```
public static final int ADJECTIVE
```
    See Also:
    
    Constant Field Values
  - ADVERB
```
public static final int ADVERB
```
    See Also:
    
    Constant Field Values
  - ADJECTIVE_SATELLITE
```
public static final int ADJECTIVE_SATELLITE
```
    See Also:
    
    Constant Field Values
  - OMW
```
public java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.String>> OMW
```
    A HashMap with language name keys and HashMap values. The interior HashMap has String keys which are PWN30 synsets with 8-digit synsets a dash and then a alphabetic part of speech character. Values are words in the target language.
- Constructor Detail
  - WordNet
```
public WordNet()
```
- Method Detail
  - getMultiWords
```
public MultiWords getMultiWords()
```
  - getWnFile
```
public java.io.File getWnFile(java.lang.String key,
                              java.lang.String override)
```
    Returns the WordNet File object corresponding to key.
    
    Parameters:
    
    key - A descriptive literal String that maps to a regular expression pattern used to obtain a WordNet file.
    
    Returns:
    
    A File object
  - splitToArrayList
```
public static java.util.ArrayList<java.lang.String> splitToArrayList(java.lang.String st)
```
    Return an ArrayList of the string split by spaces.
  - splitToArrayListSentence
```
public static java.util.ArrayList<java.lang.String> splitToArrayListSentence(java.lang.String st)
```
    Return an ArrayList of the string split by periods.
  - getSUMOMapping
```
public java.lang.String getSUMOMapping(java.lang.String synset)
```
    Get the SUMO mapping for a POS-prefixed synset
  - setMaxNounSynsetID
```
protected void setMaxNounSynsetID(java.lang.String synset)
```
  - setMaxVerbSynsetID
```
protected void setMaxVerbSynsetID(java.lang.String synset)
```
  - processNounLine
```
protected boolean processNounLine(java.lang.String line)
```
  - mergeWordCoFrequencies
```
public void mergeWordCoFrequencies(java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>> senses)
```
    Merge a new set of word co-occurrence statistics into the existing set.
  - writeWordCoFrequencies
```
public static void writeWordCoFrequencies(java.lang.String fname,
                                          java.util.HashMap<java.lang.String,java.util.HashMap<java.lang.String,java.lang.Integer>> senses)
```
    Write a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet. The value is a HashMap of words and the number of times that word cooccurs in sentences with the word sense given in the key.
  - readWordCoFrequencies
```
public void readWordCoFrequencies()
```
    Return a HashMap of HashMaps where the key is a word sense of the form word_POS_num signifying the word, part of speech and number of the sense in WordNet. The value is a HashMap of words and the number of times that word cooccurs in sentences with the word sense given in the key.
  - readStopWords
```
public void readStopWords()
```
  - readSenseIndex
```
public void readSenseIndex(java.lang.String filename)
```
    Note that WordNet forces all these words to lowercase in the index.xxx files
  - readSenseCount
```
public void readSenseCount()
```
    Read word sense frequencies into a HashMap of PriorityQueues containing AVPairs where the value is a word and the attribute (on which PriorityQueue is sorted) is an 8 digit String representation of an integer count.
  - addToWordFreq
```
public void addToWordFreq(java.lang.String word,
                          AVPair avp)
```
    Add an entry to the wordFrequencies list, checking whether it has a valid count and synset pair.
  - sumoSentenceDisplay
```
public java.lang.String sumoSentenceDisplay(java.lang.String input,
                                            java.lang.String context,
                                            java.lang.String params)
```
    A routine which looks up a given list of words in the hashtables to find the relevant word definitions and SUMO mappings.
    
    Parameters:
    
    input - is the target sentence to be parsed. See WordSenseBody.jsp for usage.
    
    context - is the larger context of the sentence. Can mean more accurate results.
    
    params - is the set of html parameters
  - sumoSentimentDisplay
```
public java.lang.String sumoSentimentDisplay(java.lang.String sentence)
```
    A routine that uses computeSentiment in DB.java to display a sentiment score for a single sentence as well as the individual scores of scored descriptors.
    
    Parameters:
    
    sentence - is the target sentence to be scored. See WordSenseBody.jsp for usage.
  - sumoFileDisplay
```
public java.lang.String sumoFileDisplay(java.lang.String pathname,
                                        java.lang.String counter,
                                        java.lang.String params)
```
    A routine which takes a full pathname as input and returns a sentence by sentence display of sense and sentiment analysis
    
    Parameters:
    
    pathname -
    
    counter - is used to keep track of which sentence is being displayed
    
    params - is the set of html parameters
  - isFile
```
public boolean isFile(java.lang.String s)
```
    Returns:
    
    true if the input String is a file pathname. Determined by whether the string contains a forward or backward slash. This is only used in WordSense.jsp and will fail if a sentence that is not a file contains a forward or back slash.
  - isHyponymRecurse
```
public boolean isHyponymRecurse(java.lang.String synset,
                                java.lang.String hypo,
                                java.util.ArrayList<java.lang.String> visited)
```
    Returns:
    
    true if the first POS-prefixed synset is a hyponym of the second POS-prefixed synset. This is a recursive method.
  - isHyponym
```
public boolean isHyponym(java.lang.String synset,
                         java.lang.String hypo)
```
    Returns:
    
    true if the first POS-prefixed synset is a hyponym of the second POS-prefixed synset. This is a recursive method.
  - removeStopWords
```
public java.lang.String removeStopWords(java.lang.String sentence)
```
    Remove stop words from a sentence.
  - removeStopWords
```
public java.util.ArrayList<java.lang.String> removeStopWords(java.util.ArrayList<java.lang.String> sentence)
```
    Remove stop words from a sentence.
  - isStopWord
```
public boolean isStopWord(java.lang.String word)
```
    Check whether the word is a stop word
  - collectCountedWordSenses
```
public java.util.HashMap<java.lang.String,java.lang.Integer> collectCountedWordSenses(java.lang.String sentence)
```
    Collect all the synsets that represent the best guess at meanings for all the words in a sentence. Keep track of how many times each sense appears.
  - serializedOld
```
public static boolean serializedOld()
```
    Check whether sources are newer than serialized version.
  - loadSerialized
```
public static void loadSerialized()
```
    Load the most recently save serialized version.
  - serialize
```
public static void serialize()
```
    save serialized version.
  - initOnce
```
public static void initOnce()
```
    Read the WordNet files only on initialization of the class.
  - nounRootForm
```
public java.lang.String nounRootForm(java.lang.String mixedCase,
                                     java.lang.String input)
```
    Return the root form of the noun, or null if it's not in the lexicon.
  - verbRootForm
```
public java.lang.String verbRootForm(java.lang.String mixedCase,
                                     java.lang.String input)
```
    Return the present tense singular form of the verb, or null if it's not in the lexicon.
  - getSenseKeysFromWord
```
public java.util.TreeMap<java.lang.String,java.util.ArrayList<java.lang.String>> getSenseKeysFromWord(java.lang.String word)
```
    Get all the synsets for a given word.
    
    Returns:
    
    a TreeMap of sense keys in the form of word_POS_num and values that are ArrayLists of synset Strings
  - getWordsFromTerm
```
public java.util.TreeMap<java.lang.String,java.lang.String> getWordsFromTerm(java.lang.String SUMOterm)
```
    Get the words and synsets corresponding to a SUMO term. The return is a Map of words with their corresponding synset number.
  - getWordsFromSynset
```
public java.util.ArrayList<java.lang.String> getWordsFromSynset(java.lang.String synset)
```
  - containsWord
```
public boolean containsWord(java.lang.String word,
                            int pos)
```
    Does WordNet contain the given word.
  - containsWord
```
public boolean containsWord(java.lang.String word)
```
    Does WordNet contain the given word.
  - page
```
public java.lang.String page(java.lang.String inp,
                             int pos,
                             java.lang.String kbname,
                             java.lang.String synset,
                             java.lang.String params)
```
    This is the regular point of entry for this class. It takes the word the user is searching for, and the part of speech index, does the search, and returns the string with HTML formatting codes to present to the user. The part of speech codes must be the same as in the menu options in WordNet.jsp and Browse.jsp
    
    Parameters:
    
    inp - The string the user is searching for.
    
    pos - The part of speech of the word 1=noun, 2=verb, 3=adjective, 4=adverb
    
    Returns:
    
    A string contained the HTML formatted search result.
  - getDocumentation
```
public java.lang.String getDocumentation(java.lang.String synset)
```
    Parameters:
    
    synset - is a synset with POS-prefix
  - displaySynset
```
public java.lang.String displaySynset(java.lang.String sumokbname,
                                      java.lang.String synset,
                                      java.lang.String params)
```
    Parameters:
    
    synset - is a synset with POS-prefix
  - displayByKey
```
public java.lang.String displayByKey(java.lang.String sumokbname,
                                     java.lang.String key,
                                     java.lang.String params)
```
    Parameters:
    
    key - is a WordNet sense key
    
    Returns:
    
    9-digit POS-prefix and synset number
  - writeXML
```
public void writeXML()
```
  - writeProlog
```
public void writeProlog(KB kb)
```
  - senseKeyPOS
```
public java.lang.String senseKeyPOS(java.lang.String senseKey)
```
  - writeWordNetS
```
public void writeWordNetS()
```
    Write WordNet data to a prolog file with a single kind of clause in the following format: s(Synset_ID, Word_No_in_the_Synset, Word, SS_Type, Synset_Rank_By_the_Word,Tag_Count)
  - writeWordNetHyp
```
public void writeWordNetHyp()
```
  - processPrologString
```
public java.lang.String processPrologString(java.lang.String doc)
```
    Double any single quotes that appear.
  - writeWordNetG
```
public void writeWordNetG()
```
  - writeWordNetProlog
```
public void writeWordNetProlog()
                        throws java.io.IOException
```
    Throws:
    
    java.io.IOException
  - generateSynsetID
```
public java.lang.String generateSynsetID(java.lang.String l)
```
    Generate a new 8 digit synset ID that doesn't have an existing hash
  - generateNounSynsetID
```
public java.lang.String generateNounSynsetID()
```
    Generate a new eight digit noun synset ID that doesn't have an existing hash
  - generateVerbSynsetID
```
public java.lang.String generateVerbSynsetID()
```
    Generate a new eight digit verb synset ID that doesn't have an existing hash
  - nounSynsetFromTermFormat
```
public java.lang.String nounSynsetFromTermFormat(java.lang.String tf,
                                                 java.lang.String SUMOterm,
                                                 KB kb)
```
    Generate a new noun synset from a termFormat
  - verbSynsetFromTermFormat
```
public java.lang.String verbSynsetFromTermFormat(java.lang.String tf,
                                                 java.lang.String SUMOterm,
                                                 KB kb)
```
    Generate a new verb synset from a termFormat
  - synsetFromTermFormat
```
public void synsetFromTermFormat(Formula form,
                                 java.lang.String tf,
                                 java.lang.String SUMOterm,
                                 KB kb)
```
    Generate a new synset from a termFormat statement
    
    Parameters:
    
    form - is the entire termFormat statement
    
    tf - is the lexical item (word). note that in the case of a multi-word lexical item it should already have had spaces replaced by underscores
    
    SUMOterm - is the SUMO term that the lexical item is mapped to
  - termFormatsToSynsets
```
public void termFormatsToSynsets(KB kb)
```
    Generate a new synset from a termFormat
  - testWordFreq
```
public static void testWordFreq()
```
    A method used only for testing. It should not be called during normal operation.
  - testProcessPointers
```
public static void testProcessPointers()
```
    A method used only for testing. It should not be called during normal operation.
  - checkWordsToSenses
```
public static void checkWordsToSenses()
```
  - getEntailments
```
public static void getEntailments()
```
  - main
```
public static void main(java.lang.String[] args)
```
    A main method, used only for testing. It should not be called during normal operation.

Class WordNet

Field Summary

Constructor Summary

Method Summary

Methods inherited from class java.lang.Object

Field Detail

debug

wn

wns

baseDir

baseDirFile

initNeeded

nounSynsetHash

verbSynsetHash

adjectiveSynsetHash

adverbSynsetHash

verbDocumentationHash

adjectiveDocumentationHash

adverbDocumentationHash

nounDocumentationHash

nounSUMOHash

verbSUMOHash

adjectiveSUMOHash

adverbSUMOHash

maxNounSynsetID

maxVerbSynsetID

origMaxNounSynsetID

origMaxVerbSynsetID

SUMOHash

synsetsToWords

exceptionNounHash

exceptionVerbHash

relations

wordCoFrequencies

wordFrequencies

caseMap

senseFrequencies

stopwords

senseIndex

reverseSenseIndex

verbFrames

wordsToSenseKeys

multiWords

NOUN

VERB

ADJECTIVE

ADVERB

ADJECTIVE_SATELLITE

OMW

Constructor Detail

WordNet

Method Detail

getMultiWords

getWnFile

splitToArrayList

splitToArrayListSentence

getSUMOMapping

setMaxNounSynsetID

setMaxVerbSynsetID

processNounLine

mergeWordCoFrequencies

writeWordCoFrequencies

readWordCoFrequencies

readStopWords

readSenseIndex

readSenseCount

addToWordFreq

sumoSentenceDisplay

sumoSentimentDisplay

sumoFileDisplay

isFile

isHyponymRecurse

isHyponym

removeStopWords

removeStopWords

isStopWord

collectCountedWordSenses

serializedOld

loadSerialized

serialize