Systems and methods for word recognition
First Claim
1. A method for ascertaining an ambiguous word that occurs in a data string which includes at least one context word, comprising the steps ofgenerating a list of one or more choice words to represent choices for the ambiguous word,providing a reference source having one or more passages, each passage including a relevance indicator and a series of words associated with the relevance indicator,selecting as a function of one of said choice words and said relevance indicators, at least one of said passages from said reference source,generating one or more correlation signals, as a function of said one or more selected passages, and at least one of said context words, to represent the likelihood that said choice word is a match for the ambiguous word, andselecting, as a function of said one or more correlation signals, at least one of said choice words to represent the ambiguous word being identified.
9 Assignments
0 Petitions
Accused Products
Abstract
In one aspect, the invention provides word recognition systems that operate to recognize an unrecognized or ambiguous word that occurs within a passage of words. The system can offer several words as choice words for inserting into the passage to replace the unrecognized word. The system can select the best choice word by using the choice word to extract from a reference source, sample passages of text that relate to the choice word. For example, the system can select the dictionary passage that defines the choice word. The system then compares the selected passage to the current passage, and generates a score that indicates the likelihood that the choice word would occur within that passage of text. The system can select the choice word with the best score to substitute into the passage. The passage of words being analyzed can be any word sequence including an utterance, a portion of handwritten text, a portion of typewritten text or other such sequence of words, numbers and characters. Alternative embodiments of the present invention are disclosed which function to retrieve documents from a library as a function of context.
404 Citations
43 Claims
-
1. A method for ascertaining an ambiguous word that occurs in a data string which includes at least one context word, comprising the steps of
generating a list of one or more choice words to represent choices for the ambiguous word, providing a reference source having one or more passages, each passage including a relevance indicator and a series of words associated with the relevance indicator, selecting as a function of one of said choice words and said relevance indicators, at least one of said passages from said reference source, generating one or more correlation signals, as a function of said one or more selected passages, and at least one of said context words, to represent the likelihood that said choice word is a match for the ambiguous word, and selecting, as a function of said one or more correlation signals, at least one of said choice words to represent the ambiguous word being identified.
-
13. A method for ascertaining an ambiguous word that occurs in a data string that includes one or more context words, comprising the steps of
generating a list of one or more choice words to represent choices for the ambiguous word, providing a reference source having one or more passages each of which includes a relevance indicator and a series of passage words associated with the relevance indicator, selecting as a function of each of said relevance indicators and said context words, at least one passage from said reference source for at least one of said context words, generating a correlation signal for each of said choice words as a function of said passages and said choice words, to represent the likelihood that a respective choice word is a match for the ambiguous word, and selecting as a function of said correlation signals, at least one of said choice words to represent said ambiguous word.
-
24. A method for ascertaining an ambiguous word that occurs in a data string which includes one or more context words, comprising the steps of
generating a list of one or more choice words to represent choices for said ambiguous word, providing a reference source having one or more passages, each passage including a relevance indicator and a series of passage words associated with the relevance indicator, comparing a respective one of said choice words with each said relevance indicator to select one or more of said passages from said reference source. comparing one of said context words with each said relevance indicator to select one or more of said passages from said reference source, and generating a correlation signal for each of said respective choice words by correlating said passages of said context words with said passages of said respective choice words, to represent the likelihood that said choice word is a match for the ambiguous word.
-
35. A method for ascertaining an ambiguous phrase that occurs in a data string which includes at least one context word, comprising the steps of
generating a list of one or more choice phrases, each of which represents one or more words, to represent choices for the ambiguous phrase, providing a reference source having one or more passages, each passage including a relevance indicator and a series of words associated with the relevance indicator, selecting as a function of one or said words in said choice phrase and each said relevance indicator, at least one of said passages from said reference source, generating a correlation signal, as a function of said selected passage, and at least one of said context words, to represent the likelihood that said choice phrase is a match for the ambiguous phrase, and selecting, as a function of said correlation signal, at least one of said choice phrases to represent the ambiguous phrase being identified.
-
39. Apparatus for recognizing an ambiguous word that occurs in a data string which includes at least one context word, comprising
means for providing a list of one or more choice words to represent choices for said ambiguous word, means for selecting a choice word and for accessing a reference source having one or more passages, each passage including a relevance indicator and a series of words associated with the relevance indicator, to provide a list of passage words having a known association with said selected choice word, means for generating a correlation signal for each of said choice words as a function of said passage words, to represent the likelihood that said choice word is substantially similar to said ambiguous word, and means for selecting, as a function of said correlation signal, at least one of said choice words to represent said ambiguous word.
-
42. In an apparatus that employs context for recognizing an ambiguous word that occurs in a data string which includes one or more context words, the improvement comprising
means for selecting a list of one or more choice words to represent choices for said ambiguous word, means for accessing a reference source, having one or more each passage including a relevance indicator and a series of words associated with the relevance indicator, to provide a list of passage words having a known association with said selected word, and means for selecting, as a function of each relevance indicator, at least one of said choice words to represent said ambiguous word.
-
43. Apparatus for recognizing an ambiguous word that occurs in a data string which includes at least one context word, comprising
means for providing a list of one or more choice words to represent choices for said ambiguous word, means for selecting said context word and for accessing a reference source, having one or more passages, each passage including a relevance indicator and a series of words associated with the relevance indicator, to provide a list of passage words having a known association with said selected context word, means for generating a correlation signal for each of said choice words as a function of said passage words, to represent the likelihood that said choice word is substantially similar to said ambiguous word, and means for selecting, as a function of said correlation signal, at least one of said choice words to represent said ambiguous word.
Specification