Sentence reconstruction using word ambiguity resolution
First Claim
1. A sentence reconstruction method, for resolving word ambiguities in a selected language sentence structure entered using single stroke activation of a key set including text entry keys each representing a group of letters, comprising the steps of:
- (a) providing first key stroke indicia each ambiguously representing a letter of a group of letters and second key stroke indicia including indicia representing spaces between words;
(b) partitioning, by use of said second key stroke indicia, said first key stroke indicia into a sequence of word positions, each word position comprising a code block represented by at least one of said first key stroke indicia;
(c) accessing a database including a word list to identify for an individual word position a word group including alternative word choices formable from the letter groups represented by the code block for said word position;
(d) repeating step (c) for said sequence of word positions to identify a corresponding word group including at least one word choice for each of a plurality of word positions;
(e) utilizing a stored word use rule set representative of relative frequency of particular word usage in said selected language to derive, for the word group for one of said word positions, probability values for word choices for said word position;
(f) utilizing a stored language rule set representative of usage in said selected language to derive probability values for a sequencing of individual word choices for said word position relative to at least one word choice for an adjacent word position in said sentence structure, said language rule set including rules in both of the following categories (i) rules based on transitional probability of use of particular word sequences, and (ii) rules based on probability of relative positioning of words of particular word categories in a sentence structure;
(g) repeating steps (e) and (f) for any additional word positions having an associated word group including a plurality of alternative word choices; and
(h) selecting, by use of said probability values derived in steps (e) and (f), one word from each said word group for inclusion at a respective word position in a reconstructed sentence structure.
1 Assignment
0 Petitions
Accused Products
Abstract
Single stroke text entry via a telephone keypad is subject to ambiguities resulting from the overloading of three letters on one key. After text is entered the three letter per position code block represented by the key strokes for a word position is hashed to all matches in a stored dictionary type word list. The resulting word group of alternative word choices for that word position is subjected to probability analysis. Probabilities of usage of specific words are evaluated based on frequency of usage in the selected language, such as English. Syntax type probabilities of word sequencing are evaluated through (i) rules based on transitional probability of use of two particular words in sequence in English usage and (ii) rules based on probability of relative positioning of words of particular word categories (e.g., nouns and adjectives) in a sentence structure in English usage. A word trellis or lattice represents choice paths for alternative sentence structures. By selecting the path with the highest probability values, highly accurate sentence reconstruction is provided. Communication with hearing impaired persons via any telephone keypad is facilitated by described systems and methods also applicable to a variety of systems wherein computer stored text is subject to ambiguities as to intended words.
252 Citations
15 Claims
-
1. A sentence reconstruction method, for resolving word ambiguities in a selected language sentence structure entered using single stroke activation of a key set including text entry keys each representing a group of letters, comprising the steps of:
-
(a) providing first key stroke indicia each ambiguously representing a letter of a group of letters and second key stroke indicia including indicia representing spaces between words; (b) partitioning, by use of said second key stroke indicia, said first key stroke indicia into a sequence of word positions, each word position comprising a code block represented by at least one of said first key stroke indicia; (c) accessing a database including a word list to identify for an individual word position a word group including alternative word choices formable from the letter groups represented by the code block for said word position; (d) repeating step (c) for said sequence of word positions to identify a corresponding word group including at least one word choice for each of a plurality of word positions; (e) utilizing a stored word use rule set representative of relative frequency of particular word usage in said selected language to derive, for the word group for one of said word positions, probability values for word choices for said word position; (f) utilizing a stored language rule set representative of usage in said selected language to derive probability values for a sequencing of individual word choices for said word position relative to at least one word choice for an adjacent word position in said sentence structure, said language rule set including rules in both of the following categories (i) rules based on transitional probability of use of particular word sequences, and (ii) rules based on probability of relative positioning of words of particular word categories in a sentence structure; (g) repeating steps (e) and (f) for any additional word positions having an associated word group including a plurality of alternative word choices; and (h) selecting, by use of said probability values derived in steps (e) and (f), one word from each said word group for inclusion at a respective word position in a reconstructed sentence structure. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A sentence reconstruction method wherein text in a selected language is entered by key stokes each ambiguously representing one letter of a group of letters, key stroke indicia are partitioned into a sequence of word positions each represented by a code block, and a code block is used to identify a word group of one or more word choices formable from letter groups represented by the respective code block, to thereby provide a sequence of word groups representing an input sentence structure, said method characterized by the steps of:
-
(a) utilizing a stored word use rule set representative of relative frequency of particular word usage in said selected language to derive, for the word group for one of said word positions, values for word choices for said word position; (b) utilizing a stored language rule set representative of usage in said selected language to derive probability values for sequencing of individual word choices for said word position relative to at least one word choice for an adjacent word position in said sentence structure, said language rule set including rules in both of the following categories (i) rules based on transitional probability of use of particular word sequences, and (ii) rules based on probability of relative positioning of words of particular word categories in a sentence structure; and (c) selecting, by use of said probability values derived in steps (a) and (b), one word from each said word group for inclusion at a respective word position in a reconstructed sentence structure. - View Dependent Claims (11)
-
-
12. A sentence reconstruction system to resolve word ambiguities in a selected language sentence structure comprising:
-
an input terminal for coupling of key activation indicia from a key set including keys each representing a group of letters; a memory unit arranged to store a sentence structure having a sequence of word positions each comprising a code block represented by at least one indicia ambiguously representing one letter of a group of letters; a word list of words of said selected language; a word use rule set representative of frequency of particular word usage in said selected language; and a language rule set including rules in both of the following categories (i) rules based on transitional probability of use of particular word sequences, and (ii) rules based on probability of relative positioning of particular word categories in a sentence structure; a processor arranged to (i) use said word list to identify, for said sequence of word positions, word groups including alternative word choices formable from letter groups represented by said indicia, (ii) use said word use rule set to derive probability values for word choices for the word group for each said word position, and (iii) use both categories of rules of said language rule set to derive probability values for sequencing of individual word choices for individual word positions relative to at least one word choice for an adjacent word position in said sentence structure, and to select, by use of said probability values, one word from each said word group for inclusion at a respective word position in a reconstructed sentence structure; and an output device arranged to provide a representation of said reconstructed sentence structure. - View Dependent Claims (13, 14, 15)
-
Specification