Electronic spelling machine having ordered candidate words
First Claim
1. In an electronic machine spelling corrector to provide an output list of candidate words in response to an input query word, the method of merging a list of phonetic correction candidate words with a list of typographic correction candidate words to provide a single merged output list for presentation to a user, comprising:
- for each candidate word on the phonetic list, providing a first set of similarity values including a first value P1 vased on a phonetic comparison of the candidate word with the query word and a second value A1 based on a typographic comaprison of the candidate word with the query word,for each candidate word on the typographic list, providing a second set of similarity values including a third value P2 based on a phonetic comparison of the candidate word with the query word and a fourth value A2 based on a typographic comparison of the candidate word with the query word,merging candidate words on the phonetic list and typographic list into the output list, andordering said candidate words on said merged list as a partial function of said four similarity values.
11 Assignments
0 Petitions
Accused Products
Abstract
A spelling corrector develops a phonetic correction candidate list and a typographic correction candidate list. These lists are merged using similarity values that measure the degree of similarity between the query (input) word and each word on the candidate lists. The similarity values include one based on phonetic comparison one based on typographic (ASCII, for example) comparison, one based on a vowel and consonant typographic pattern comparison and one based on a consonant phonetic comparison.
58 Citations
11 Claims
-
1. In an electronic machine spelling corrector to provide an output list of candidate words in response to an input query word, the method of merging a list of phonetic correction candidate words with a list of typographic correction candidate words to provide a single merged output list for presentation to a user, comprising:
-
for each candidate word on the phonetic list, providing a first set of similarity values including a first value P1 vased on a phonetic comparison of the candidate word with the query word and a second value A1 based on a typographic comaprison of the candidate word with the query word, for each candidate word on the typographic list, providing a second set of similarity values including a third value P2 based on a phonetic comparison of the candidate word with the query word and a fourth value A2 based on a typographic comparison of the candidate word with the query word, merging candidate words on the phonetic list and typographic list into the output list, and ordering said candidate words on said merged list as a partial function of said four similarity values. - View Dependent Claims (2, 3, 4, 5)
-
-
6. In an electronic machine spelling corrector which provides an output list of candidate words in response to an input query word, the output list being the merger of a list of phonetic correction candidate words and a list of typographic correction candidate words comprising:
-
a first class of words comprising (a) candidate words from said phonetic list wherein a phonetic comparison of the candidate word with the query word and a typographic comaprison of the candidate word with the query word indicates a perfect match, and (b) candidate words from said typographic list wherein a phonetic comparison of the candidate word with the query word and a typographic comparison of the candidate word with the query word indicates a perfect match, a second class of words excluding words in said first class comprisign (a) candidate words from said phonetic list whereni a phonetic comparison of the candidate word with the query word and an essence comparison of the candidate word with the query word indicates a perfect match, and (b) candidate words from said typographic list which have a vowel transposition compared to the query word, a third class of words excluding words in said first and second classes and comprising (a) words from said phonetic list wherein (i) a phonetic comparison of the candidate word with the query word and an essence comparison of the candidate word with the query word exceed a pre-determined threshold and (ii) the first and last phonetic codes of the candidate word and the query word are the same, and (b) candidate words from said typographic list which have a vowel substitution compared to the query word or which have a vowel addition or a vowel deletion adjacent to another vowel in the query word, and at least a fourth class of words excluding words from said first, second and third classes and comprising those candidate words from said phonetic list whose phonetic comparison with the query word exceeds a predetermined threshold and those candidate words from said typographic list whose typographic comparison with the query word exceeds a predetermined threshold. - View Dependent Claims (7, 8)
-
-
9. In an electronic machine spelling corrector, the output list of candidate words in response to an input query word, the output list being the merger of a list of phonetic correction candidate words and a list of typographic correction candidate words, comprising:
-
a first set of candidate words comprisign candidate words from said phonetic list and said typographic list where the phonetic code and typographic code for the candidate word are substantially identical to the phonetic code and typographic code for the query word, a second set of candidate words comprising; (a) candidate words from the phonetic list where the phonetic code and the essence code for the candidate words are substantially identical to the phonetic code and the essence code for the query word, and (b) words from said typographical list where the only difference between the candidate word and the query word is a vowel transposition, said second set excluding all words in said first set, a third set of candidate words comprising; (a) candidate words from the phonetic list which meet the three criteria that (i) a comparison of the phonetic codes of the candidate word and query word exceed a first predetermined value, (ii) a comparison of the essence codes of the candidate word and the query word exceed a second predetermined value, and (iii) the first and last phonetic codes of the candidate word and query word are identical, and (b) candidate words from said typographic list which have either (i) a vowel substitution as comapred with the query word or (ii) a vowel addition or deletion adjacent to a vowel in the query word, said third set excluding all words in aid first and second set, and a fourth set of candidate words comprising; (a) words from said phonetic list whose phonetic code comparison with the phonetic code of the query word provides a comparison value that exceeds a predetermined value, and (b) words from said typographic code whose typographic code comparison with the typographic code of the query word exceeds a second predetermined value, said fourth set excluding all words in said first, second or third sets. - View Dependent Claims (10, 11)
-
Specification