Method and system for lexical processing
Abstract
A lexical processor and its method of use are provided. The lexical processor includes an input interface (300) and a word generator (302) for producing an output as a function of an input word and a confusion matrix. The confusion matrix is a handwriting error model that is based on the recognition capabilities of classifiers used in preprocessing inputs to the lexical processor. The lexical processor output comprises any of the following: the input word, a rejection indicator, a candidate replacement word, or a suggestion list of related words.
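As a rough illustration of the data the abstract describes, the sketch below models the confusion matrix as a nested mapping from a recognized character to the characters the writer may have intended, and enumerates the four output kinds. The names `OutputKind`, `CONFUSION`, and `alternatives`, as well as every likelihood value, are hypothetical; the abstract does not prescribe a representation.

```python
from __future__ import annotations

from enum import Enum


class OutputKind(Enum):
    """The four output kinds listed in the abstract."""
    INPUT_WORD = "input word"
    REJECTION = "rejection indicator"
    REPLACEMENT = "candidate replacement word"
    SUGGESTIONS = "suggestion list"


# Hypothetical confusion matrix: CONFUSION[recognized][intended] is an
# illustrative likelihood that the classifier emitted `recognized` when the
# writer intended `intended`. The numbers are invented for this example.
CONFUSION = {
    "c": {"c": 0.80, "e": 0.15, "o": 0.05},
    "l": {"l": 0.70, "1": 0.20, "i": 0.10},
    "o": {"o": 0.85, "0": 0.10, "a": 0.05},
}


def alternatives(recognized: str) -> list[tuple[str, float]]:
    """Return likely intended characters for `recognized`, most likely first,
    excluding the recognized character itself."""
    row = CONFUSION.get(recognized, {})
    ranked = sorted(row.items(), key=lambda kv: kv[1], reverse=True)
    return [(ch, p) for ch, p in ranked if ch != recognized]
```

In a real system the matrix would presumably be estimated from the classifiers' observed errors, which is what makes it a model of the recognizer rather than of the language.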
19 Claims
1. In a real-time handwriting recognition system, a method of lexical processing that produces a lexical output, the method comprising the following steps:
- receiving an input word derived from non-scanned input data, the input word including a plurality of characters and a plurality of character confidence values;
- generating a first result by comparing a rejection ratio, based on the plurality of character confidence values, to a reject character ratio threshold;
- generating a second result by comparing the average of the plurality of character confidence values to an accept threshold; and
- producing the lexical output according to the first result and the second result, wherein the lexical output is produced by a process step selected from the group consisting of:
  - accepting the input word without correction,
  - rejecting the input word as unidentifiable and including a rejection indicator in the lexical output, and
  - generating the lexical output as a function of the plurality of character confidence values and a confusion matrix.
Dependent claims: 2, 3, 4, 5, 9, 10
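The two comparisons in claim 1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the claim does not define how the rejection ratio is computed, so the sketch assumes it is the fraction of characters whose confidence falls below a per-character cutoff. The constants and the function names `rejection_ratio` and `decide` are hypothetical, and testing rejection before acceptance is an ordering borrowed from claim 6.

```python
from statistics import mean

# Hypothetical values; the claim names these thresholds but gives no numbers.
CHAR_REJECT_BELOW = 0.30       # a character counts as rejected below this
REJECT_RATIO_THRESHOLD = 0.50  # "reject character ratio threshold"
ACCEPT_THRESHOLD = 0.85        # "accept threshold"


def rejection_ratio(confidences):
    """Fraction of characters with confidence below CHAR_REJECT_BELOW
    (an assumed definition of the claim's rejection ratio)."""
    rejected = sum(1 for c in confidences if c < CHAR_REJECT_BELOW)
    return rejected / len(confidences)


def decide(word, confidences):
    """Return 'reject', 'accept', or 'correct' for the input word."""
    if not confidences or len(word) != len(confidences):
        raise ValueError("need one confidence value per character")
    # First result: is the rejection ratio above its threshold?
    first_result = rejection_ratio(confidences) > REJECT_RATIO_THRESHOLD
    # Second result: does the average confidence reach the accept threshold?
    second_result = mean(confidences) >= ACCEPT_THRESHOLD
    if first_result:
        return "reject"    # unidentifiable; emit a rejection indicator
    if second_result:
        return "accept"    # pass the input word through unchanged
    return "correct"       # hand off to confusion-matrix processing
```

For example, `decide("cat", [0.6, 0.7, 0.9])` falls through both tests and returns `"correct"`, the branch in which the confusion matrix would be consulted.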
6. In a handwriting recognition system, a method of lexical processing that produces a lexical output, the method comprising the following steps:
- receiving an input word having a plurality of characters and a plurality of character confidence values;
- generating a rejection ratio as a function of the plurality of character confidence values;
- determining whether the rejection ratio is within a reject range;
  - if so, including a rejection indicator in the lexical output;
  - if not, generating a mean character confidence as a function of the plurality of character confidence values;
- determining whether the mean character confidence value is within an accept range;
  - if so, equating the lexical output to the input word;
  - if not, generating a candidate word by:
    - using a confusion matrix to select a replacement character having a likelihood value that minimizes a distance between the candidate word and the input word; and
    - replacing at least one of the plurality of characters with the replacement character;
- spell-checking the candidate word against a lexicon to determine whether the candidate word is present in the lexicon;
  - if so, comparing the distance to a first threshold to produce the lexical output selected from the group consisting of the candidate word and a suggestion list;
  - if not, generating a miss rejection ratio as a function of the plurality of character confidence values; and
- selecting the lexical output by comparing the miss rejection ratio and the mean character confidence to a pair of miss thresholds, the lexical output being generated by a process selected from the group consisting of:
  - accepting the input word without correction,
  - rejecting the input word as unidentifiable and producing the rejection indicator, and
  - generating a suggestion list.
Dependent claims: 7, 8
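The branching in claim 6 is easier to follow as code. The sketch below is one possible reading under several stated assumptions: both ratios are the fraction of characters below a cutoff, the distance is taken to be `1 - likelihood` of the replacement, only the least confident character is replaced, and the suggestion list comes from the standard-library `difflib.get_close_matches`. All constants, the toy `CONFUSION` and `LEXICON` data, and the function names are hypothetical.

```python
from difflib import get_close_matches
from statistics import mean

REJECT = "<?>"            # hypothetical rejection indicator

# Hypothetical parameters; the claim names these quantities without values.
CHAR_LOW = 0.30           # cutoff used for the rejection ratio
REJECT_RATIO = 0.50       # ratios above this fall in the reject range
ACCEPT_MEAN = 0.85        # means at or above this fall in the accept range
FIRST_THRESHOLD = 0.50    # largest distance allowed for silent replacement
MISS_CHAR_LOW = 0.50      # cutoff used for the miss rejection ratio
MISS_REJECT_RATIO = 0.60  # pair of miss thresholds
MISS_ACCEPT_MEAN = 0.70

# Toy data: CONFUSION[recognized][intended] = illustrative likelihood.
CONFUSION = {"e": {"c": 0.6, "o": 0.2}, "l": {"i": 0.5, "1": 0.3}}
LEXICON = ["cat", "cot", "eat", "oat", "lit", "hit"]


def ratio_below(confidences, cutoff):
    """Fraction of character confidences below `cutoff`."""
    return sum(c < cutoff for c in confidences) / len(confidences)


def candidate_for(word, confidences):
    """Replace the least confident character with its most likely confusion.
    With distance assumed to be 1 - likelihood, the most likely replacement
    is the one that minimizes the distance."""
    i = min(range(len(word)), key=confidences.__getitem__)
    row = CONFUSION.get(word[i])
    if not row:
        return word, 1.0  # no known confusion: fall back to the input word
    repl, likelihood = max(row.items(), key=lambda kv: kv[1])
    return word[:i] + repl + word[i + 1:], 1.0 - likelihood


def lexical_output(word, confidences):
    if ratio_below(confidences, CHAR_LOW) > REJECT_RATIO:
        return REJECT                        # within the reject range
    avg = mean(confidences)
    if avg >= ACCEPT_MEAN:
        return word                          # within the accept range
    candidate, distance = candidate_for(word, confidences)
    if candidate in LEXICON:                 # spell-check hit
        return candidate if distance <= FIRST_THRESHOLD else [candidate]
    # Spell-check miss: apply the pair of miss thresholds.
    if ratio_below(confidences, MISS_CHAR_LOW) > MISS_REJECT_RATIO:
        return REJECT
    if avg >= MISS_ACCEPT_MEAN:
        return word
    return get_close_matches(word, LEXICON, n=3, cutoff=0.6)
```

Here a string return value stands for either the accepted or replaced word (or the rejection indicator), and a list stands for a suggestion list; a production design would more likely return a tagged result type.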
11. A real-time handwriting recognition system, comprising:
- a lexical processor that includes:
  - an input interface receiving an input word derived from non-scanned input data, the input word comprising a plurality of character confidence values and representing a handwritten word having a plurality of characters,
  - a calculator generating a word confidence value based on the character confidence values,
  - a comparator for comparing the word confidence value to a pair of thresholds to produce a comparator output, and
  - a selector producing a processor output in response to the comparator output,
- a spell-checker, operatively associated with the selector, for spell-checking a candidate word using a dictionary, and
- a computer memory for storing a confusion matrix;
wherein the processor output is selected from the input word and candidate word as a function of the comparator output and the confusion matrix.
Dependent claims: 12, 13, 14
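The components recited in claim 11 map naturally onto the methods of a single class. The sketch below is only an illustration of that decomposition: the class name `LexicalProcessor`, the threshold values, the one-entry confusion mapping, and the choice to compute word confidence as a mean are all assumptions. Because claim 11 lists only the input word and the candidate word as possible outputs, the low-confidence case here simply returns the input word.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class LexicalProcessor:
    low: float = 0.40    # hypothetical lower threshold of the pair
    high: float = 0.85   # hypothetical upper threshold of the pair
    # Hypothetical dictionary used by the spell-checker.
    dictionary: set = field(default_factory=lambda: {"cat", "cot"})
    # Simplified stored confusion matrix: recognized -> most likely intended.
    confusion: dict = field(default_factory=lambda: {"e": "c"})

    def calculate(self, confidences):
        """Calculator: word confidence as the mean character confidence
        (an assumed formula)."""
        return mean(confidences)

    def compare(self, word_confidence):
        """Comparator: place the word confidence relative to the thresholds."""
        if word_confidence >= self.high:
            return "high"
        if word_confidence < self.low:
            return "low"
        return "middle"

    def spell_check(self, candidate):
        """Spell-checker: is the candidate word in the dictionary?"""
        return candidate in self.dictionary

    def select(self, word, comparator_output):
        """Selector: choose between the input word and a candidate word."""
        if comparator_output != "middle":
            return word
        candidate = "".join(self.confusion.get(ch, ch) for ch in word)
        return candidate if self.spell_check(candidate) else word

    def process(self, word, confidences):
        """Input interface: run calculator, comparator, and selector in turn."""
        return self.select(word, self.compare(self.calculate(confidences)))
```

A call such as `LexicalProcessor().process("eat", [0.6, 0.7, 0.8])` lands between the two thresholds, builds the candidate "cat" from the confusion mapping, and returns it because the spell-checker finds it in the dictionary.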
15. An article of manufacture, comprising:
- a computer-readable memory storing a computer program that directs a computer to perform real-time handwriting recognition, wherein the computer program includes steps for:
  - receiving an input word derived from non-scanned input data, the input word including a plurality of character confidence values and a plurality of characters representing a word included in the handwriting text,
  - generating a first result by comparing a rejection ratio, based on the plurality of character confidence values, to a reject character ratio threshold,
  - generating a word confidence value based on the average of the character confidence values;
  - comparing the word confidence value to an accept threshold to generate a second result, and
  - producing the lexical output according to the first result and the second result, wherein the lexical output is produced by a computer program step selected from the group consisting of:
    - accepting the input word without correction,
    - rejecting the input word as unidentifiable and including a rejection indicator in the lexical output, and
    - generating the lexical output as a function of the plurality of character confidence values and a confusion matrix.
Dependent claims: 16, 17, 18, 19
Specification