Handwriting recognition system simultaneously considering shape and context information
Abstract
The speed and accuracy of a computer-implemented handwriting recognition system are enhanced by several innovations, including integrated segmentation and context processing. The recognition processing occurs while the user is providing ink data. The system quickly reaches the recognition result once all of the input is received. More than one result may be returned by the system.
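The idea of doing recognition work while ink is still arriving can be illustrated with a minimal sketch. It is not taken from the patent: the class name IncrementalRecognizer, its methods, the beam width, and the per-character costs are all hypothetical, and each "package" is simplified to a mapping from candidate character to a cost where lower is better.

```python
class IncrementalRecognizer:
    """Toy recognizer that updates its hypotheses as each package of ink arrives."""

    def __init__(self, beam_width=3):
        # beam_width is an assumed tuning knob, not a value from the patent.
        self.beam_width = beam_width
        # Each kept path is (text so far, accumulated cost).
        self.paths = [("", 0.0)]

    def add_package(self, candidates):
        """Extend every kept path with every candidate character.

        Intended to be called while the user is still writing, so most of
        the work is already done when the input ends.
        """
        extended = [
            (text + char, cost + char_cost)
            for text, cost in self.paths
            for char, char_cost in candidates.items()
        ]
        extended.sort(key=lambda item: item[1])
        self.paths = extended[: self.beam_width]

    def finish(self, max_results=2):
        """Return the lowest-cost texts; more than one result may be returned."""
        return [text for text, _ in self.paths[:max_results]]


recognizer = IncrementalRecognizer()
for package in [{"h": 0.1, "b": 0.6}, {"i": 0.2, "l": 0.3}]:
    recognizer.add_package(package)  # runs as each package arrives
results = recognizer.finish()
```

In this sketch, finish only slices an already sorted list, which mirrors the abstract's point that the result is reached quickly once all input has been received.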
85 Citations
11 Claims
1. A method of processing data that represents strokes of handwritten information, thereby to identify characters represented by the data, the method comprising the steps of:
receiving the data provided by a user, the data having a beginning part and an end part that define therebetween a complete amount of handwritten information to be processed;
assembling portions of the data into a sequence of packages; and
determining for the sequence of packages a probable result set of characters represented by the sequence, the determining step occurring before the end part of the data is received and comprising the steps of simultaneously considering information pertaining to the character shapes represented by the packages and information pertaining to the literal context of the characters represented by the packages;
wherein the determining step includes searching for the result set of characters from among a plurality of character candidates using a first search technique; and
wherein the first of two search techniques comprises the steps of:
developing for each package at least one path representing a string of character candidates, each path having a respective beginning and a respective end; and
generating for each path a first cost value that reflects the probability that the path includes the result set of characters, the first cost value being based upon information pertaining to the character shapes represented by the packages and information pertaining to the literal context of the characters represented by the packages and
wherein the second of the two search techniques assigns a second cost value to a considered path based upon the literal context of the considered path, and
wherein the second search technique conducts a reverse search towards the respective beginning of at least some of the paths developed by the first search technique to identify the result set as a path having the lowest of the combined first and second cost values.
Dependent claims: 2, 3, 4
5. A method of processing data that represents strokes of handwritten information, thereby to identify characters represented by the data, the method comprising the steps of:
receiving the data provided by a user, the data defining an amount of handwritten information to be processed;
assembling portions of the data into a sequence of packages;
determining for the sequence of packages a probable result set of characters represented by the sequence, the determining step comprising the step of simultaneously considering information pertaining to the character shapes represented by the packages and information pertaining to the literal context of the characters represented by the packages
wherein the determining step includes searching for the result set of characters from among a plurality of character candidates using a first search technique and combining the first search technique with a second search technique that is performed subsequent to the first technique;
wherein considering the information pertaining to the character shapes represented by the packages and the information pertaining to the literal context of the characters represented by the packages includes considering whether the size of a space between two adjacent character candidates exceeds a predetermined amount representative of a space between adjacent character candidates.
6. A method of processing a received stream of data that is representative of stylus tip movement on a tablet or the like, comprising the steps of:
dividing the stream into a plurality of sequential frames;
combining the frames of input data into one or more packages;
assigning one or more character hypotheses to at least some of the packages;
constructing in a sequence conforming to the sequence with which the data is received paths of multiple character hypotheses;
assigning to portions of the paths values based upon the shape and literal context of the character hypotheses contained in the path; and
reverse-searching the constructed paths in a direction opposite to the direction the data was received.
Dependent claims: 7, 8, 9
10. A handwriting recognition system for providing a result of a recognition task comprising:
input means for collecting ink data that represents strokes of handwritten information, the data having a beginning part and an end part that define therebetween an amount of handwritten information to be processed;
classifier means for providing character hypotheses based on the ink data; and
search means for selecting the result by substantially simultaneously considering the shape and literal content of the character hypotheses as soon as the beginning part is provided by a user.
Dependent claims: 11
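The two-stage search recited in claims 1 and 6 (a forward pass that builds paths costed by shape and context, followed by a reverse pass that adds a second, context-only cost) can be illustrated with a toy sketch. This is an assumption-laden illustration, not the patented implementation: the beam search, the bigram table, the lexicon, the penalty values, and every name below are hypothetical, and each package is simplified to exactly one character with a per-candidate shape cost.

```python
from dataclasses import dataclass

# Hypothetical context model: cost of `cur` following `prev` (lower is better).
BIGRAM_COST = {("c", "a"): 0.1, ("a", "t"): 0.1, ("c", "l"): 0.8, ("l", "t"): 0.7}
DEFAULT_BIGRAM_COST = 1.0

# Hypothetical lexicon and penalty used only by the reverse pass.
LEXICON = {"cat", "eat"}
SUFFIX_PENALTY = 1.5


@dataclass(frozen=True)
class Path:
    chars: str         # string of character candidates, beginning to end
    first_cost: float  # shape + context cost from the forward pass


def forward_search(packages, beam_width=4):
    """First technique: extend paths package by package, in input order."""
    paths = [Path("", 0.0)]
    for candidates in packages:
        extended = []
        for path in paths:
            for char, shape_cost in candidates.items():
                context_cost = 0.0
                if path.chars:
                    context_cost = BIGRAM_COST.get(
                        (path.chars[-1], char), DEFAULT_BIGRAM_COST
                    )
                extended.append(
                    Path(path.chars + char, path.first_cost + shape_cost + context_cost)
                )
        paths = sorted(extended, key=lambda p: p.first_cost)[:beam_width]
    return paths


def second_cost(chars):
    """Context-only cost, accumulated from the end of a path toward its beginning."""
    cost = 0.0
    suffix = ""
    for char in reversed(chars):
        suffix = char + suffix
        # Penalize each step whose suffix cannot end any lexicon word.
        if not any(word.endswith(suffix) for word in LEXICON):
            cost += SUFFIX_PENALTY
    return cost


def reverse_search(paths):
    """Second technique: pick the path with the lowest combined cost."""
    scored = [(p.first_cost + second_cost(p.chars), p.chars) for p in paths]
    return min(scored)


def recognize(packages):
    return reverse_search(forward_search(packages))


# Hypothetical shape costs, as a classifier might report for three packages.
PACKAGES = [
    {"c": 0.2, "e": 0.9},
    {"l": 0.4, "a": 0.5},
    {"t": 0.3, "l": 0.6},
]
combined_cost, text = recognize(PACKAGES)
```

For the toy data above, recognize selects "cat". The sketch rescans each surviving path independently in the reverse pass; a real system would more likely share partial paths and prune during the reverse search, which is omitted here for brevity.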
Specification