Character segmentation technique with integrated word search for handwriting recognition
First Claim
1. A method for segmenting plural recognized characters into words in a handwriting recognition process, comprising the steps of:
- generating hypotheses pertaining to respective combinations of characters having word-break spaces at different locations between the characters;
assigning a weight value to each hypothesis;
searching the hypotheses having the highest weight values, relative to a dictionary of recognized words; and
selecting the hypothesis whose combination of characters and word break spaces most closely matches entries in the dictionary.
2 Assignments
0 Petitions
Accused Products
Abstract
After each complete stroke in a handwriting recognition process, a hypothesis is generated whether a word break is present between the previous stroke and the new stroke. This hypothesis is weighted with a probability of a word-break occurring between the strokes. This probability is determined from the geometrical relationships between characters. Subsequently, a word search is carried out on the basis of these weighted hypotheses, to identity the most likely candidates for the words represented by the written strokes. A user interface is provided that offers the user a limited list of alternative word recognitions for a group of characters. These recognitions undergo segmentation filtering, in accordance with the word breaks of the selected hypotheses, to present the user with only those alternatives having the same groupings of strokes.
60 Citations
13 Claims
-
1. A method for segmenting plural recognized characters into words in a handwriting recognition process, comprising the steps of:
-
generating hypotheses pertaining to respective combinations of characters having word-break spaces at different locations between the characters; assigning a weight value to each hypothesis; searching the hypotheses having the highest weight values, relative to a dictionary of recognized words; and selecting the hypothesis whose combination of characters and word break spaces most closely matches entries in the dictionary. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A handwriting recognition system, comprising:
-
an input device in which a user enters handwritten strokes; a stroke processor for measuring distances between strokes; a character classifier for classifying groups of strokes into recognized characters; and a word recognition device for generating output signals indicative of possible words represented by the handwritten strokes, said word recognition device including; means for generating hypotheses pertaining to respective combinations of characters having word-break spaces at different locations between the characters, and for assigning a weight value to each hypothesis; means for searching the hypotheses having the highest weight values relative to a dictionary of recognized words; and means for selecting the hypothesis whose combination of characters and word break spaces most closely matches entries in the dictionary. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A method for identifying words within handwritten characters, comprising the steps of:
-
receiving strokes entered by a user and classifying one or more strokes as a possible character; receiving additional strokes entered by a user and classifying said additional strokes as additional possible characters; grouping successive possible characters into combinations having word-break spaces at different respective locations between the possible characters to form word hypotheses, and assigning a weight value to each word hypothesis based upon a geometric relationship between the strokes comprising the possible characters; searching the hypotheses having the highest weight values, relative to a dictionary of recognized words; and selecting the hypothesis whose combination of characters and word break spaces most closely matches entries in the dictionary. - View Dependent Claims (12)
-
-
13. A method for identifying words within handwritten characters, comprising the steps of:
-
receiving strokes entered by a user and classifying one or more strokes as a possible character; receiving additional strokes entered by a user and classifying said additional strokes as a second possible character; grouping said two possible characters into combinations having word-break spaces at different respective locations between the possible characters to form word hypotheses, and assigning a weight value to each word hypothesis based upon a geometric relationship between the strokes comprising the two possible characters; receiving additional strokes entered by the user and classifying said additional strokes as successive possible characters; after each successive possible character is classified, forming additional word hypothesis having word-break spaces at different respective locations between the possible characters, and assigning a weight value to each word hypothesis; searching the hypotheses having the highest weight values, relative to a dictionary of recognized words; and selecting the hypothesis whose combination of characters and word break spaces most closely matches entries in the dictionary.
-
Specification