Handwriting symbol recognition accuracy using speech input
Abstract
Described is a bimodal data input technology by which handwriting recognition results are combined with speech recognition results to improve overall recognition accuracy. Handwriting data and speech data corresponding to mathematical symbols are received and processed (including being recognized) into respective graphs. A fusion mechanism uses the speech graph to enhance the handwriting graph, e.g., to better distinguish between similar handwritten symbols that are often misrecognized. The graphs include nodes representing symbols, and arcs between the nodes representing probability scores. When arcs in the first and second graphs are determined to match one another, such as aligned in time and associated with corresponding symbols, the probability score in the second graph for that arc is used to adjust the matching probability score in the first graph. Normalization and smoothing may be performed to correspond the graphs to one another and to control the influence of one graph on the other.
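The fusion step described in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the `Arc` record, the `overlaps` time-alignment test, the `fuse` function, and the `weight` smoothing parameter are all hypothetical names, both graphs are flattened to lists of scored arcs on a shared clock, and the adjustment rule (linear interpolation toward the best matching speech score) is one simple choice among many. Normalization is omitted.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Arc:
    """One scored hypothesis: `symbol` spans [start, end) with probability `prob`."""
    symbol: str
    start: float
    end: float
    prob: float


def overlaps(a: Arc, b: Arc) -> bool:
    """Assumed alignment test: the two arcs share some span of time."""
    return a.start < b.end and b.start < a.end


def fuse(handwriting, speech, weight=0.5):
    """Return handwriting arcs with scores adjusted by matching speech arcs.

    `weight` (hypothetical) limits the influence of the speech graph:
    0 ignores speech entirely, 1 replaces the handwriting score.
    """
    enhanced = []
    for h in handwriting:
        # A speech arc matches when it names the same symbol and is time-aligned.
        matches = [s.prob for s in speech
                   if s.symbol == h.symbol and overlaps(h, s)]
        if matches:
            # Interpolate toward the best matching speech score.
            new_prob = (1 - weight) * h.prob + weight * max(matches)
        else:
            new_prob = h.prob  # no matching speech arc: leave the score alone
        enhanced.append(replace(h, prob=new_prob))
    return enhanced


# A stroke read as either "5" or "S"; the speaker said "five" at the same time.
handwriting = [Arc("5", 0.0, 1.0, 0.45), Arc("S", 0.0, 1.0, 0.55)]
speech = [Arc("5", 0.2, 0.8, 0.9)]
enhanced = fuse(handwriting, speech)
best = max(enhanced, key=lambda a: a.prob)
```

In this toy case the handwriting recognizer alone prefers "S", while the fused scores prefer "5", which is the kind of disambiguation between similar handwritten symbols that the abstract describes.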
18 Claims
1. One or more computer-readable storage devices having computer-executable instructions, which when executed by one or more processing devices, cause the one or more processing devices to perform:
receiving handwriting input including handwritten mathematical symbols;
receiving speech input including spoken words corresponding to spoken mathematical symbols;
recognizing the handwriting input to identify candidate handwritten mathematical symbols;
recognizing the speech input to identify candidate spoken words;
processing the candidate handwritten mathematical symbols into a handwriting graph including nodes representing the candidate handwritten mathematical symbols and arcs representing probabilities that the candidate handwritten mathematical symbols are correct;
processing the candidate spoken words into a speech graph by:
accessing a lookup table with the candidate spoken words recognized from the speech input to identify candidate spoken mathematical symbols; and
including nodes in the speech graph representing the candidate spoken mathematical symbols and arcs representing probabilities that the candidate spoken mathematical symbols identified from the lookup table are correct; and
enhancing the handwriting graph into an enhanced graph by adjusting individual probabilities in the handwriting graph based on individual matching probabilities in the speech graph.
Dependent claims: 2, 3, 4, 5, 6

7. A method comprising:
receiving handwriting input including handwritten mathematical symbols;
receiving speech input including a spoken word corresponding to a spoken mathematical symbol;
recognizing the handwriting input to identify candidate handwritten mathematical symbols;
recognizing the speech input to identify a candidate spoken word;
processing the candidate handwritten mathematical symbols into a handwriting graph reflecting probabilities, represented by arcs, that the candidate handwritten mathematical symbols, represented by nodes, correctly identify the handwritten mathematical symbols;
processing the candidate spoken word by:
accessing a lookup table with the candidate spoken word recognized from the speech input to identify a candidate spoken mathematical symbol; and
generating a speech graph reflecting a probability, represented by an arc, that the candidate spoken mathematical symbol, represented by a node, correctly identifies the spoken mathematical symbol; and
fusing the handwriting graph and the speech graph to create an enhanced graph by adjusting individual probabilities of the handwriting graph based on individual matching probabilities in the speech graph, wherein at least the fusing is performed by a processing device.
Dependent claims: 8, 9, 10, 11, 12

13. A system comprising:
a handwriting recognizer configured to:
receive handwriting input including handwritten mathematical symbols;
recognize the handwriting input to identify candidate handwritten mathematical symbols; and
generate a handwriting graph reflecting probabilities, represented by arcs, that the candidate handwritten mathematical symbols, represented by nodes, correctly identify the handwritten mathematical symbols;
a speech recognizer configured to:
receive speech input including a spoken word corresponding to a spoken mathematical symbol;
recognize the speech input to identify a candidate spoken word;
access a lookup table with the candidate spoken word recognized from the speech input to identify a candidate spoken mathematical symbol; and
generate a speech graph reflecting a probability, represented by an arc, that the candidate spoken mathematical symbol, represented by a node, correctly identifies the spoken mathematical symbol;
a fusion mechanism configured to fuse the handwriting graph and the speech graph to create an enhanced graph by adjusting individual probabilities of the handwriting graph based on individual matching probabilities in the speech graph; and
at least one processing device configured to execute the handwriting recognizer, the speech recognizer, or the fusion mechanism.
Dependent claims: 14, 15, 16, 17, 18
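
The lookup-table step recited in the independent claims, in which recognized spoken words are mapped to candidate mathematical symbols before the speech graph is built, might look like the sketch below. Everything here is an assumption made for illustration: the table contents, the `(word, start, end, prob)` tuple format for recognizer output, and the name `build_speech_graph` do not come from the patent. Each returned record stands in for a node (the symbol) together with the arc score attached to it.

```python
# Hypothetical lookup table: spoken word -> mathematical symbol.
WORD_TO_SYMBOL = {
    "plus": "+",
    "minus": "-",
    "equals": "=",
    "five": "5",
    "alpha": "α",
}


def build_speech_graph(candidates):
    """Turn recognized words into scored symbol records.

    `candidates` is an assumed format: (word, start, end, prob) tuples
    from a speech recognizer. Words with no table entry are dropped,
    since they name no mathematical symbol.
    """
    graph = []
    for word, start, end, prob in candidates:
        symbol = WORD_TO_SYMBOL.get(word.lower())
        if symbol is not None:
            graph.append(
                {"symbol": symbol, "start": start, "end": end, "prob": prob}
            )
    return graph


speech_graph = build_speech_graph([
    ("Five", 0.0, 0.4, 0.9),
    ("um", 0.4, 0.5, 0.3),    # filler word, not in the table
    ("plus", 0.5, 0.9, 0.8),
])
```

Keeping the table separate from the recognizer means the spoken vocabulary can be extended without retraining anything, which is one plausible reason to structure the step this way.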
Specification