Multimodal note taking, annotation, and gaming
First Claim
Patent Images
1. A system that facilitates data processing, comprising:
- a processor; and
a memory, on which are stored processor-executable instructions, which when executed by the processor result in operation of an input component and an output component;
wherein the input component receives a plurality of different types of input data;
wherein the output component processes the plurality of different types of input data into a document to generate a rich document having a multidimensional level of data; and
wherein a text document is received by the input component and is processed by an optical character recognition (OCR) subsystem of the input component to produce optically recognized characters, wherein the input component utilizes at least one additional document according to the plurality of different types of input data received, having content that is related to the text document, to verify accuracy of the optically recognized characters, and wherein the verification comprises;
initiating OCR of the text document to generate the optically recognized characters;
determining a likelihood that the optically recognized characters are correct;
accessing the at least one additional document, wherein the at least one additional document is an image document;
analyzing content of the additional document; and
replacing optically recognized characters according to data from the additional document when indicated by the analysis and the likelihood.
2 Assignments
0 Petitions
Accused Products
Abstract
A multimodal, multilanguage mobile device which can be employed to enhance note taking and/or annotation of a document, and gaming. Input data types such as optical character recognition (OCR), speech, handwriting, and visual information (e.g., image and/or video), etc., can be fused to generate rich documents with a multidimensional level of data to provide an increased level of context over conventional documents. Such architecture can be utilized by students for homework management, as well as entertainment (e.g., gaming).
57 Citations
18 Claims
-
1. A system that facilitates data processing, comprising:
-
a processor; and a memory, on which are stored processor-executable instructions, which when executed by the processor result in operation of an input component and an output component; wherein the input component receives a plurality of different types of input data; wherein the output component processes the plurality of different types of input data into a document to generate a rich document having a multidimensional level of data; and wherein a text document is received by the input component and is processed by an optical character recognition (OCR) subsystem of the input component to produce optically recognized characters, wherein the input component utilizes at least one additional document according to the plurality of different types of input data received, having content that is related to the text document, to verify accuracy of the optically recognized characters, and wherein the verification comprises; initiating OCR of the text document to generate the optically recognized characters; determining a likelihood that the optically recognized characters are correct; accessing the at least one additional document, wherein the at least one additional document is an image document; analyzing content of the additional document; and replacing optically recognized characters according to data from the additional document when indicated by the analysis and the likelihood. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-implemented method of data processing, the method comprising:
-
receiving a plurality of different types of input data; selecting one or more of the plurality of different types of input data according to an application; associating the one or more of the plurality of different types of input data with a document; fusing the one or more of the plurality of different types of input data into a fused output with a fusion component, wherein the fusing of the plurality of different types of input data comprises; utilizing at least one additional document to verify accuracy of optical character recognition (OCR) of a text document to be fused, and wherein the verification comprises; initiating OCR of the text document to generate the optically recognized characters; determining a likelihood that the optically recognized characters are correct; accessing the at least one other document, wherein the at least one other document is an image document; analyzing content of the other document; and replacing optically recognized characters according to data from the other document when indicated by the analysis and the likelihood; and outputting a rich document having associated therewith the fused output of the one or more of the plurality of different types of input data. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A system that facilitates data processing, comprising:
-
a processor; a memory, on which are stored processor-executable instructions, which when executed by the processor result facilitates data processing; means for receiving a plurality of different types of input data for document processing; means for verifying accuracy of a text document, from among the plurality of different types of input data received by the means for receiving, wherein the text document is processed by an optical character recognition (OCR) algorithm to produce optically recognized characters, wherein at least one additional document having content that is related to the text document is used to verify accuracy of the OCR algorithm, and wherein the verification comprises; initiating OCR of the text document to generate the optically recognized characters; determining a likelihood that the optically recognized characters are correct; accessing the at least one other document, wherein the at least one other document is an image document; analyzing content of the other document; and replacing optically recognized characters according to data from the other document when indicated by the analysis and the likelihood; means for associating the plurality of different types of input data with a document; means for placing the plurality of different types of input data in the document according to a predetermined order; means for changing one input data based on analysis of other input data; and means for outputting a rich document having associated therewith the changed input data and the other input data.
-
Specification