Retrieving handwritten documents using multiple document recognizers and techniques allowing both typed and handwritten queries
Abstract
The techniques in the present invention allow both text and handwritten queries, and the queries can be single-word or multiword. Generally, each handwritten word in a handwritten document is converted to a document stack of words, where each document stack contains a list of text words and a word score of some type for each text word in the list. The query is also converted to one or more stacks of words. A measure is determined from each query and document stack. Documents that meet search criteria in the query are then selected based on the query and the values of the measures. The present invention also performs multiple recognitions, with multiple recognizers, on a handwritten document to create multiple recognized transcriptions of the document. The multiple transcriptions are used for document retrieval. In another embodiment, a single transcription is created from the multiple transcriptions, and the single transcription is used for document retrieval.
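The stack idea in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation: the dictionary representation, the names `Stack`, `typed_query_stack`, and `measure`, the sample scores, and the choice of a sum of score products as the measure are all assumptions made for illustration.

```python
from typing import Dict

# Assumption: a "stack" is modeled as a mapping from candidate text word
# to the word score assigned to that candidate.
Stack = Dict[str, float]


def typed_query_stack(word: str) -> Stack:
    """A typed query word is known exactly, so its stack has a single entry."""
    return {word.lower(): 1.0}


def measure(query: Stack, document: Stack) -> float:
    """Hypothetical measure: sum of score products over words in both stacks."""
    return sum(
        score * document[word]
        for word, score in query.items()
        if word in document
    )


# Hypothetical recognizer output for one handwritten word in a document.
doc_stack: Stack = {"cat": 0.6, "cot": 0.3, "cut": 0.1}

print(measure(typed_query_stack("cat"), doc_stack))  # 0.6
```

A handwritten query would be passed through a recognizer as well, so its stack would hold several candidates instead of one; the same `measure` function applies unchanged.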
37 Claims
1. A method comprising the steps of:
creating a document stack from at least one word in a handwritten document;
creating a query stack from a query; and
determining a measure between the document stack and the query stack.
13. A method comprising the steps of:
for each of a plurality of documents, performing the following steps:
creating a document stack from at least one word in a text document;
creating a query stack from a query;
determining a measure between the document stack and the query stack; and
scoring the documents based on the measure, thereby creating a document score; and
displaying each document whose document score meets a predetermined threshold.
16. A method for retrieving a subset of handwritten documents from a set of handwritten documents, each of the handwritten documents having a plurality of document stacks associated therewith, the method comprising the steps of:
a) creating at least one query stack from a query comprising one or more words, wherein each word is handwritten or typed;
b) selecting a handwritten document from the set of handwritten documents;
c) selecting a document stack from the selected handwritten document;
d) determining a measure between the at least one query stack and the selected document stack;
e) performing steps (c) and (d) for at least one document stack associated with the selected handwritten document;
f) performing steps (b), (c), and (d) for each handwritten document of the set of handwritten documents;
g) scoring each of the handwritten documents in the set of handwritten documents by using the query and the measures, thereby creating a number of document scores; and
h) selecting the subset of handwritten documents for display by using the document scores.
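Steps (a) through (h) can be sketched in Python as follows. This is an illustrative reading, not the claimed method itself: the score-product measure, the rule that a document's score is the product of each query word's best match, the threshold, and all names and sample data are assumptions.

```python
from typing import Dict, List

# Assumption: a stack maps candidate text words to word scores.
Stack = Dict[str, float]


def measure(query: Stack, document: Stack) -> float:
    """Hypothetical measure (step d): sum of score products over shared words."""
    return sum(score * document.get(word, 0.0) for word, score in query.items())


def score_document(query_stacks: List[Stack], doc_stacks: List[Stack]) -> float:
    """Assumed scoring rule (step g): multiply each query word's best measure."""
    score = 1.0
    for q in query_stacks:
        # Steps (c) to (e): compare the query stack with every document stack.
        best = max((measure(q, d) for d in doc_stacks), default=0.0)
        score *= best
    return score


def retrieve(
    query_stacks: List[Stack],
    documents: Dict[str, List[Stack]],
    threshold: float,
) -> List[str]:
    """Steps (b), (f), (h): score every document, keep those at or above threshold."""
    scores = {
        name: score_document(query_stacks, stacks)
        for name, stacks in documents.items()
    }
    ranked = sorted(scores, key=scores.get, reverse=True)
    return [name for name in ranked if scores[name] >= threshold]


# Hypothetical data: two handwritten notes, each with two document stacks.
documents = {
    "note_a": [{"meeting": 0.7, "melting": 0.3}, {"monday": 0.9, "money": 0.1}],
    "note_b": [{"melting": 0.8, "meeting": 0.2}, {"snow": 1.0}],
}
# Step (a): a typed two-word query gives two single-entry query stacks.
query = [{"meeting": 1.0}, {"monday": 1.0}]

print(retrieve(query, documents, threshold=0.5))  # ['note_a']
```

Multiplying the best matches gives the query AND-like behavior, so a document missing any query word scores zero; summing instead would behave more like OR. Either choice is compatible with the claim language, which only requires that the score use the query and the measures.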
26. A method comprising the steps of:
creating a first word stack, by using a first handwriting recognizer, from at least one word;
creating a second word stack, by using a second handwriting recognizer, from the at least one word; and
comparing the first and second word stacks with a third word stack to determine whether a handwritten document should be retrieved.
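The comparison in this claim might look like the Python sketch below. The claim does not say how the two recognizer stacks are compared with the third stack, so taking the larger of two score-product measures against a fixed threshold is purely an assumption, as are the function names and the sample recognizer outputs.

```python
from typing import Dict

# Assumption: a word stack maps candidate text words to word scores.
Stack = Dict[str, float]


def measure(a: Stack, b: Stack) -> float:
    """Hypothetical measure: sum of score products over words in both stacks."""
    return sum(score * b.get(word, 0.0) for word, score in a.items())


def should_retrieve(
    first: Stack, second: Stack, third: Stack, threshold: float = 0.5
) -> bool:
    """Assumed rule: retrieve if either recognizer's stack matches well enough."""
    return max(measure(third, first), measure(third, second)) >= threshold


# Hypothetical outputs of two different recognizers for the same handwritten word.
first_stack: Stack = {"cot": 0.6, "cat": 0.4}
second_stack: Stack = {"cat": 0.8, "eat": 0.2}

# The third stack here is a typed query word.
print(should_retrieve(first_stack, second_stack, {"cat": 1.0}))  # True
print(should_retrieve(first_stack, second_stack, {"dog": 1.0}))  # False
```

In this example the first recognizer alone would miss the word at a 0.5 threshold, while the second recognizer recovers it, which is the practical motivation for using more than one recognizer.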
34. A computer system comprising:
a memory that stores computer-readable code; and
a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to:
create a document stack from at least one word in a handwritten document;
create a query stack from a query; and
determine a measure between the document stack and the query stack.
35. A computer system comprising:
a memory that stores computer-readable code; and
a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to:
create a first word stack, by using a first handwriting recognizer, from at least one word;
create a second word stack, by using a second handwriting recognizer, from the at least one word; and
compare the first and second word stacks with a third word stack to determine whether a handwritten document should be retrieved.
36. An article of manufacture comprising:
a computer readable medium having computer-readable code means embodied thereon, the computer-readable program code means comprising:
a step to create a document stack from at least one word in a handwritten document;
a step to create a query stack from a query; and
a step to determine a measure between the document stack and the query stack.
37. An article of manufacture comprising:
a computer readable medium having computer-readable code means embodied thereon, the computer-readable program code means comprising:
a step to create a first word stack, by using a first handwriting recognizer, from at least one word;
a step to create a second word stack, by using a second handwriting recognizer, from the at least one word; and
a step to compare the first and second word stacks with a third word stack to determine whether a handwritten document should be retrieved.
Specification