Method and system for retrieving documents with spoken queries
First Claim
1. A computer implemented method for indexing and retrieving documents stored in a database, comprising the steps of:
- extracting a document feature vector from each of a plurality of documents;
indexing each of the plurality of documents according the associated document feature vector;
converting a spoken query to an intermediate representation representing possible sequential combinations of terms in the spoken query;
generating a query certainty vector from the intermediate representation;
acquiring other information;
combining the other information with the query certainty vector; and
comparing the query vector and the other information to each of the document feature vectors to retrieve a ranked result set of documents.
1 Assignment
0 Petitions
Accused Products
Abstract
A method indexes and retrieves documents stored in a database. A document feature vector is extracted from each document and the documents are then indexed according to the feature vectors. A spoken query is converted to an intermediate representation representing likelihoods of possible sequential combinations of terms in the spoken query. A query certainty vector is generated from the intermediate representation. Other information is acquired. The other information is combined with the query certainty vector. The query vector and the other information are then compared to each of the document feature vectors to retrieve a ranked result set of documents.
67 Citations
13 Claims
-
1. A computer implemented method for indexing and retrieving documents stored in a database, comprising the steps of:
-
extracting a document feature vector from each of a plurality of documents;
indexing each of the plurality of documents according the associated document feature vector;
converting a spoken query to an intermediate representation representing possible sequential combinations of terms in the spoken query;
generating a query certainty vector from the intermediate representation;
acquiring other information;
combining the other information with the query certainty vector; and
comparing the query vector and the other information to each of the document feature vectors to retrieve a ranked result set of documents. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
2. The method of claim 2, further comprising:
projecting the document feature vector and the query certainty vector to a low dimension.
-
13. A system for indexing and retrieving documents, comprising:
-
a plurality of documents, each document having an associated document feature vector;
a database indexing each of the plurality of documents according the associated document feature vector;
a speech recognition engine converting a spoken query to an intermediate representation representing possible sequential combinations of terms in the spoken query;
means for generating a query certainty vector from the intermediate representation;
means for acquiring other information;
means for combining the other information with the query certainty vector; and
a comparator configured to compare the query vector and the other information to each of the document feature vectors to retrieve a ranked result set of documents.
-
Specification