System and method of lattice-based search for spoken utterance retrieval
First Claim
Patent Images
1. A method comprising:
- receiving data corresponding to a text query from a user;
retrieving a spoken document associated with the text query;
searching a word index of the spoken document associated with the text query using the text query, to yield first search results;
searching a sub-word index of the spoken document associated with the text query using the text query, to yield second search results; and
returning, via a computer network and according to the first search results and the second search results, audio segments from the spoken document associated with the text query which correspond to the text query.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method are disclosed for retrieving audio segments from a spoken document. The spoken document preferably is one having moderate word error rates such as telephone calls or teleconferences. The method comprises converting speech associated with a spoken document into a lattice representation and indexing the lattice representation of speech. These steps are performed typically off-line. Upon receiving a query from a user, the method further comprises searching the indexed lattice representation of speech and returning retrieved audio segments from the spoken document that match the user query.
-
Citations
20 Claims
-
1. A method comprising:
-
receiving data corresponding to a text query from a user; retrieving a spoken document associated with the text query; searching a word index of the spoken document associated with the text query using the text query, to yield first search results; searching a sub-word index of the spoken document associated with the text query using the text query, to yield second search results; and returning, via a computer network and according to the first search results and the second search results, audio segments from the spoken document associated with the text query which correspond to the text query. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; receiving data corresponding to a text query from a user; retrieving a spoken document associated with the text query; searching a word index of the spoken document associated with the text query using the text query, to yield first search results; searching a sub-word index of the spoken document associated with the text query using the text query, to yield second search results; and returning, via a computer network and according to the first search results and the second search results, audio segments from the spoken document associated with the text query which correspond to the text query. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-readable storage device having instructions stored which, when executed by a processor, cause the processor to perform operations comprising:
-
receiving data corresponding to a text query from a user; retrieving, based on the text query, a spoken document; searching a word index of the spoken document associated with the text query using the text query, to yield first search results; searching a sub-word index of the spoken document associated with the text query using the text query, to yield second search results; and returning, via a computer network and according to the first search results and the second search results, audio segments from the spoken document associated with the text query which correspond to the text query. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification