System and method of lattice-based search for spoken utterance retrieval
First Claim
Patent Images
1. A method comprising:
- converting, via a processor, speech from a spoken document into a lattice representation comprising phones from the speech, wherein the lattice representation belongs to a recognition network represented as weighted finite state machines, and wherein the phones have a pronunciation length longer than a minimum pronunciation length; and
indexing the lattice representation for word and topic searching.
5 Assignments
0 Petitions
Accused Products
Abstract
A system and method are disclosed for retrieving audio segments from a spoken document. The spoken document preferably is one having moderate word error rates such as telephone calls or teleconferences. The method comprises converting speech associated with a spoken document into a lattice representation and indexing the lattice representation of speech. These steps are performed typically off-line. Upon receiving a query from a user, the method further comprises searching the indexed lattice representation of speech and returning retrieved audio segments from the spoken document that match the user query.
-
Citations
20 Claims
-
1. A method comprising:
-
converting, via a processor, speech from a spoken document into a lattice representation comprising phones from the speech, wherein the lattice representation belongs to a recognition network represented as weighted finite state machines, and wherein the phones have a pronunciation length longer than a minimum pronunciation length; and indexing the lattice representation for word and topic searching. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising; converting speech from a spoken document into a lattice representation comprising phones from the speech, wherein the lattice representation belongs to a recognition network represented as weighted finite state machines, and wherein the phones have a pronunciation length longer than a minimum pronunciation length; and indexing the lattice representation for word and topic searching. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-readable storage device having instructions stored which, when executed by a computing device, cause the computing device to perform operations comprising:
-
converting speech from a spoken document into a lattice representation comprising phones from the speech, wherein the lattice representation belongs to a recognition network represented as weighted finite state machines, and wherein the phones have a pronunciation length longer than a minimum pronunciation length; and indexing the lattice representation for word and topic searching. - View Dependent Claims (20)
-
Specification