Spoken document retrieval using multiple speech transcription indices
First Claim
1. A method of spoken document retrieval by a system using multiple search transcription indices, the method comprising:
- receiving an input phrase as a search query formed of query terms;
identifying a first type of query term in the input phrase, wherein the first type includes a query term in a speech recognition vocabulary of the system;
identifying one or more first search transcription indices for searching the first type of query term;
identifying a second type of query term in the input phrase, wherein the second type includes a query term not in the speech recognition vocabulary;
identifying a plurality of second search transcription indices for searching the second type of query term;
receiving at least a first list of results from the one or more first search transcription indices;
receiving plural lists of results from the plurality of second search transcription indices;
ranking results from a phonetic index of the plurality of second search transcription indices based on temporal proximity of phones, wherein the ranking computes a score in accordance with the following expression;
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system are provided of spoken document retrieval using multiple search transcription indices. The method includes receiving a query input formed of one or more query terms and determining a type of a query term, wherein a type includes a term in a speech recognition vocabulary or a term not in a speech recognition vocabulary. One or more indices of search transcriptions are selected for searching the query term based on the type of the query term. The one or more indices are generated using different speech transcription methods. The results for the query term are scored by the one or more indices and the results of the one or more indices for the query term are merged. The results of the one or more query terms are then merged to provide the results for the query.
55 Citations
16 Claims
-
1. A method of spoken document retrieval by a system using multiple search transcription indices, the method comprising:
-
receiving an input phrase as a search query formed of query terms; identifying a first type of query term in the input phrase, wherein the first type includes a query term in a speech recognition vocabulary of the system; identifying one or more first search transcription indices for searching the first type of query term; identifying a second type of query term in the input phrase, wherein the second type includes a query term not in the speech recognition vocabulary; identifying a plurality of second search transcription indices for searching the second type of query term; receiving at least a first list of results from the one or more first search transcription indices; receiving plural lists of results from the plurality of second search transcription indices; ranking results from a phonetic index of the plurality of second search transcription indices based on temporal proximity of phones, wherein the ranking computes a score in accordance with the following expression; - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A non-transitory computer-readable storage medium comprising computer-executable instructions that, when executed by a computer, cause the computer to execute acts of:
-
receiving an input phrase as a search query formed of query terms; identifying a first type of query term in the input phrase, wherein the first type includes a query term in a speech recognition vocabulary of the system; identifying one or more first search transcription indices for searching the first type of query term; identifying a second type of query term in the input phrase, wherein the second type includes a query term not in the speech recognition vocabulary; identifying a plurality of second search transcription indices for searching the second type of query term; receiving at least a first list of results from the one or more first search transcription indices; receiving plural lists of results from the plurality of second search transcription indices; ranking results from a phonetic index of the plurality of second search transcription indices based on temporal proximity of phones, wherein the ranking computes a score in accordance with the following expression;
-
-
13. A method of providing a service to a customer over a network for spoken document retrieval, the service comprising:
-
receiving an input phrase as a search query formed of query terms; identifying a first type of query term in the input phrase, wherein the first type includes a query term in a speech recognition vocabulary of the system; identifying one or more first search transcription indices for searching the first type of query term; identifying a second type of query term in the input phrase, wherein the second type includes a query term not in the speech recognition vocabulary; identifying a plurality of second search transcription indices for searching the second type of query term; receiving at least a first list of results from the one or more first search transcription indices; receiving plural lists of results from the plurality of second search transcription indices; ranking results from a phonetic index of the plurality of second search transcription indices based on temporal proximity of phones, wherein the ranking computes a score in accordance with the following expression;
-
-
14. A search system for spoken document retrieval using multiple search transcription indices, the system comprising:
-
a processor; a memory containing instructions that, when executed by the processor, adapt the search system to; receive a search query formed of one or more query terms, determine a first type and a second type of query term by reference to a speech recognition vocabulary, wherein the first type includes a term in a speech recognition vocabulary and the second type includes a term not in the speech recognition vocabulary, identify multiple search transcription indices for searching a first query term based on the type of the first query term, wherein the multiple indices include a first phonetic index and a second index that is different from the first index, rank results from the second index based on temporal proximity of phones, wherein the ranking computes a score in accordance with the following expression; - View Dependent Claims (15, 16)
-
Specification