Speech index pruning
First Claim
Patent Images
1. A method of indexing a speech segment, the method comprising:
- identifying at least two alternative speech unit sequences based on the speech segment;
for each speech unit in the at least two alternative speech unit sequences, placing information in an entry in the index that indicates a position of the speech unit in at least one of the two alternative speech unit sequences and a probability of the speech unit appearing at the position; and
eliminating speech units from entries in the index based on a comparison of the probabilities to a threshold value.
2 Assignments
0 Petitions
Accused Products
Abstract
A speech segment is indexed by identifying at least two alternative word sequences for the speech segment. For each word in the alternative sequences, information is placed in an entry for the word in the index. Speech units are eliminated from entries in the index based on a comparison of a probability that the word appears in the speech segment and a threshold value.
227 Citations
20 Claims
-
1. A method of indexing a speech segment, the method comprising:
-
identifying at least two alternative speech unit sequences based on the speech segment;
for each speech unit in the at least two alternative speech unit sequences, placing information in an entry in the index that indicates a position of the speech unit in at least one of the two alternative speech unit sequences and a probability of the speech unit appearing at the position; and
eliminating speech units from entries in the index based on a comparison of the probabilities to a threshold value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer readable medium having computer executable instructions for performing steps comprising:
-
receiving a search query;
searching a speech index for an entry associated with a word in the search query;
retrieving from the entry a plurality of positions for the word in a plurality of speech signals, and a probability of the word appearing at each of the positions given the corresponding speech signal;
eliminating the positions based on a comparison of their probabilities to a threshold value;
using the probabilities to rank the speech signals corresponding to the remaining positions relative to each other to form ranked speech signals; and
returning search results based on the ranked speech signals. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A method of searching for speech segments comprising:
-
accessing an index containing probabilities of positions for words generated from a plurality of speech segments;
retrieving a threshold value;
retrieving a set of probabilities for a word from the index; and
returning identifiers for candidate speech segments that contain the word based on the probabilities in the set of probabilities that meet a predefined relationship to the threshold value. - View Dependent Claims (18, 19, 20)
-
Specification