Phonetic fragment search in speech data
First Claim
Patent Images
1. A method of generating a lattice from audio data, comprising:
- recognizing phonetic fragments within the audio data wherein at least some of the phonetic fragments include at least two phones;
accessing a mutual information score for recognized phonetic fragments within the audio data that include at least two phones, wherein the mutual information score for each of the phonetic fragments having at least two phones is a function of a likelihood that phones in the phonetic fragment occur consecutively and a likelihood that each phone in the phonetic fragment occurs independent of other phones in the phonetic fragment; and
determining a score for paths joining adjacent phonetic fragments in the audio data using in part the mutual information score for the phonetic fragments having at least two phones.
2 Assignments
0 Petitions
Accused Products
Abstract
A method of searching audio data is provided including receiving a query defining multiple phonetic possibilities. The method also includes comparing the query with a lattice of phonetic hypotheses associated with the audio data to identify if at least one of the multiple phonetic possibilities is approximated by at least one phonetic hypothesis in the lattice of phonetic hypotheses.
31 Citations
4 Claims
-
1. A method of generating a lattice from audio data, comprising:
-
recognizing phonetic fragments within the audio data wherein at least some of the phonetic fragments include at least two phones; accessing a mutual information score for recognized phonetic fragments within the audio data that include at least two phones, wherein the mutual information score for each of the phonetic fragments having at least two phones is a function of a likelihood that phones in the phonetic fragment occur consecutively and a likelihood that each phone in the phonetic fragment occurs independent of other phones in the phonetic fragment; and determining a score for paths joining adjacent phonetic fragments in the audio data using in part the mutual information score for the phonetic fragments having at least two phones. - View Dependent Claims (2, 3, 4)
-
Specification