STRUCTURED MODELS OF REPITITION FOR SPEECH RECOGNITION
First Claim
1. In a computing environment, a method, comprising, receiving two or more adjacent utterances, in which a later utterance is related to an earlier utterance by repetition, and using a structured model of repetition to determine an intention associated with at least one of the utterances.
2 Assignments
0 Petitions
Accused Products
Abstract
Described is a technology by which a structured model of repetition is used to determine the words spoken by a user, and/or a corresponding database entry, based in part on a prior utterance. For a repeated utterance, a joint probability analysis is performed on (at least some of) the corresponding word sequences as recognized by one or more recognizers) and associated acoustic data. For example, a generative probabilistic model, or a maximum entropy model may be used in the analysis. The second utterance may be a repetition of the first utterance using the exact words, or another structural transformation thereof relative to the first utterance, such as an extension that adds one or more words, a truncation that removes one or more words, or a whole or partial spelling of one or more words.
53 Citations
20 Claims
- 1. In a computing environment, a method, comprising, receiving two or more adjacent utterances, in which a later utterance is related to an earlier utterance by repetition, and using a structured model of repetition to determine an intention associated with at least one of the utterances.
- 15. In a computing environment, a system comprising, a repeat analysis mechanism that processes speech recognition results differently based on whether input speech is an initial input, or is repeated input speech related to prior input speech that includes the initial input, and, when the input speech is repeated input speech, the repeat analysis mechanism configured to combine recognition data corresponding to the repeated input speech with recognition data corresponding to the prior input speech to provide a recognition result for that repeated input speech.
- 18. One or more computer-readable media having computer-executable instructions, which when executed perform steps, comprising, receiving an utterance, determining if the utterance is a repeated utterance relative to a prior utterance, and if so, using word sequence data corresponding to recognition of the prior utterance in combination with word sequence data corresponding to recognition of the repeated utterance to select a recognition result for the repeated utterance.
Specification