Natural language grammar enablement by speech characterization
First Claim
1. A non-transitory computer-readable medium comprising code effective to cause one or more processors to:
characterize a speech utterance to determine at least one characteristic;
recognize the speech utterance, without regard to the at least one characteristic, to produce at least one transcription hypothesis;
parse the at least one transcription hypothesis according to a set of grammar rules to produce a plurality of interpretation hypotheses, each having a corresponding likelihood score;
provide a plurality of grammar rule weights for the plurality of grammar rules corresponding to a plurality of speech characteristics;
select one or more grammar rule weights corresponding to the at least one characteristic from the plurality of grammar rule weights;
for each interpretation hypothesis of the plurality of interpretation hypotheses, adjust the likelihood score of the each interpretation hypothesis according to the selected one or more grammar rule weights corresponding to the at least one characteristic;
determine a set of authorized domains based on the at least one characteristic; and
filter the plurality of interpretation hypotheses according to the set of authorized domains; and
select a selected interpretation hypothesis from the plurality of interpretation hypotheses according to the likelihood scores thereof.
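The last steps of claim 1 (weight selection, score adjustment, domain filtering, final selection) can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Interpretation` type, the `RULE_WEIGHTS` and `AUTHORIZED_DOMAINS` tables, the example characteristics, and the choice to adjust scores by multiplication are all assumptions made for the example. Characterization, recognition, and parsing are taken as already done, so the function receives a characteristic and a list of scored hypotheses.

```python
from dataclasses import dataclass


@dataclass
class Interpretation:
    text: str      # the transcription this interpretation was parsed from
    rule: str      # grammar rule that produced the interpretation
    domain: str    # domain the grammar rule belongs to
    score: float   # likelihood score assigned by the parser


# Hypothetical tables: per-characteristic grammar rule weights and authorized domains.
RULE_WEIGHTS = {
    "child": {"play_cartoon": 1.5, "buy_item": 0.5},
    "adult": {"play_cartoon": 0.8, "buy_item": 1.2},
}
AUTHORIZED_DOMAINS = {
    "child": {"media"},
    "adult": {"media", "shopping"},
}


def select_interpretation(characteristic, hypotheses):
    """Reweight, filter by authorized domain, and pick the best hypothesis."""
    weights = RULE_WEIGHTS.get(characteristic, {})
    allowed = AUTHORIZED_DOMAINS.get(characteristic, set())

    # Adjust every hypothesis score by the weight of the rule that produced it.
    # Multiplication is an assumption; the claim only says "adjust".
    rescored = [
        Interpretation(h.text, h.rule, h.domain, h.score * weights.get(h.rule, 1.0))
        for h in hypotheses
    ]

    # Drop hypotheses whose domain is not authorized for this characteristic.
    permitted = [h for h in rescored if h.domain in allowed]

    # Select the remaining hypothesis with the highest adjusted score, if any.
    return max(permitted, key=lambda h: h.score) if permitted else None


hypotheses = [
    Interpretation("buy the movie", "buy_item", "shopping", 0.6),
    Interpretation("play the movie", "play_cartoon", "media", 0.4),
]
best_for_child = select_interpretation("child", hypotheses)
best_for_adult = select_interpretation("adult", hypotheses)
```

With these made-up tables, the same pair of hypotheses resolves to the media interpretation for a speaker characterized as a child and to the shopping interpretation for an adult, even though recognition itself ignored the characteristic.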
9 Assignments
0 Petitions
Abstract
Either or both of voice speaker identification and utterance classification, such as by age, gender, accent, mood, and prosody, characterize speech utterances in a system that performs automatic speech recognition (ASR) and natural language processing (NLP). The characterization conditions NLP, either through application to interpretation hypotheses or to specific grammar rules. The characterization also conditions language models of ASR. Conditioning may comprise enablement and may comprise reweighting of hypotheses.
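The abstract also allows conditioning to act on specific grammar rules through enablement, rather than only on finished interpretation hypotheses. A rough sketch of that variant, assuming speaker identification as the characterization: the rule names, the `ENABLED_RULES` table, the speaker labels, and the fallback set for unrecognized speakers are all hypothetical.

```python
# Hypothetical grammar rule inventory.
ALL_RULES = {"play_music", "set_alarm", "transfer_funds"}

# Hypothetical enablement table keyed by identified speaker.
ENABLED_RULES = {
    "owner": {"play_music", "set_alarm", "transfer_funds"},
    "guest": {"play_music", "set_alarm"},
}

# Assumed fallback for a voice that matches no known speaker.
DEFAULT_RULES = {"play_music"}


def rules_for(speaker_id):
    """Return the grammar rules the parser may apply for this speaker."""
    enabled = ENABLED_RULES.get(speaker_id, DEFAULT_RULES)
    # Intersect with the inventory so a stale table entry cannot enable a removed rule.
    return enabled & ALL_RULES


owner_rules = rules_for("owner")
guest_rules = rules_for("guest")
stranger_rules = rules_for("stranger")
```

Enablement behaves like a weight of zero or one applied before parsing, whereas reweighting scales scores after interpretation hypotheses exist.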
39 Citations
4 Claims
1. Independent claim, recited in full under First Claim above.
Dependent claims: 2, 3, 4
Specification