INTERMEDIATE SCORING AND REJECTION LOOPBACK FOR IMPROVED KEY PHRASE DETECTION
First Claim
Patent Images
1. A computer-implemented method for key phrase detection comprising:
- updating, at a current time instance, a start state based rejection model having a single state and a key phrase model having a plurality of states and associated with a predetermined key phrase based on scores of sub-phonetic units representative of received audio input, wherein said updating comprises;
providing a transition of a score from a particular state of the plurality of states of the key phrase model to a next state of the plurality of states of the key phrase model and to the single state of the rejection model; and
generating a rejection likelihood score corresponding to the single state of the start state based rejection model and a key phrase likelihood score corresponding to the key phrase model; and
determining whether the received audio input is associated with the predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include intermediate scoring of a state or states of a key phrase model and/or a backward transition or rejection loopback from a state of the key phrase model to a rejection model to reduce false accepts based on received utterances.
53 Citations
25 Claims
-
1. A computer-implemented method for key phrase detection comprising:
-
updating, at a current time instance, a start state based rejection model having a single state and a key phrase model having a plurality of states and associated with a predetermined key phrase based on scores of sub-phonetic units representative of received audio input, wherein said updating comprises; providing a transition of a score from a particular state of the plurality of states of the key phrase model to a next state of the plurality of states of the key phrase model and to the single state of the rejection model; and generating a rejection likelihood score corresponding to the single state of the start state based rejection model and a key phrase likelihood score corresponding to the key phrase model; and determining whether the received audio input is associated with the predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-implemented method for key phrase detection comprising:
-
updating a start state based rejection model and a key phrase model associated with a predetermined key phrase based on scores of sub-phonetic units representative of received audio input; determining a rejection likelihood score based on the updated start state based rejection model; determining an overall key phrase likelihood score comprising a minimum of a first likelihood score associated with a first state of the key phrase model and a second likelihood score associated with a second state of the key phrase model; and determining whether the received audio input is associated with the predetermined key phrase based on the rejection likelihood score and the overall key phrase likelihood score. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
-
17. A system for performing key phrase detection comprising:
-
a memory configured to store an acoustic model, a start state based rejection model, and a key phrase model associated with a predetermined key phrase; and a digital signal processor coupled to the memory, the digital signal processor to update, at a current time instance, the start state based rejection model having a single state and the key phrase model having a plurality of states based on scores of sub-phonetic units representative of received audio input, wherein to update the start state based rejection model and the key phrase model, the digital signal processor is to provide a transition of a score from a particular state of the plurality of states of the key phrase model to a next state of the plurality of states of the key phrase model and to the single state of the rejection model and to generate a rejection likelihood score corresponding to the single state of the start state based rejection model and a key phrase likelihood score corresponding to the key phrase model; and to determine whether the received audio input is associated with the predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score. - View Dependent Claims (18, 19, 20, 21)
-
-
22. A system for performing key phrase detection comprising:
-
a memory configured to store an acoustic model, a start state based rejection model, and a key phrase model associated with a predetermined key phrase; and a digital signal processor coupled to the memory, the digital signal processor to update a start state based rejection model and a key phrase model associated with a predetermined key phrase based on scores of sub-phonetic units representative of received audio input, to determine a rejection likelihood score based on the updated start state based rejection model, to determine an overall key phrase likelihood score comprising a minimum of a first likelihood score associated with a first state of the key phrase model and a second likelihood score associated with a second state of the key phrase model, and to determine whether the received audio input is associated with the predetermined key phrase based on the rejection likelihood score and the overall key phrase likelihood score. - View Dependent Claims (23, 24, 25)
-
Specification