Low resource key phrase detection for wake on voice
First Claim
1. A computer-implemented method for key phrase detection comprising:
- receiving a time series of scores of sub-phonetic units based on received audio input;
- updating a start state based rejection model and a key phrase model associated with a predetermined key phrase based on at least some of the time series of scores of sub-phonetic units to generate a key phrase score, wherein the start state based rejection model has a single rejection state comprising one or more rejection model self loops each associated with a particular score of the scores of sub-phonetic units and the key phrase model comprises a plurality of key phrase states interconnected by transitions therebetween with each of the key phrase states comprising a self loop associated with a particular score of the scores of sub-phonetic units; and
- determining whether the received audio input is associated with the predetermined key phrase based on the key phrase score.
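
The update step recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation. It assumes log-domain scores, a max-based (Viterbi-style) update, a left-to-right chain of key phrase states, and a transition from the rejection state into the first key phrase state; none of these details is stated in the claim itself. The names `update_models` and `key_phrase_score` and the unit indices are hypothetical.

```python
NEG_INF = float("-inf")


def update_models(state_scores, frame_scores, rejection_loops, key_phrase_loops):
    """Advance both models by one frame of sub-phonetic unit scores.

    state_scores[0] is the single rejection (start) state; state_scores[1:]
    are the key phrase states in order. All scores are assumed to be logs.
    """
    new_scores = [NEG_INF] * len(state_scores)
    # Rejection state: stay in place, taking the best of its self-loop scores.
    new_scores[0] = state_scores[0] + max(frame_scores[u] for u in rejection_loops)
    # Key phrase states: stay via the self loop or enter from the previous state
    # (assumption: state 1 is entered from the rejection state).
    for i, unit in enumerate(key_phrase_loops, start=1):
        best_prev = max(state_scores[i], state_scores[i - 1])
        new_scores[i] = best_prev + frame_scores[unit]
    return new_scores


def key_phrase_score(frames, rejection_loops, key_phrase_loops):
    """Run the update over a time series; return (key phrase, rejection) scores."""
    scores = [0.0] + [NEG_INF] * len(key_phrase_loops)
    for frame in frames:
        scores = update_models(scores, frame, rejection_loops, key_phrase_loops)
    return scores[-1], scores[0]


# Toy input: unit 0 sits on the rejection self loop, units 1 and 2 on the
# self loops of two key phrase states.
frames = [
    [-0.1, -3.0, -3.0],
    [-3.0, -0.1, -3.0],
    [-3.0, -0.1, -3.0],
    [-3.0, -3.0, -0.1],
]
kp, rej = key_phrase_score(frames, rejection_loops=[0], key_phrase_loops=[1, 2])
```

In this toy run the score of the final key phrase state ends well above the score of the rejection state, which is the situation in which a detector would report the key phrase.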
Abstract
Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.
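
The abstract does not say how the two likelihood scores are combined into a decision. One plausible reading, shown here purely as an assumption, is to treat both as log likelihoods and compare their difference with a tunable threshold. The function name `is_key_phrase` and the default threshold are hypothetical.

```python
def is_key_phrase(key_phrase_likelihood, rejection_likelihood, threshold=0.0):
    """Return True when the key phrase is judged to be present.

    Assumption: both inputs are log likelihoods, so their difference is a
    log likelihood ratio; a larger threshold trades missed detections for
    fewer false wake-ups.
    """
    return (key_phrase_likelihood - rejection_likelihood) > threshold


detected = is_key_phrase(-0.4, -9.1, threshold=5.0)
not_detected = is_key_phrase(-9.0, -1.0)
```

In a wake on voice setting, a threshold of this kind would typically be tuned on recorded audio rather than fixed in advance.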
62 Citations
20 Claims
1. A computer-implemented method for key phrase detection comprising:
- receiving a time series of scores of sub-phonetic units based on received audio input;
- updating a start state based rejection model and a key phrase model associated with a predetermined key phrase based on at least some of the time series of scores of sub-phonetic units to generate a key phrase score, wherein the start state based rejection model has a single rejection state comprising one or more rejection model self loops each associated with a particular score of the scores of sub-phonetic units and the key phrase model comprises a plurality of key phrase states interconnected by transitions therebetween with each of the key phrase states comprising a self loop associated with a particular score of the scores of sub-phonetic units; and
- determining whether the received audio input is associated with the predetermined key phrase based on the key phrase score.
Dependent claims: 2, 3, 4, 5, 6.
7. At least one non-transitory machine readable medium comprising a plurality of instructions that, in response to being executed on a device, cause the device to perform key phrase detection by:
- receiving a time series of scores of sub-phonetic units based on received audio input;
- updating a start state based rejection model and a key phrase model associated with a predetermined key phrase based on at least some of the time series of scores of sub-phonetic units to generate a key phrase score, wherein the start state based rejection model has a single rejection state comprising one or more rejection model self loops each associated with a particular score of the scores of sub-phonetic units and the key phrase model comprises a plurality of key phrase states interconnected by transitions therebetween with each of the key phrase states comprising a self loop associated with a particular score of the scores of sub-phonetic units; and
- determining whether the received audio input is associated with the predetermined key phrase based on the key phrase score.
Dependent claims: 8, 9, 10, 11, 12.
13. A system for performing key phrase detection comprising:
- a memory configured to store a start state based rejection model and a key phrase model associated with a predetermined key phrase; and
- a processor coupled to the memory, the processor to receive a time series of scores of sub-phonetic units based on received audio input, to update the start state based rejection model and the key phrase model based on at least some of the time series of scores of sub-phonetic units to generate a key phrase score, wherein the start state based rejection model has a single rejection state comprising one or more rejection model self loops each associated with a particular score of the scores of sub-phonetic units and the key phrase model comprises a plurality of key phrase states interconnected by transitions therebetween with each of the key phrase states comprising a self loop associated with a particular score of the scores of sub-phonetic units, and to determine whether the received audio input is associated with the predetermined key phrase based on the key phrase score.
Dependent claims: 14, 15, 16, 17, 18, 19, 20.
Specification