ADAPTIVE AUDIO FRAME PROCESSING FOR KEYWORD DETECTION
First Claim
1. A method of detecting a target keyword from an input sound for activating a function in a mobile device, the method comprising:
- receiving a first plurality of sound features in a buffer;
receiving a second plurality of sound features in the buffer;
while receiving at least one sound feature of the second plurality of sound features in the buffer, processing a first number of sound features from the buffer, the first number of sound features including two or more sound features;
determining a keyword score for at least one sound feature of the processed sound features; and
detecting the input sound as the target keyword if the keyword score is greater than a threshold score.
1 Assignment
0 Petitions
Accused Products
Abstract
A method of detecting a target keyword from an input sound for activating a function in a mobile device is disclosed. In this method, a first plurality of sound features is received in a buffer, and a second plurality of sound features is received in the buffer. While receiving each of the second plurality of sound features in the buffer, a first number of the sound features are processed from the buffer. The first number of the sound features includes two or more sound features. Further, the method may include determining a keyword score for each of the processed sound features and detecting the input sound as the target keyword if at least one of the keyword scores is greater than a threshold score.
40 Citations
30 Claims
-
1. A method of detecting a target keyword from an input sound for activating a function in a mobile device, the method comprising:
-
receiving a first plurality of sound features in a buffer; receiving a second plurality of sound features in the buffer; while receiving at least one sound feature of the second plurality of sound features in the buffer, processing a first number of sound features from the buffer, the first number of sound features including two or more sound features; determining a keyword score for at least one sound feature of the processed sound features; and detecting the input sound as the target keyword if the keyword score is greater than a threshold score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A mobile device, comprising:
-
a buffer configured to store a first plurality of sound features and a second plurality of sound features; a feature processing unit configured to process a first number of sound features from the buffer while the buffer receives each of the second plurality of sound features, the first number of the sound features including two or more sound features; a keyword score calculation unit configured to determine a keyword score for each of the processed sound features; and a keyword detection unit configured to detect an input sound as a target keyword if at least one of the keyword scores is greater than a threshold score. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A mobile device, comprising:
-
means for storing sound features, wherein the sound features include a first plurality of sound features and a second plurality of sound features; means for processing a first number of sound features from the means for storing the sound features while the means for storing the sound features receives each of the second plurality of sound features, the first number of the sound features including two or more sound features; means for determining a keyword score for each of the processed sound features; and means for detecting an input sound as a target keyword if at least one of the keyword scores is greater than a threshold score. - View Dependent Claims (26, 27)
-
-
28. A non-transitory computer-readable storage medium storing instructions for detecting a target keyword from an input sound for activating a function in a mobile device, the instructions causing a processor to perform operations, the operations comprising:
-
receiving a first plurality of sound features in a buffer; receiving a second plurality of sound features in the buffer; while receiving each of the second plurality of sound features in the buffer, processing a first number of the sound features from the buffer, the first number of the sound features including two or more sound features; determining a keyword score for each of the processed sound features; and detecting the input sound as the target keyword if at least one of the keyword scores is greater than a threshold score. - View Dependent Claims (29, 30)
-
Specification