VOICE RECOGNITION WITH DYNAMIC FILTER BANK ADJUSTMENT BASED ON SPEAKER CATEGORIZATION
First Claim
Patent Images
1. A method for voice recognition, the method comprising:
- obtaining a voice signal for an utterance of a speaker;
categorizing the speaker as male, female or child;
using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.
1 Assignment
0 Petitions
Accused Products
Abstract
Voice recognition methods and systems are disclosed. A voice signal is obtained for an utterance of a speaker. The speaker is categorized as a male, female, or child and the categorization is used as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output. Corresponding gender or age specific acoustic models are used to perform voice recognition based on the filter bank output.
35 Citations
14 Claims
-
1. A method for voice recognition, the method comprising:
-
obtaining a voice signal for an utterance of a speaker; categorizing the speaker as male, female or child; using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A voice recognition system, comprising:
-
an interface adapted to obtain a voice signal; one or more processors coupled to the interface; and a memory coupled to the interface and the processor, the memory having embodied therein a set of processor readable instructions for configured to implement a method for voice recognition, the processor readable instructions including; an instruction for obtaining a voice signal for an utterance of a speaker; an instruction for categorizing the speaker as male, female, or child; an instruction for using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.
-
Specification