Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch
First Claim
Patent Images
1. A method for voice recognition, the method comprising:
- obtaining a voice signal for an utterance of a speaker;
determining a runtime pitch from the voice signal for the utterance;
categorizing the speaker as male, female or child based on the runtime pitch;
using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.
4 Assignments
0 Petitions
Accused Products
Abstract
Voice recognition methods and systems are disclosed. A voice signal is obtained for an utterance of a speaker. A runtime pitch is determined from the voice signal for the utterance. The speaker is categorized based on the runtime pitch and one or more acoustic model parameters are adjusted based on a categorization of the speaker. The parameter adjustment may be performed at any instance of time during the recognition. A voice recognition analysis of the utterance is then performed based on the acoustic model.
133 Citations
13 Claims
-
1. A method for voice recognition, the method comprising:
-
obtaining a voice signal for an utterance of a speaker; determining a runtime pitch from the voice signal for the utterance; categorizing the speaker as male, female or child based on the runtime pitch; using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13)
-
-
10. A voice recognition system, comprising:
-
an interface adapted to obtain a voice signal; one or more processors coupled to the interface; and a memory coupled to the interface and the processor, the memory having embodied therein a set of processor readable instructions for configured to implement a method for voice recognition, the processor readable instructions including; an instruction for obtaining a voice signal for an utterance of a speaker; an instruction for determining a runtime pitch from the voice signal for the utterance; an instruction for categorizing the speaker as male, female or child based on the runtime pitch; an instruction for using the categorization as a basis for dynamically adjusting a maximum frequency fmax and a minimum frequency fmin of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.
-
Specification