VOICE RECOGNITION WITH DYNAMIC FILTER BANK ADJUSTMENT BASED ON SPEAKER CATEGORIZATION

US 20100324898A1
Filed: 07/21/2010
Published: 12/23/2010
Est. Priority Date: 02/21/2006
Status: Active Grant


Abstract

Voice recognition methods and systems are disclosed. A voice signal is obtained for an utterance of a speaker. The speaker is categorized as a male, female, or child, and the categorization is used as a basis for dynamically adjusting a maximum frequency f_max and a minimum frequency f_min of a filter bank used for processing the input utterance to produce an output. Corresponding gender or age specific acoustic models are used to perform voice recognition based on the filter bank output.
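To make the role of f_min and f_max concrete, the sketch below shows one way the two limits could determine where a filter bank's bands sit. It assumes a mel-spaced filter bank, which the text above does not specify; the function names, the mel conversion formula, and the choice of 20 filters are illustrative assumptions, and the 70 Hz and 3800 Hz limits are borrowed from claim 9.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    # Common mel-scale approximation (an assumption; the patent text names no scale).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def band_edges_hz(f_min: float, f_max: float, n_filters: int) -> list[float]:
    """Return n_filters + 2 band-edge frequencies in Hz, evenly spaced on the mel scale.

    Filter i would span edges[i] .. edges[i + 2], centered at edges[i + 1].
    """
    if not 0.0 <= f_min < f_max:
        raise ValueError("expected 0 <= f_min < f_max")
    if n_filters < 1:
        raise ValueError("expected at least one filter")
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + i * step) for i in range(n_filters + 2)]


# Illustrative limits taken from claim 9 (speaker categorized as a man).
edges = band_edges_hz(f_min=70.0, f_max=3800.0, n_filters=20)
```

Changing f_min or f_max moves every band edge, so re-running `band_edges_hz` with a different pair of limits is one way the adjustment described above could be realized at runtime.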

14 Claims

1. A method for voice recognition, the method comprising:
- obtaining a voice signal for an utterance of a speaker;
- categorizing the speaker as male, female or child;
- using the categorization as a basis for dynamically adjusting a maximum frequency f_max and a minimum frequency f_min of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.
  - 2. The method of claim 1 wherein categorizing the speaker includes determining the speaker's age and/or gender.
  - 3. The method of claim 2 wherein determining the speaker's age and/or gender includes determining whether the runtime pitch falls into a range, wherein the range depends on the speaker's age and/or gender.
  - 4. The method of claim 2 wherein determining the speaker's age and/or gender includes determining from the pitch whether the speaker is a male, female or child speaker.
  - 5. The method of claim 1 wherein the one or more acoustic model parameters include a maximum frequency f_max and a minimum frequency f_min for a filter bank used in performing the voice recognition analysis.
  - 6. The method of claim 5 wherein the values of f_max and f_min are chosen based on a gender and/or an age of the speaker as determined during categorizing the speaker based on the runtime pitch.
  - 7. The method of claim 5 wherein the values of f_max and f_min are chosen based on whether the speaker is a male, female or child speaker during categorizing the speaker based on the runtime pitch.
  - 8. The method of claim 5 wherein the f_min and f_max are adjusted dynamically at any instance of time during the recognition.
  - 9. The method of claim 5, wherein the minimum frequency f_min is about 70 Hz and the maximum frequency f_max is about 3800 Hz if the speaker is categorized as a man.
  - 10. The method of claim 5, wherein the minimum frequency f_min is about 70 Hz and the maximum frequency f_max is about 4200 Hz if the speaker is categorized as a woman.
  - 11. The method of claim 5, wherein the minimum frequency f_min is about 90 Hz and the maximum frequency f_max is about 4400 Hz if the speaker is categorized as a child.
  - 12. The method of claim 1, further comprising storing the speaker categorization and/or the one or more acoustic model parameters based on the categorization of the speaker, and associating the speaker categorization of the speaker and/or the one or more acoustic model parameters based on the categorization of the speaker with a particular speaker.
  - 13. The method of claim 12, further comprising using the stored speaker categorization and/or the one or more acoustic model parameters based on the categorization of the speaker during a subsequent voice recognition analysis for the speaker.
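The dependent claims can be read as a small lookup procedure: estimate the runtime pitch, place it in a category, pick the filter bank limits for that category, and remember the result for the speaker. The following is a minimal sketch under stated assumptions. The frequency limits come from claims 9 to 11, but the pitch boundaries (165 Hz and 255 Hz), the function and class names, and the caching policy are hypothetical and are not given anywhere in the claims.

```python
# Filter bank limits per category as (f_min, f_max) in Hz, from claims 9 to 11.
FILTER_RANGE_HZ = {
    "male": (70.0, 3800.0),
    "female": (70.0, 4200.0),
    "child": (90.0, 4400.0),
}

# HYPOTHETICAL pitch boundaries: the claims only say that the pitch
# "falls into a range" and do not state where the ranges begin or end.
MALE_MAX_PITCH_HZ = 165.0
FEMALE_MAX_PITCH_HZ = 255.0


def categorize_by_pitch(pitch_hz: float) -> str:
    """Map a runtime pitch estimate to 'male', 'female' or 'child' (claims 3 and 4)."""
    if pitch_hz <= 0.0:
        raise ValueError("pitch must be positive")
    if pitch_hz < MALE_MAX_PITCH_HZ:
        return "male"
    if pitch_hz < FEMALE_MAX_PITCH_HZ:
        return "female"
    return "child"


class SpeakerProfiles:
    """Stores a categorization per speaker and reuses it later (claims 12 and 13)."""

    def __init__(self) -> None:
        self._category: dict[str, str] = {}

    def category_for(self, speaker_id: str, pitch_hz: float) -> str:
        # Categorize only the first time a speaker is seen; reuse afterwards.
        if speaker_id not in self._category:
            self._category[speaker_id] = categorize_by_pitch(pitch_hz)
        return self._category[speaker_id]

    def filter_range_for(self, speaker_id: str, pitch_hz: float) -> tuple[float, float]:
        # Returns (f_min, f_max) in Hz for the speaker's category.
        return FILTER_RANGE_HZ[self.category_for(speaker_id, pitch_hz)]


profiles = SpeakerProfiles()
f_min, f_max = profiles.filter_range_for("speaker-1", pitch_hz=300.0)
```

Reusing the stored category trades responsiveness for stability. A system that follows claim 8 more literally could instead call `categorize_by_pitch` on every utterance and update the limits whenever the category changes.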

14. A voice recognition system, comprising:
- an interface adapted to obtain a voice signal;
- one or more processors coupled to the interface; and
- a memory coupled to the interface and the processor, the memory having embodied therein a set of processor readable instructions configured to implement a method for voice recognition, the processor readable instructions including:
  - an instruction for obtaining a voice signal for an utterance of a speaker;
  - an instruction for categorizing the speaker as male, female, or child;
  - an instruction for using the categorization as a basis for dynamically adjusting a maximum frequency f_max and a minimum frequency f_min of a filter bank used for processing the input utterance to produce an output, and using corresponding gender or age specific acoustic models to perform voice recognition based on the filter bank output.

Current Assignee: Sony Interactive Entertainment Inc. (Sony Group Corp.)
Original Assignee: Sony Computer Entertainment Incorporated (Sony Group Corp.)
Inventors: Chen, Ruxin
Granted Patent: US 8,050,922 B2
US Class Current: 704/246
CPC Class Codes:
G10L 15/065   Adaptation
G10L 17/00   Speaker identification or v...
G10L 25/90   Pitch determination of spee...
