Bifurcated speaker specific and non-speaker specific speech recognition method and apparatus
Abstract
Bifurcated speaker specific and non-speaker specific method and apparatus is provided for enabling speech-based remote control and for recognizing the speech of an unspecified speaker at extremely high recognition rates regardless of the speaker's age, sex, or individual speech mannerisms. A device main unit is provided with a speech recognition processor for recognizing speech and taking an appropriate action, and with a user terminal containing specific speaker capture and/or preprocessing capabilities. The user terminal exchanges data with the speech recognition processor using radio transmission. The user terminal may be provided with a conversion rule generator that compares the speech of a user with previously compiled standard speech feature data and, based on this comparison result, generates a conversion rule for converting the speaker's speech feature parameters to corresponding standard speaker's feature information. The speech recognition processor, in turn, may reference the conversion rule developed in the user terminal and perform speech recognition based on the input speech feature parameters that have been converted above.
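As an illustration only, the conversion rule generator described in the abstract can be sketched as follows. The abstract does not state what form the conversion rule takes, so this sketch assumes a simple per-dimension offset between the user's feature vectors and the standard feature vectors for the same training words. The function names `generate_conversion_rule` and `convert_features`, and the dictionary layout of the feature data, are hypothetical and do not come from the patent.

```python
from statistics import fmean


def generate_conversion_rule(speaker_features, standard_features):
    """Build a conversion rule by comparing a user's features with standard ones.

    Both arguments map a training word to a feature vector (list of floats).
    ASSUMPTION: the rule is a per-dimension mean offset (standard minus
    speaker); the patent abstract does not specify the rule's actual form.
    """
    shared_words = speaker_features.keys() & standard_features.keys()
    if not shared_words:
        raise ValueError("no training words shared by both feature sets")
    dims = len(next(iter(standard_features.values())))
    return [
        fmean(standard_features[w][d] - speaker_features[w][d] for w in shared_words)
        for d in range(dims)
    ]


def convert_features(features, rule):
    """Apply the offset rule so the user's features approximate standard ones."""
    return [value + offset for value, offset in zip(features, rule)]


# Hypothetical two-dimensional feature vectors for two training words.
speaker = {"hello": [1.0, 2.0], "stop": [3.0, 4.0]}
standard = {"hello": [2.0, 2.5], "stop": [4.0, 4.5]}

rule = generate_conversion_rule(speaker, standard)
converted = convert_features([1.0, 2.0], rule)
```

In the arrangement the abstract describes, the rule would be generated in the user terminal, and only converted feature parameters (or the rule itself) would need to reach the speech recognition processor over the radio link.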
18 Claims
1. A speech recognition device, comprising:
a data processing terminal, comprising:
  a speech input unit to receive sounds including speech and translate the received speech into digital form;
  a speech analyzer coupled to said speech input unit to generate voice feature parameters for the received digitized speech; and
  a speaker accommodation unit comprising:
    a first feature reference memory for storing pre-registered non-specific speaker feature information,
    a conversion rule generated in advance highlighting variations between previously stored specific speaker feature information and the pre-registered non-specific speaker feature information, and
    a feature converter for generating converted voice feature parameters received from said speech analyzer based on the conversion rule, and
a speech recognition processor, comprising:
  a second feature reference memory for storing standard feature information corresponding to pre-registered phrases;
  a phrase detector to determine whether the converted voice feature parameters substantially match any pre-registered phrases in said second feature reference memory and generate phrase detection data in response thereto; and
  a comprehension controller coupled to said phrase detector to receive the phrase detection data, to recognize a meaning of the received speech based on the received phrase detection data, and to perform at least one of controlling an action and formulating an appropriate response responsive to the recognized meaning;
wherein said data processing terminal transmits the converted voice feature parameters to said speech recognition processor which is in radio frequency communication with said data processing terminal to receive the converted voice feature parameters.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
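For readers who want a concrete picture of the recognition-processor side of claim 1, the phrase detector and comprehension controller might be sketched roughly as below. This is not the claimed implementation: it assumes that "substantially match" can be modeled as a Euclidean distance falling under a threshold, and every name here (`detect_phrase`, `comprehend`, the `threshold` parameter, the template and action tables) is hypothetical.

```python
import math


def detect_phrase(converted, templates, threshold=1.0):
    """Return (phrase, distance) for the closest pre-registered phrase.

    `templates` maps a phrase to its standard feature vector.
    ASSUMPTION: a "substantial match" means Euclidean distance <= threshold;
    the claim does not define the matching criterion.
    Returns None when no template is close enough.
    """
    if not templates:
        return None
    phrase, template = min(
        templates.items(), key=lambda item: math.dist(converted, item[1])
    )
    distance = math.dist(converted, template)
    return (phrase, distance) if distance <= threshold else None


def comprehend(detection, actions):
    """Map phrase detection data to an action or a fallback response."""
    if detection is None:
        return "no phrase recognized"
    phrase, _distance = detection
    return actions.get(phrase, "no action registered")


# Hypothetical pre-registered phrases and the actions tied to them.
templates = {"lights on": [1.0, 0.0], "lights off": [0.0, 1.0]}
actions = {"lights on": "switch_on", "lights off": "switch_off"}

detection = detect_phrase([0.9, 0.1], templates)
response = comprehend(detection, actions)
```

The input to `detect_phrase` stands in for the converted voice feature parameters that the data processing terminal would transmit by radio; the transport itself is left out of the sketch.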
10. A speech recognition device, comprising:
a speech input unit to receive sounds including speech and translate the received speech into digital form;
a speech analyzer coupled to said speech input unit to generate voice feature parameters for the received digitized speech;
a data processing terminal including a speaker accommodation unit comprising:
  a first feature reference memory for storing pre-registered non-specific speaker feature information,
  a conversion rule generated in advance highlighting variations between previously stored specific speaker feature information and the pre-registered non-specific speaker feature information, and
  a feature converter for generating converted voice feature parameters received from said speech analyzer based on the conversion rule, and
a speech recognition processor, comprising:
  a second feature reference memory for storing standard feature information corresponding to pre-registered phrases;
  a phrase detector to determine whether the converted voice feature parameters substantially match any pre-registered phrases in said second feature reference memory and generate phrase detection data in response thereto; and
  a comprehension controller coupled to said phrase detector to receive the phrase detection data, to recognize a meaning of the received speech based on the received phrase detection data, and to perform at least one of controlling an action and formulating an appropriate response responsive to the recognized meaning;
wherein said speech analyzer transmits the voice feature parameters to said data processing terminal which is in radio frequency communication with said speech analyzer to receive the voice feature parameters.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
Specification