Interactive speech recognition with varying responses for time of day and environmental conditions
First Claim
1. An interactive speech recognition device, comprising:
- speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech;
detection means for detecting variable non-speech data that changes speech flowing from the speech recognition device;
coefficient setting means, responsive to the variable non-speech data, for generating a plurality of weighting coefficients each pre-assigned to a pre-registered recognition target speech, based on the variable non-speech data;
speech recognition means for computing a final recognition result in response to the speech data pattern, said speech recognition means including;
means for storing a plurality of pre-registered recognition target speeches and for outputting, in response to the speech data pattern, a plurality of recognition data values each for a corresponding pre-registered recognition target speech,means for computing final recognition data by multiplying each recognition data value by a corresponding one of said pre-assigned weighting coefficients for a corresponding pre-registered recognition target speech, andmeans for recognizing the input speech by comparing the final recognition data for all of the pre-registered recognition target speeches and for outputting a final recognition result; and
speech synthesis means for converting the final recognition result to corresponding speech synthesis data for producing an appropriate response to the input speech.
1 Assignment
0 Petitions
Accused Products
Abstract
The invention improves recognition rates by providing an interactive speech recognition device that performs recognition by taking situational and environmental changes into consideration, thus enabling interactions that correspond to situational and environmental changes. The invention comprises a speech analysis unit that creates a speech data pattern corresponding to the input speech; a timing circuit for generating time data, for example, as variable data; a coefficient setting unit receiving the time data from the timing circuit and generating weighting coefficients that change over time, in correspondence to the content of each recognition target speech; a speech recognition unit that receives the speech data pattern of the input speech from the speech analysis unit, and that at the same time obtains a weighting coefficient in effect for a pre-registered recognition target speech at the time from the coefficient setting unit, that computes final recognition data by multiplying the recognition data corresponding to each recognition target speech by its corresponding weighting coefficient, and that recognizes the input speech based on the computed final recognition result; a speech synthesis unit for outputting speech synthesis data based on the recognition data that takes the weighting coefficient into consideration; and a drive control unit for transmitting the output from the speech synthesis unit to the outside.
-
Citations
12 Claims
-
1. An interactive speech recognition device, comprising:
-
speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech; detection means for detecting variable non-speech data that changes speech flowing from the speech recognition device; coefficient setting means, responsive to the variable non-speech data, for generating a plurality of weighting coefficients each pre-assigned to a pre-registered recognition target speech, based on the variable non-speech data; speech recognition means for computing a final recognition result in response to the speech data pattern, said speech recognition means including; means for storing a plurality of pre-registered recognition target speeches and for outputting, in response to the speech data pattern, a plurality of recognition data values each for a corresponding pre-registered recognition target speech, means for computing final recognition data by multiplying each recognition data value by a corresponding one of said pre-assigned weighting coefficients for a corresponding pre-registered recognition target speech, and means for recognizing the input speech by comparing the final recognition data for all of the pre-registered recognition target speeches and for outputting a final recognition result; and speech synthesis means for converting the final recognition result to corresponding speech synthesis data for producing an appropriate response to the input speech. - View Dependent Claims (2, 3)
-
-
4. An interactive speech recognition device, comprising:
-
speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech; speech recognition means for generating recognition data that corresponds to the input speech based on the speech data pattern created by said speech analysis means; timing means for generating time data; response content level storage means for storing information relating to passage of time relative to a response content level; response content level generation means for storing time ranges for a plurality of response content levels, said response content level generation means being responsive to the time data from said timing means, the recognition data from said speech recognition means, and the information from said response content level storage means, for generating a response content level value according to passage of time; response content creation means, responsive to the recognition data from said speech recognition means and the response content level value from said response content level generation means, for determining response content data appropriate for the response content level value generated by said response content level generation means; and speech synthesis means for converting the response content data from said response content creation means to corresponding speech synthesis data for producing an appropriate response to the input speech.
-
-
5. An interactive speech recognition device, comprising;
-
speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech; speech recognition means for generating recognition data that corresponds to the input speech, based on the speech data pattern from said speech analysis means; detection means for detecting variable non-speech data that changes speech flowing from the speech recognition device; response content creation means, responsive to the variable non-speech data from said detection means and the recognition data from said speech recognition means, for outputting response content data, based on the recognition data by taking the variable non-speech data into consideration, speech synthesis means for converting the response content data to corresponding speech synthesis data for producing an appropriate response to the input speech. - View Dependent Claims (6, 7, 8, 9)
-
-
10. An interactive speech recognition device, comprising:
-
speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech; detection means for detecting time data that changes speech flowing from the speech recognition device; coefficient setting means, responsive to the detected time data, for generating a plurality of weighting coefficients each pre-assigned to a pre-registered recognition target speech, based on the time data; speech recognition means for computing a final recognition result in response to the speech data pattern, said speech recognition means including; means for storing a plurality of pre-registered recognition target speeches and for outputting, in response to the speech data pattern, a plurality of recognition data values each for a corresponding pre-registered recognition target speech, means for computing final recognition data by multiplying each recognition data value by a corresponding one of said pre-assigned weighting coefficient for a corresponding pre-registered recognition target speech, and means for recognizing the input speech by comparing the final recognition data for all of the pre-registered recognition target speeches and for outputting a final recognition result; and speech synthesis means for converting the final recognition result to corresponding speech synthesis data for producing an appropriate response to the input speech.
-
-
11. An interactive speech recognition device, comprising:
-
speech analysis means for analyzing an input speech and creating a speech data pattern that matches characteristics of the input speech; speech recognition means for generating recognition data that corresponds to the input speech, based on the speech data pattern from said speech analysis means; detection means for detecting variable non-speech data that changes speech flowing from the speech recognition device; response content creation means, responsive to the variable non-speech data from said detection means and the recognition data from said speech recognition means, for outputting response content data, based on the recognition data by taking the variable data into consideration; an operating mechanism and a drive control unit responsive to the response content data for controlling the operating mechanism. - View Dependent Claims (12)
-
Specification