SPEECH RECOGNITION APPARATUS
First Claim
1. A speech recognition apparatus for allowing setting by speech, comprising:
- an input unit configured to input a setting instruction by speech;
a speech interpretation unit configured to recognize and interpret contents of the setting instruction by speech to generate first structured data containing candidates of the interpretation result;
an instruction input detecting unit configured to detect a setting instruction input by a user;
an instruction input interpretation unit configured to interpret contents of the setting instruction input to generate second structured data; and
a selection unit configured to select one of the interpretation candidates contained in the first structured data on the basis of the second structured data.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition apparatus that enables efficient multimodal input in setting a plurality of items by one utterance is provided. An input unit inputs a setting instruction by speech. A speech interpretation unit recognizes and interprets the contents of the setting instruction by speech to generate first structured data containing candidates of the interpretation result. An instruction input detecting unit detects a setting instruction input by a user. An instruction input interpretation unit interprets the contents of the setting instruction input to generate second structured data. A selection unit selects one of the interpretation candidates contained in the first structured data based on the second structured data.
44 Citations
22 Claims
-
1. A speech recognition apparatus for allowing setting by speech, comprising:
-
an input unit configured to input a setting instruction by speech;
a speech interpretation unit configured to recognize and interpret contents of the setting instruction by speech to generate first structured data containing candidates of the interpretation result;
an instruction input detecting unit configured to detect a setting instruction input by a user;
an instruction input interpretation unit configured to interpret contents of the setting instruction input to generate second structured data; and
a selection unit configured to select one of the interpretation candidates contained in the first structured data on the basis of the second structured data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 17, 18)
-
-
11. A speech recognition apparatus for allowing setting by speech, comprising:
-
an input unit configured to input a setting instruction by speech;
a feature extraction unit configured to extract a feature parameter string from the speech input by said input unit;
a search unit configured to search for a pattern resembling to the feature parameter string extracted by said feature extraction unit most from predetermined phoneme sequence pattern candidates; and
an instruction input detecting unit configured to detect a setting instruction input by a user, said search unit narrowing down the phoneme sequence pattern candidates on the basis of the setting instruction input detected by said instruction input detecting unit. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
19. A method for setting a device by using speech recognition, comprising the steps of:
-
inputting a setting instruction by speech;
recognizing and interpreting contents of the setting instruction by speech to generate first structured data containing candidates of the interpretation result;
detecting a setting instruction input by a user;
interpreting contents of the detected setting instruction input to generate second structured data; and
selecting one of the interpretation candidates contained in the first structured data on the basis of the second structured data.
-
-
20. A method for setting a device by using speech recognition, comprising the steps of:
-
inputting a setting instruction by speech;
extracting a feature parameter string from the input speech;
searching for a pattern resembling to the extracted feature parameter string most from predetermined phoneme sequence pattern candidates; and
detecting a setting instruction input by a user, wherein in the search step, the phoneme sequence pattern candidates are narrowed down on the basis of the detected setting instruction input.
-
-
21. A computer program stored on a computer-readable medium for setting device options using speech recognition, the program comprising code for performing the following steps of:
-
inputting a setting instruction by speech;
recognizing and interpreting contents of the setting instruction by speech to generate first structured data containing candidates of the interpretation result;
detecting a setting instruction input by a user;
interpreting contents of the detected setting instruction input to generate second structured data; and
selecting one of the interpretation candidates contained in the first structured data on the basis of the second structured data.
-
-
22. A computer program stored on a computer-readable medium for setting device options using speech recognition, the program comprising code for performing the following steps of:
-
inputting a setting instruction by speech;
extracting a feature parameter string from the input speech;
searching for a pattern resembling to the extracted feature parameter string most from predetermined phoneme sequence pattern candidates; and
detecting a setting instruction input by a user, wherein in the search step, the phoneme sequence pattern candidates are narrowed down on the basis of the detected setting instruction input.
-
Specification