Speech recognition based interactive information retrieval scheme using dialogue control to reduce user stress
First Claim
1. A method of speech recognition based interactive information retrieval for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, comprising the steps of:
- (a) storing retrieval key candidates that constitute a number of data that cannot be processed by the speech recognition processing in a prescribed processing time, as recognition target words in a speech recognition database, the recognition target words being divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time and that have relatively higher importance levels based on statistical information defined for the recognition target words, and non-prioritized recognition target words other than the prioritized recognition target words;
(b) requesting the user by a speech dialogue with the user to enter a speech input indicating the retrieval key, and carrying out the speech recognition processing for the speech input with respect to the prioritized recognition target words to obtain a recognition result;
(c) carrying out a confirmation process using a speech dialogue with the user according to the recognition result to determine the retrieval key, when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user;
(d) carrying out a related information query using a speech dialogue with the user to request the user to enter another speech input indicating a related information of the retrieval key, when the recognition result does not satisfy the prescribed condition;
(e) carrying out the speech recognition processing for the another speech input to obtain another recognition result, and adjusting the recognition result according to the another recognition result to obtain adjusted recognition result; and
(f) repeating the step (c) or the steps (d) and (e) using the adjusted recognition result in place of the recognition result, until the retrieval key is determined.
0 Assignments
0 Petitions
Accused Products
Abstract
In the disclosed speech recognition based interactive information retrieval scheme, the recognition target words in the speech recognition database are divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time and that have relatively higher importance levels based on statistical information, and the other non-prioritized recognition target words. Then, the speech recognition processing for the speech input with respect to the prioritized recognition target words is carried out at higher priority, and a confirmation process is carried out when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user. On the other hand, a related information query to request the user to enter another speech input for a related information of the retrieval key is carried out when the recognition result does not satisfy the prescribed condition, and the original recognition result is adjusted according to the recognition result for another speech input. In this way, the retrieval key determination is realized through natural speech dialogues with the user.
36 Citations
42 Claims
-
1. A method of speech recognition based interactive information retrieval for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, comprising the steps of:
-
(a) storing retrieval key candidates that constitute a number of data that cannot be processed by the speech recognition processing in a prescribed processing time, as recognition target words in a speech recognition database, the recognition target words being divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time and that have relatively higher importance levels based on statistical information defined for the recognition target words, and non-prioritized recognition target words other than the prioritized recognition target words;
(b) requesting the user by a speech dialogue with the user to enter a speech input indicating the retrieval key, and carrying out the speech recognition processing for the speech input with respect to the prioritized recognition target words to obtain a recognition result;
(c) carrying out a confirmation process using a speech dialogue with the user according to the recognition result to determine the retrieval key, when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user;
(d) carrying out a related information query using a speech dialogue with the user to request the user to enter another speech input indicating a related information of the retrieval key, when the recognition result does not satisfy the prescribed condition;
(e) carrying out the speech recognition processing for the another speech input to obtain another recognition result, and adjusting the recognition result according to the another recognition result to obtain adjusted recognition result; and
(f) repeating the step (c) or the steps (d) and (e) using the adjusted recognition result in place of the recognition result, until the retrieval key is determined. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A method of speech recognition based interactive information retrieval for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, comprising the steps of:
-
(a) storing retrieval key candidates that are classified according to attribute values of an attribute item in a speech recognition database;
(b) requesting the user by a speech dialogue with the user to enter a speech input indicating an attribute value of the attribute item for the retrieval key, and carrying out the speech recognition processing for the speech input to obtain a recognition result indicating attribute value candidates and their recognition likelihoods;
(c) selecting those attribute value candidates which have recognition likelihoods that are exceeding a prescribed likelihood threshold as attribute value leading candidates, and extracting those retrieval key candidates that belong to the attribute value leading candidates as new recognition target data;
(d) requesting the user by a speech dialogue with the user to enter another speech input indicating the retrieval key, and carrying out the speech recognition processing for the another speech input with respect to the new recognition target data to obtain another recognition result; and
(e) carrying out a confirmation process using a speech dialogue with the user according to the another recognition result to determine the retrieval key. - View Dependent Claims (19)
-
-
20. A speech recognition based interactive information retrieval apparatus for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, comprising:
-
a speech recognition database configured to store retrieval key candidates that constitute a number of data that cannot be processed by the speech recognition processing in a prescribed processing time, as recognition target words, the recognition target words being divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time and that have relatively higher importance levels based on statistical information defined for the recognition target words, and non-prioritized recognition target words other than the prioritized recognition target words;
a speech recognition unit configured to carry out the speech recognition processing; and
a dialogue control unit configured to carry out speech dialogues with the user;
wherein the dialogue control unit carries out a speech dialogue for requesting the user to enter a speech input indicating the retrieval key, such that the speech recognition unit carries out the speech recognition processing for the speech input with respect to the prioritized recognition target words to obtain a recognition result;
the dialogue control unit carries out a speech dialogue for a confirmation process according to the recognition result to determine the retrieval key, when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user;
the dialogue control unit carries out a speech dialogue for a related information query to request the user to enter another speech input indicating a related information of the retrieval key, when the recognition result does not satisfy the prescribed condition, such that the speech recognition unit carries out the speech recognition processing for the another speech input to obtain another recognition result and the dialogue control unit adjusts the recognition result according to the another recognition result to obtain adjusted recognition result, and the dialogue control unit controls the speech dialogues to repeat the confirmation process or the related information query using the adjusted recognition result in place of the recognition result, until the retrieval key is determined. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
-
-
37. A speech recognition based interactive information retrieval apparatus for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, comprising:
-
a speech recognition database configured to store retrieval key candidates that are classified according to attribute values of an attribute item;
a speech recognition unit configured to carry out the speech recognition processing; and
a dialogue control unit configured to carry out speech dialogues with the user;
wherein the dialogue control unit carries out a speech dialogue for requesting the user to enter a speech input indicating an attribute value of the attribute item for the retrieval key, such that the speech recognition unit carries out the speech recognition processing for the speech input to obtain a recognition result indicating attribute value candidates and their recognition likelihoods;
the dialogue control unit selects those attribute value candidates which have recognition likelihoods that are exceeding a prescribed likelihood threshold as attribute value leading candidates, and extracts those retrieval key candidates that belong to the attribute value leading candidates as new recognition target data;
the dialogue control unit carries out a speech dialogue for requesting the user to enter another speech input indicating the retrieval key, such that the speech recognition unit carries out the speech recognition processing for the another speech input with respect to the new recognition target data to obtain another recognition result; and
the dialogue control unit carries out a speech dialogue for a confirmation process according to the another recognition result to determine the retrieval key. - View Dependent Claims (38)
-
-
39. A computer usable medium having computer readable program codes embodied therein for causing a computer to function as a speech recognition based interactive information retrieval system for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing and a speech recognition database for storing retrieval key candidates that constitute a number of data that cannot be processed by the speech recognition processing in a prescribed processing time, as recognition target words in a speech recognition database, the recognition target words being divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time which have relatively higher importance levels based on statistical information defined for the recognition target words, and non-prioritized recognition target words other than the prioritized recognition target words, the computer readable program codes include:
-
a first computer readable program code for causing said computer to request the user by a speech dialogue with the user to enter a speech input indicating the retrieval key, and carry out the speech recognition processing for the speech input with respect to the prioritized recognition target words to obtain a recognition result;
a second computer readable program code for causing said computer to carry out a confirmation process using a speech dialogue with the user according to the recognition result to determine the retrieval key, when the recognition result satisfies a prescribed condition for judging that the retrieval key can be determined only by a confirmation process with the user;
a third computer readable program code for causing said computer to carry out a related information query using a speech dialogue with the user to request the user to enter another speech input indicating a related information of the retrieval key, when the recognition result does not satisfy the prescribed condition;
a fourth computer readable program code for causing said computer to carry out the speech recognition processing for the another speech input to obtain another recognition result, and adjust the recognition result according to the another recognition result to obtain adjusted recognition result; and
a fifth computer readable program code for causing said computer to repeat processing of the second computer readable program code or the third and fourth computer readable program codes using the adjusted recognition result in place of the recognition result, until the retrieval key is determined.
-
-
40. A computer usable medium storing a data structure to be used as a speech recognition database in a speech recognition based interactive information retrieval system for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing, the data structure comprising:
retrieval key candidates that constitute a number of data that cannot be processed by the speech recognition processing in a prescribed processing time, as recognition target words, the recognition target words being divided into prioritized recognition target words that constitute a number of data that can be processed by the speech recognition processing in the prescribed processing time which have relatively higher importance levels based on statistical information defined for the recognition target words, and non-prioritized recognition target words other than the prioritized recognition target words. - View Dependent Claims (41)
-
42. A computer usable medium having computer readable program codes embodied therein for causing a computer to function as a speech recognition based interactive information retrieval system for ascertaining and retrieving a target information of a user by determining a retrieval key entered by the user using a speech recognition processing and a speech recognition database for storing retrieval key candidates that are classified according to attribute values of an attribute item, the computer readable program codes include:
-
a first computer readable program code for causing said computer to request the user by a speech dialogue with the user to enter a speech input indicating an attribute value of the attribute item for the retrieval key, and carry out the speech recognition processing for the speech input to obtain a recognition result indicating attribute value candidates and their recognition likelihoods;
a second computer readable program code for causing said computer to select those attribute value candidates which have recognition likelihoods that are exceeding a prescribed likelihood threshold as attribute value leading candidates, and extract those retrieval key candidates that belong to the attribute value leading candidates as new recognition target data;
a third computer readable program code for causing said computer to request the user by a speech dialogue with the user to enter another speech input indicating the retrieval key, and carry out the speech recognition processing for the another speech input with respect to the new recognition target data to obtain another recognition result; and
a fourth computer readable program code for causing said computer to carry out a confirmation process using a speech dialogue with the user according to the another recognition result to determine the retrieval key.
-
Specification