Apparatus, method and computer readable memory medium for speech recognition using dynamic programming
First Claim
1. A control system for controlling a telephony system, comprising:
- a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, said speech recognition user interface comprising;
(i) means for receiving an input speech command;
(ii) means for storing a plurality of reference word models;
(iii) means for comparing the input speech command with the stored reference word models to generate a recognition result; and
control means, responsive to the recognition result generated by said speech recognition user interface, for controlling the telephony system in accordance with the input speech command;
where said control means controls the telephony system so as to reproduce a stored message using the recognition result generated by said speech recognition user interface;
wherein said message is stored with an identifier for identifying the caller that has left the message;
wherein said control means is operable to display a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said control means comprises means for predicting, using current system status information, what telephony service is wanted if the user inputs, via said speech recognition user interface, only the identifier of another user.
0 Assignments
0 Petitions
Accused Products
Abstract
A method for matching an input pattern with a number of stored reference patterns using a dynamic programming matching technique is described. The reference patterns of a reference signal which are at the end of a dynamic programming path for a current input pattern are listed in an active list. The dynamic programming paths are propagated by processing the reference patterns on the active list, and a new active list is generated for the succeeding input pattern. The amount of processing required for each pattern on the active list is reduced by using a pointer which identifies the reference pattern which is the earliest in the sequence of patterns of the current reference signal listed on the new active list during the processing of a preceding dynamic programming path. In a second aspect, a speech recognition interface is used as a control system for a telephony system.
-
Citations
88 Claims
-
1. A control system for controlling a telephony system, comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, said speech recognition user interface comprising;
(i) means for receiving an input speech command;
(ii) means for storing a plurality of reference word models;
(iii) means for comparing the input speech command with the stored reference word models to generate a recognition result; and
control means, responsive to the recognition result generated by said speech recognition user interface, for controlling the telephony system in accordance with the input speech command;
where said control means controls the telephony system so as to reproduce a stored message using the recognition result generated by said speech recognition user interface;
wherein said message is stored with an identifier for identifying the caller that has left the message;
wherein said control means is operable to display a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said control means comprises means for predicting, using current system status information, what telephony service is wanted if the user inputs, via said speech recognition user interface, only the identifier of another user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23)
-
-
24. A mail box facility for use with a telephony system, comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling the mail box facility, and for outputting a recognition result based on comparing the input speech commands with pre-stored reference models;
control means, responsive to the recognition result output from said speech recognition user interface, for controlling the mail box facility in accordance with an input speech command;
storage means for storing messages left by callers, when the users are unable to take the calls, receiving means for receiving information of the callers via said speech recognition user interface;
reproducing means for reproducing the messages left by the callers identified by the information received by said receiving means; and
second storage means for storing a pointer that points to a last message reproduced by said reproducing means.
-
-
25. A telephony system comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling telephony services provided by the system, said speech recognition user interface comprising;
(i) means for receiving an input speech command;
(ii) means for storing a plurality of reference models; and
(iii) means for comparing the input speech command with the stored reference models to generate a recognition result; and
execution means, responsive to the recognition result generated by said speech recognition user interface, for executing an operation corresponding to the speech command, wherein each user of the system is identified by a telephone number and an associated identifier, and wherein said execution means comprises;
means for receiving data identifying a current status of the telephony system; and
means for predicting a desired telephony service using the current system status data;
wherein if the user input command identifies a telephony service, then said execution means is operable to provide the desired telephony services identified in the input speech command; and
wherein said execution means is operable to predict a desired telephony service using said predicting means and said current system status data, if said input speech command does not identify a desired telephony service. - View Dependent Claims (26, 27)
-
-
28. A control system for controlling a telephony system, comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, the speech recognition user interface comprising;
(i) an input terminal for receiving an input speech command;
(ii) a memory for storing a plurality of reference word models; and
(iii) a comparator for comparing the input speech command with the stored reference word models to generate a recognition result; and
a controller, responsive to the recognition result generated by said speech recognition user interface, for controlling the telephony system in accordance with the input speech command;
wherein said controller is operable to control the telephony system so as to reproduce a stored message using the recognition result generated by said speech recognition user interface;
wherein said message is stored with an identifier for identifying the caller that has left the message;
wherein said controller is operable to display a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said controller comprises a predictor for predicting, using current system status information, what telephony service is wanted if the user inputs, via said speech recognition user interface, only the identifier of another user. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
51. A telephony system comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling telephony services provided by the system, the speech recognition user interface comprising;
(i) an input terminal for receiving an input speech command;
(ii) a memory for storing a plurality of reference models; and
(iii) a comparator for comparing the input speech command with the stored reference models to generate a recognition result; and
a command executioner, responsive to the recognition result generated by said speech recognition user interface, for executing an operation corresponding to the speech command, wherein each user of the system is identified by a telephone number and an associated identifier, and wherein the command executioner comprises;
an input terminal for receiving data identifying a current status of the telephony system; and
a predictor for predicting a desired telephony service using the current system status data;
wherein if said user input command identifies a telephony service, then said command executioner is operable to provide the desired telephony service identified in the input speech command; and
wherein said command executioner is operable to predict a desired telephony service using said predictor and the current system status data, if the input speech command does not identify a desired telephony service. - View Dependent Claims (52, 53)
-
-
54. A method of controlling a telephony system comprising the steps of:
-
receiving an input speech command;
storing a plurality of reference word models;
comparing an input speech command with the stored reference word models to generate a recognition result; and
controlling the telephony system in accordance with the generated recognition result by executing an operation corresponding to the input speech command;
wherein said controlling step controls the telephony system so as to reproduce a stored message using the recognition result generated in said comparing step;
wherein said storing step stores said message together with an identifier for identifying the caller that left the message;
wherein said controlling step displays a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said controlling step comprises the step of predicting, using current system status information, what telephony service is wanted if the user inputs a speech command including only the identifier of another user. - View Dependent Claims (55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73)
-
-
74. A telephony method comprising the steps of:
-
providing a speech recognition user interface for allowing a user to input speech commands for controlling telephony services provided by the system, the speech recognition user interface comprising;
(i) an input terminal for receiving an input speech command;
(ii) a memory for storing a plurality of reference models; and
(iii) a comparator for comparing the input speech command with said stored reference models to generate a recognition result; and
executing an operation corresponding to the speech command in dependence upon the recognition result generated by the speech recognition user interface, wherein each user of the system is identified by a telephone number and an associated identifier, and wherein the executing step comprises the steps of;
receiving data identifying a current status of the telephony system; and
predicting a desired telephony service using the current system status data;
wherein if the user input command identifies a telephony service, then said executing step provides the desired telephony service identified in the input speech command; and
wherein said executing step predicts a desired telephony service using said predicting step and the current system status data, if the input speech command does not identify a desired telephony service. - View Dependent Claims (75, 76)
-
-
77. A computer readable medium storing computer executable process steps for controlling a telephony system, the process steps comprising:
-
steps for providing a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, comprising;
(i) steps for receiving an input speech command;
(ii) steps for storing a plurality of reference word models; and
(iii) steps for comparing the input speech command with said stored reference word models to generate a recognition result; and
steps for controlling the telephony system in accordance with the input speech command;
wherein said controlling step controls the telephony system so as to reproduce a stored message using the recognition result generated in said comparing step;
wherein said storing step stores said message together with an identifier for identifying the caller that left the message;
wherein said controlling step displays a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said controlling step comprises predicting, using current system status information, what telephony service is wanted if the user inputs, via said speech recognition user interface, only the identifier of another user.
-
-
78. A computer readable medium storing computer executable process steps defining a telephony system, the process steps comprising:
-
steps for providing a speech recognition user interface allowing a user to input speech commands for controlling telephony services provided by the system, the steps for providing the speech recognition user interface comprising;
(i) steps for receiving an input speech command;
(ii) steps for storing a plurality of reference models; and
(iii) steps for comparing the input speech command with said stored reference models to generate a recognition result; and
steps for executing an operation corresponding to the speech command in response to the generated recognition result, wherein each user of the system is identified by a telephone number and an associated identifier, and wherein the steps for executing comprise;
steps for receiving data identifying a current status of the telephony system; and
steps for predicting a desired telephony service using the current system status data;
wherein if the user input command identifies a telephony service, then said steps for executing provide the desired telephony service identified in the input speech command; and
wherein said steps for executing predict a desired telephony service using said steps for predicting and the current system status data, if the input speech command does not identify a desired telephony service.
-
-
79. A computer readable medium storing computer executable process steps for providing a mail box facility for use with a telephony system, the process steps comprising:
-
steps for providing a speech recognition user interface for allowing a user to input speech commands for controlling the mail box facility, and for outputting a recognition result based on comparing the input speech commands with pre-stored reference models;
steps for controlling the mail box facility in accordance with an input speech command in response to the recognition result;
steps for storing messages left by callers, when the users are unable to take the calls;
steps for receiving information of the callers via said speech recognition user interface;
steps for reproducing the messages left by the callers identified by the information received in the receiving steps; and
steps for storing a pointer which points to a last messge reproduced in the reproducing steps.
-
-
80. A computer executable program for controlling a telephony system, the program comprising:
-
a code for instructing the telephony system to provide a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, the speech recognition user interface code comprising;
(i) a code for instructing the telephony system to receive an input speech command;
(ii) a code for instructing the telephony system to store a plurality of reference word models; and
(iii) a code for instructing the telephony system to compare the input speech command with the stored reference word models to generate a recognition result; and
a code for controlling the telephony system in accordance with the input speech command in dependence upon the generated recognition result;
wherein said code for controlling the telephony system further comprises a code to control the telephony system so as to reproduce a stored message using the recognition result generated by said speech recognition user interface;
wherein said code for controlling the telephony system further comprises code for storing said message with an identifier for identifying the caller that left the message;
wherein said code for controlling further comprises code for instructing the telephony system to display a list of messages to be reproduced upon instruction to reproduce the message identified by said identifier; and
wherein said code for controlling further comprises code for instructing the telephony system to predict, using current system status information, what telephony service is wanted if the user inputs, via said speech recognition user interface, only the identifier of another user.
-
-
81. A computer executable program for defining a telephony system, the program comprising:
-
a code for instructing the telephony system to provide a speech recognition user interface for allowing a user to input speech commands for controlling telephony services provided by the system, the speech recognition user interface code comprising;
(i) a code for instructing the telephony system to receive an input speech command;
(ii) a code for instructing the telephony system to store a plurality of reference models; and
(iii) a code for instructing the telephony system to compare the input speech command with the stored reference models to generate a recognition result; and
a code for instructing the telephony system to execute an operation corresponding to the speech command in dependence upon the generated recognition result, a code for instructing the telephony system to identify each user of the system by a telephone number and an associated identifier, and wherein the execution code comprises;
a code for instructing the telephony system to receive data identifying a current status of the telephony system; and
a code for instructing the telephony system to predict a desired telephony service using the current system status data;
wherein if the user input command identifies a telephony service, then said execution code instructs the telephony system to provide the desired telephony service identified in the input speech command; and
wherein said execution code instructs the telephony system to predict a desired telephony service using said code for predicting and the current system status data, if the input speech command does not identify a desired telephony service.
-
-
82. Computer executable program for providing a telephony system, comprising:
-
a code for instructing the telephony system to provide a speech recognition user interface for allowing a user to input speech commands for controlling the mail box facility, and for outputting a recognition result based on comparing the input speech commands with pre-stored reference models;
a code for controlling the mail box facility in accordance with an input speech command in response to the recognition result;
a code for instructing the telephony system to store messages left by callers, when the users are unable to take the calls;
a code for instructing the telephony system to receive information of the callers via said speech recognition user interface;
a code for instructing the telephony system to reproduce the messages left by the callers identified by the information received by the telephony system; and
a code for instructing the telephony system to store a pointer which points to a last message reproduced by the telephony system.
-
-
83. A control system for controlling an information system, comprising:
-
a speech recognition user interface allowing a user to input speech commands for controlling the information system, said speech recognition user interface comprising;
(i) means for receiving an input speech command;
(ii) means for storing a plurality of reference models;
(iii) means for comparing the input speech command with the stored reference models to generate a recognition result; and
control means, responsive to the recognition result generated by said speech recognition user interface, for controlling the information system in accordance with the input speech command, wherein said speech recognition user interface is adapted to recognise continuously spoken commands comprising at least one word defining a desired information service, wherein said speech recognition user interface further comprises means for storing a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands, wherein said comparing means is operable to compare the input speech command with selected sequences of the reference models, selected in accordance with the stored language model, and wherein said control means comprises execution means for executing an operation corresponding to the input speech command.
-
-
84. A control system for controlling an information system comprising:
-
a speech recognition user interface allowing a user to input speech commands for controlling the information system, said speech recognition user interface comprising;
(i) an input terminal for receiving an input speech command;
(ii) a memory for storing a plurality of reference models; and
(iii) a comparator for comparing the input speech command with the stored reference models to generate a recognition result; and
a controller, responsive to the recognition result generated by said speech recognition user interface, for controlling the information system in accordance with the input speech command, wherein said speech recognition user interface is adapted to recognise continuously spoken commands comprising at least one word defining a desired information service, wherein said speech recognition user interface further comprises a memory for storing a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands, wherein said comparator compares the input speech command with selected sequences of the reference models, selected in accordance with the stored language model, and wherein said controller comprises a command executor for executing an operation corresponding to the input speech command.
-
-
85. A control system for controlling a telephony system, comprising:
-
a speech recognition user interface for allowing a user to input speech commands for controlling the telephony system, said speech recognition user interface comprising;
(i) means for receiving an input speech command, (ii) means for storing a plurality of reference word models, (iii) means for comparing the input speech command with the stored reference word models to generate a recognition result, and (iv) means for storing a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands; and
control means, responsive to the recognition result generated by said speech recognition user interface, for controlling the telephony system in accordance with the input speech command, wherein said comparing means is operable to compare the input speech command with selected sequences of the reference models, selected in accordance with the stored language model, and said control means comprises execution means for executing an operation corresponding to the input speech command.
-
-
86. A method of controlling an information system comprising the steps of:
-
receiving an input speech command;
storing a plurality of reference models;
comparing an input speech command with the stored reference models to generate a recognition result;
in response to the generated recognition result, controlling the information system in accordance with the input speech command; and
storing a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands, wherein said receiving step receives continuously spoken commands comprising at least one word defining a desired information service;
wherein said comparing step compares the input speech command with selected sequences of the reference models, selected in accordance with the stored language model, and said controlling step includes a step of executing an operation corresponding to the input speech command.
-
-
87. A control system for controlling a telephony system, comprising:
-
a speech recognition user interface that allows a user to input speech commands for controlling the telephony system, wherein the speech recognition user interface receives an input speech command, stores a plurality of reference word models, compares the input speech command with the stored reference word models to generate a recognition result and stores a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands; and
a controller, responsive to the recognition result generated by the speech recognition user interface, that controls the telephony system in accordance with the input speech command, wherein the speech recognition user interface compares the input speech command with selected sequences of the reference models, selected in accordance with the stored language model, and wherein the controller executes an operation corresponding to the input speech command.
-
-
88. A method of controlling a telephony system, comprising the steps of:
-
receiving an input speech command;
storing a plurality of reference word models;
storing a language model which defines sequences of the reference models which can be compared with the input speech command, in order to define allowed input speech commands;
comparing the input speech command with selected sequences of the stored reference word models, selected in accordance with the stored language model, to generate a recognition result;
controlling the telephony system in accordance with the input speech command, in response to the generated recognition result; and
executing an operation corresponding to the input speech command.
-
Specification