System, method and computer program product for a distributed speech recognition tuning platform
First Claim
Patent Images
1. A method for improving a speech recognition process, comprising:
- maintaining a database of utterances;
collecting information associated with the utterances in the database utilizing a speech recognition process;
transmitting the utterances in the database to at least one user interface utilizing a network;
receiving transcriptions of the utterances in the database from the at least one user interface utilizing the network;
wherein a human is capable of utilizing the information and the transcriptions to improve a speech recognition application;
wherein the speech recognition process is improved by performing experiments based on the information;
wherein the information is selected from the group consisting of a dialog state, a gender of a speaker, and a date the utterances are wherein the at least one user interface includes a first icon for emitting a present utterance upon the selection thereof.
5 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer program product are provided for tuning a speech recognition process. Initially, a database of utterances is maintained. Thereafter, information associated with the utterances is collected utilizing a speech recognition process. Further, the utterances in the database are transmitted to a plurality of users utilizing a network. As such, transcriptions of the utterances in the database may be received from the users utilizing the network. In use, the speech recognition process may be tuned utilizing the information and the transcriptions.
-
Citations
19 Claims
-
1. A method for improving a speech recognition process, comprising:
-
maintaining a database of utterances; collecting information associated with the utterances in the database utilizing a speech recognition process; transmitting the utterances in the database to at least one user interface utilizing a network; receiving transcriptions of the utterances in the database from the at least one user interface utilizing the network; wherein a human is capable of utilizing the information and the transcriptions to improve a speech recognition application; wherein the speech recognition process is improved by performing experiments based on the information; wherein the information is selected from the group consisting of a dialog state, a gender of a speaker, and a date the utterances are wherein the at least one user interface includes a first icon for emitting a present utterance upon the selection thereof. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer program product embodied on a computer readable medium for improving a speech recognition process, comprising:
-
(a) computer code for maintaining a database of utterances; (b) computer code for collecting information associated with the utterances in the database utilizing a speech recognition process; (c) computer code for transmitting the utterances in the database to at least one user interface utilizing a network; and (d) computer code for receiving transcriptions of the utterances in the database from the at least one user interface utilizing the network; (e) wherein a human is capable of utilizing the information and the transcriptions to improve a speech recognition application; wherein the speech recognition process is improved by performing experiments based on the information; wherein the information is selected from the group consisting of a dialog state, a gender of a speaker, and a date the utterances are wherein the at least one user interface includes a string field for allowing a user to enter a string corresponding to each utterance.
-
-
19. A system including a tangible computer readable medium for improving a speech recognition process, comprising:
-
(a) logic for maintaining a database of utterances; (b) logic for collecting information associated with the utterances in the database utilizing a speech recognition process, (c) logic for transmitting the utterances in the database to at least one user interface utilizing a network; (d) logic for receiving transcriptions of the utterances in the database from the at least one user interface utilizing the network; (e) wherein a human is capable of utilizing the information and the transcriptions to improve a speech recognition application; wherein the speech recognition process is improved by performing experiments based on the information; wherein the information is selected from the group consisting of a dialog state, a gender of a speaker, and a date the utterances are wherein the at least one user interface includes a string field for allowing a user to enter a string corresponding to each utterance.
-
Specification