Use of multiple speech recognition software instances
First Claim
1. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
- Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options;
Receiving a confidence level of recognition from each server running speech recognition software;
Routing the recognition result with the highest confidence level for further processing;
wherein receiving a confidence level of recognition from each server running speech recognition software includes;
upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with a human transcriber currently experiencing an acceptable workload;
receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber;
creating an application command based on the machine readable command; and
transmitting the application command to the wireless communication device.
4 Assignments
0 Petitions
Accused Products
Abstract
A wireless communication device is disclosed that accepts recorded audio data from an end-user. The audio data can be in the form of a command requesting user action. Likewise, the audio data can be converted into a text file. The audio data is reduced to a digital file in a format that is supported by the device hardware, such as a .wav, .mp3, .vnf file, or the like. The digital file is sent via secured or unsecured wireless communication to one or more server computers for further processing. In accordance with an important aspect of the invention, the system evaluates the confidence level of the of the speech recognition process. If the confidence level is high, the system automatically builds the application command or creates the text file for transmission to the communication device. Alternatively, if the confidence of the speech recognition is lower, the recorded audio data file is routed to a human transcriber employed by the telecommunications service, who manually reviews the digital voice file and builds the application command or text file. Once the application command is created, it is transmitted to the communication device. As a result of the present invention, speech recognition in the context of a communications devices has been shown to be accurate over 90% of the time.
41 Citations
8 Claims
-
1. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
-
Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options; Receiving a confidence level of recognition from each server running speech recognition software; Routing the recognition result with the highest confidence level for further processing; wherein receiving a confidence level of recognition from each server running speech recognition software includes; upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with a human transcriber currently experiencing an acceptable workload; receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber; creating an application command based on the machine readable command; and transmitting the application command to the wireless communication device. - View Dependent Claims (2, 3, 4)
-
-
5. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
-
Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options; Receiving a confidence level of recognition from each server running speech recognition software; and Routing the recognition result with the highest confidence level for further processing; wherein directing the audio data to more than one simultaneous servers running speech recognition software includes; appending at least one unique identifier to the recorded audio data, the unique identifier associated with at least one human transcriber who has previously reviewed voice commands from a user currently associated with the wireless communication device, the unique identifier further indicative of an accent of the user currently associated with the wireless communication device; wherein receiving a confidence level of recognition from each server running speech recognition software includes; upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with the human transcriber who has previously reviewed voice commands from a user currently associated with the wireless communication device; receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber; creating an application command based on the machine readable command; and transmitting the application command to the wireless communication device.
-
-
6. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
-
Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options; Receiving a confidence level of recognition from each server running speech recognition software; and Routing the recognition result with the highest confidence level for further processing; wherein receiving a confidence level of recognition from each server running speech recognition software includes; upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with a human transcriber identified according to criteria defined by a user currently associated with the wireless communication device; creating data, to be presented at the location associated with the human transcriber, representing indication of the user'"'"'s historical activity; receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber; creating an application command based on the machine readable command; transmitting the application command to the wireless communication device; creating a prompt to be presented at the location associated with the human transcriber, the prompt requesting an update to a speech recognition grammar file associated with the user currently associated with the wireless communication device, the update indicative of an interpretation of the recorded audio data made by the human transcriber, the updated speech recognition grammar file enhancing the servers running speech recognition software ability to process subsequent recorded audio data created by the user currently associated with the wireless communication device.
-
-
7. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
-
Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options; Receiving a confidence level of recognition from each server running speech recognition software; and Routing the recognition result with the highest confidence level for further Processing; wherein directing the audio data to more than one simultaneous servers running speech recognition software includes; appending at least one unique identifier to the recorded audio data, the unique identifier associated with a user of the wireless communication device; wherein receiving a confidence level of recognition from each server running speech recognition software includes; upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with the human transcriber; based on the unique identifier, selecting a grammar file, the grammar file including a representation of at least one example of the user'"'"'s speech pattern; transmitting the grammar file for presentation at the location associated with the human transcriber; receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber; creating an application command based on the machine readable command; and transmitting the application command to the wireless communication device.
-
-
8. A method of using speech recognition software for recorded audio data received from a wireless communication device, comprising:
-
Receiving recorded audio data communicated from a wireless communication device and directing the audio data to more than one simultaneous servers running speech recognition software, wherein the number of simultaneous servers running speech recognition software receiving the same audio data is controlled by the communication device user'"'"'s options; Receiving a confidence level of recognition from each server running speech recognition software; and Routing the recognition result with the highest confidence level for further processing; wherein receiving a confidence level of recognition from each server running speech recognition software includes; upon detecting that each confidence level of recognition from each server running speech recognition software is below a predefined level, routing the recorded audio data to a location associated with a human transcriber currently experiencing an acceptable workload, wherein the communication device user'"'"'s options include indicating the user has selected to provide speech structured in accordance with a standardized format for voice commands, the recorded audio data representing speech structured in accordance with the standardized format for voice commands; receiving a machine readable command from the location associated with the human transcriber, the machine readable command comprising a representation of an application response to the recorded audio data, the machine readable command created by the human transcriber; creating an application command based on the machine readable command; and transmitting the application command to the wireless communication device.
-
Specification