Mass-scale, user-independent, device-independent voice messaging system
Abstract
A mass-scale, user-independent, device-independent voice messaging system that converts unstructured voice messages into text for display on a screen is disclosed. The system comprises (i) computer implemented sub-systems and (ii) a network connection to human operators providing transcription and quality control. The system is adapted to optimize the effectiveness of the human operators by further comprising three core sub-systems, namely (i) a pre-processing front end that determines an appropriate conversion strategy; (ii) one or more conversion resources; and (iii) a quality control sub-system.
16 Claims
1. A voice messaging system for converting an audio voice message from a caller to a recipient to text, the voice messaging system comprising:
an automatic speech recognition (ASR) system to automatically recognize at least some of the audio voice message, the ASR system comprising:
a plurality of ASR components, each specially configured to recognize a respective type of content; and
a computer implemented boundary selection sub-system to process the audio voice message to identify at least one portion of the audio voice message which contains content of the type for which one of the plurality of ASR components is specially configured, wherein the identified at least one portion of the audio message is sent to the one of the plurality of ASR components specially configured for the type of content identified in the at least one portion to be automatically recognized.
Dependent claims: 2, 3, 4, 5, 6.
7. A method for converting an audio voice message from a caller to a recipient to text, the method comprising:
processing the audio voice message to identify at least one portion of the audio voice message which contains content of a type for which one of a plurality of ASR components is specially configured to automatically recognize;
sending the identified at least one portion of the audio voice message to the one of the plurality of ASR components specially configured for the type of content identified in the at least one portion of the audio voice message to be automatically recognized;
receiving, from the ASR component to which the at least one portion of the audio voice message was sent, a text portion corresponding to the automatic recognition of the at least one portion of the audio;
assembling the text portion into the text; and
outputting the text to the recipient.
Dependent claims: 9, 10, 11, 12.
8. At least one non-transitory computer readable storage device for storing instructions that, when executed on at least one computer, cause the at least one computer to perform a method for converting an audio voice message from a caller to a recipient to text, the method comprising:
processing the audio voice message to identify at least one portion of the audio voice message which contains content of a type for which one of a plurality of ASR components is specially configured to automatically recognize;
sending the identified at least one portion of the audio voice message to the one of the plurality of ASR components specially configured for the type of content identified in the at least one portion of the audio voice message to be automatically recognized;
receiving, from the ASR component to which the at least one portion of the audio voice message was sent, a text portion corresponding to the automatic recognition of the at least one portion of the audio;
assembling the text portion into the text; and
outputting the text to the recipient.
Dependent claims: 13, 14, 15, 16.
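The method steps recited in claims 7 and 8 (identify portions, send each portion to a specially configured ASR component, receive text portions, assemble the text) can be illustrated with a minimal sketch. This is not the patented implementation. The names `Segment`, `convert_message`, `toy_boundaries` and `make_toy_recognizer` are hypothetical, the "audio" is plain UTF-8 text standing in for real samples, and the general-purpose fallback recognizer for portions with no specialized component is an assumption that the claims do not recite.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# A recognizer maps one audio portion to a text portion (hypothetical interface).
Recognizer = Callable[[bytes], str]


@dataclass
class Segment:
    content_type: str  # hypothetical label such as "greeting" or "number"
    audio: bytes       # stand-in for the audio samples of this portion


def convert_message(
    audio: bytes,
    select_boundaries: Callable[[bytes], List[Segment]],
    recognizers: Dict[str, Recognizer],
    fallback: Recognizer,
) -> str:
    """Route each identified portion to a matching recognizer and assemble the text."""
    text_portions = []
    for segment in select_boundaries(audio):
        # Send the portion to the component configured for its content type.
        # Using a general recognizer otherwise is an assumption of this sketch.
        recognizer = recognizers.get(segment.content_type, fallback)
        text_portions.append(recognizer(segment.audio))
    # Assemble the received text portions, in order, into the output text.
    return " ".join(portion for portion in text_portions if portion)


def toy_boundaries(audio: bytes) -> List[Segment]:
    """Toy boundary selection: input is 'type:payload' chunks separated by '|'."""
    segments = []
    for chunk in audio.decode("utf-8").split("|"):
        content_type, _, payload = chunk.partition(":")
        segments.append(Segment(content_type, payload.encode("utf-8")))
    return segments


def make_toy_recognizer(tag: str) -> Recognizer:
    """Toy recognizer: decodes the payload and prefixes the component name."""
    return lambda audio: f"[{tag}] {audio.decode('utf-8')}"


recognizers = {
    "greeting": make_toy_recognizer("greeting"),
    "number": make_toy_recognizer("number"),
}
message = b"greeting:hi it is Sam|number:555 0100|other:call me back"
text = convert_message(message, toy_boundaries, recognizers, make_toy_recognizer("general"))
print(text)
```

In this sketch the boundary selection step and the recognizers are passed in as functions, so a real segmenter or a real ASR engine could be substituted without changing the routing and assembly logic.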
Specification