Mass-scale, user-independent, device-independent voice messaging system
First Claim
1. A voice messaging system for converting an audio voice message from a caller into text, the voice messaging system comprising:
- a plurality of conversion resources for converting the audio voice message into the text for an intended recipient, the plurality of conversion resources comprising;
at least one automatic speech recognition (ASR) system to automatically recognize at least some of the audio voice message and generate a plurality of candidate words or phrases; and
a computer implemented lattice sub-system that generates a lattice of possible words or phrases, enabling an operator to view one or more candidate words or phrases and to either select one of the one or more candidate words or phrases, or, by entering one or more characters of a different word or phrase, to trigger the lattice sub-system to provide at least one alternative word or phrase, wherein the lattice sub-system automatically differentiates between parts of the message based on whether the lattice sub-system determines parts of the message to be important or unimportant.
4 Assignments
0 Petitions
Accused Products
Abstract
A mass-scale, user-independent, device-independent, voice messaging system that converts unstructured voice messages into text for display on a screen is disclosed. The system comprises (i) computer implemented sub-systems and also (ii) a network connection to human operators providing transcription and quality control; the system being adapted to optimise the effectiveness of the human operators by further comprising 3 core sub-systems, namely (i) a pre-processing front end that determines an appropriate conversion strategy; (ii) one or more conversion resources; and (iii) a quality control sub-system.
-
Citations
22 Claims
-
1. A voice messaging system for converting an audio voice message from a caller into text, the voice messaging system comprising:
-
a plurality of conversion resources for converting the audio voice message into the text for an intended recipient, the plurality of conversion resources comprising; at least one automatic speech recognition (ASR) system to automatically recognize at least some of the audio voice message and generate a plurality of candidate words or phrases; and a computer implemented lattice sub-system that generates a lattice of possible words or phrases, enabling an operator to view one or more candidate words or phrases and to either select one of the one or more candidate words or phrases, or, by entering one or more characters of a different word or phrase, to trigger the lattice sub-system to provide at least one alternative word or phrase, wherein the lattice sub-system automatically differentiates between parts of the message based on whether the lattice sub-system determines parts of the message to be important or unimportant. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
Specification