Method and apparatus for processing the output of a speech recognition engine
Abstract
Data processing apparatus for receiving recognition data from a speech recognition engine and its corresponding dictated audio data where the recognition data includes recognized words or characters. A display displays the recognized words or characters and the recognized words or characters are stored as a file together with the corresponding audio data. The recognized words or characters can be processed and link data is formed to link the position of the words or characters in the file and the position of the corresponding audio component in the audio data.
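The link data described in the abstract can be pictured as a table that maps each recognized word's position in the text to an identifier for its audio component. The following is a minimal sketch under stated assumptions, not the patented implementation: the `Link` record, the `build_links` helper, the single-space word separator, and the integer audio identifiers are all hypothetical names and choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Link:
    char_pos: int   # start position of the word in the text (assumed layout)
    length: int     # number of characters in the word
    audio_id: int   # hypothetical identifier of the matching audio component


def build_links(words):
    """Build the text and its link data.

    `words` is a list of (text, audio_id) pairs in recognition order.
    Words are joined with single spaces, which is an assumption of this sketch.
    """
    links = []
    parts = []
    pos = 0
    for text, audio_id in words:
        links.append(Link(pos, len(text), audio_id))
        parts.append(text)
        pos += len(text) + 1  # +1 for the separating space
    return " ".join(parts), links


text, links = build_links([("hello", 0), ("world", 1)])
```

Here the second word starts at character position 6 and is linked to audio identifier 1, so selecting it in the text is enough to find the audio to replay.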
219 Citations
31 Claims
1. A data processing arrangement comprising:
a data processing apparatus, the data processing apparatus comprising;
input means for receiving recognition data from a speech recognition engine and audio data, said recognition data including a string of recognized characters and audio identifiers identifying audio components corresponding to a character component of the recognized characters;
processing means for receiving and processing the input recognized characters to at least one of replace, insert, move and position the recognized characters to form a processed character string;
link means for forming link data linking the audio identifiers to the character component positions in the character string, and for updating said link data after processing to maintain the link between the audio identifiers and the character component positions in the processed character string;
storage means for storing said recognition data and audio data received from said input means, and for storing said link data;
display means for displaying the characters received and processed by said processing means;
user operable selection means for selecting characters in the displayed characters for audio playback, where said link data identifies any selected audio components, if present, which are linked to the selected characters; and
audio playback means for playing back the selected audio components in the order of the character component positions in the character string or the processed character string; and
an editor work station comprising;
data reading means for reading the characters, link data, and audio data from said data processing apparatus;
editor processing means for processing the characters;
editor link means for linking the audio data to the character component position using the link data;
editor display means for displaying the characters being processed;
editor correction means for selecting and correcting any displayed characters which have been incorrectly recognized;
editor audio playback means for playing back any audio component corresponding to the selected characters to aid correction;
editor speech recognition update means for storing the corrected characters and the audio identifier for the audio component corresponding to the corrected character in a character correction file; and
data transfer means for transferring the character correction file to said data processing apparatus for later updating of models used by said speech recognition engine;
said data processing apparatus including correction file reading means for reading said character correction file to pass the data contained therein to said speech recognition engine for the updating of the models used by said speech recognition engine.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18
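Claim 1 requires that the link data be updated after editing so that it stays valid, and that selected audio be played back in the order of the character positions. One way this could work is sketched below. It is an illustration only: the `Link` record and the functions `shift_after_insert` and `audio_for_selection` are hypothetical, and the sketch handles only insertions that fall between linked words, not inside them.

```python
from dataclasses import dataclass


@dataclass
class Link:
    char_pos: int   # start position of the linked characters
    length: int     # number of linked characters
    audio_id: int   # hypothetical audio component identifier


def shift_after_insert(links, at, n):
    """Update link data after `n` characters are inserted at position `at`.

    Every link starting at or after the insertion point moves right by `n`.
    """
    for link in links:
        if link.char_pos >= at:
            link.char_pos += n


def audio_for_selection(links, start, end):
    """Return audio ids for links overlapping [start, end), in text order.

    Typed (unlinked) text has no link, so it contributes nothing.
    """
    hits = [l for l in links
            if l.char_pos < end and l.char_pos + l.length > start]
    return [l.audio_id for l in sorted(hits, key=lambda l: l.char_pos)]


links = [Link(0, 5, 10), Link(6, 5, 11)]
shift_after_insert(links, 6, 4)               # four typed characters at position 6
in_order = audio_for_selection(links, 0, 20)  # both words selected
typed_only = audio_for_selection(links, 6, 10)
```

Because playback order is derived from the current character positions rather than from recording order, moving a word in the text also changes where its audio falls in the playback sequence.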
19. A method of processing data comprising:
at an author work station, carrying out a method including;
receiving recognition data including a string of recognized characters and audio identifiers identifying audio components corresponding to a character component of the recognized characters;
storing the received audio data;
inputting the recognized characters to a processor for the processing of the characters to at least one of replace, insert, move and position the characters to form a processed character string;
forming link data linking the audio identifiers to the character component positions in the character string and updating said link data after processing to maintain the link between the audio identifiers and the character component positions in the processed character string;
displaying the characters input to and processed by the processor;
selecting displayed characters for audio playback, whereby said link data identifies any selected audio components, if present, which are linked to the selected characters; and
playing back the selected audio components in the order of the character component positions in the character string or processed character string;
wherein the recognized characters, the link data and the audio data are stored; and
at an editor work station, obtaining the stored characters, link data and audio data from the author work station;
inputting the characters into a processor;
linking the audio data to the character component positions using the link data;
displaying the characters being processed;
selecting any displayed characters which have been incorrectly recognized;
playing back any audio component corresponding to the selected characters to aid correction;
correcting the incorrectly recognized characters;
storing the corrected characters and the audio identifier for the audio component corresponding to the corrected character in a character correction file; and
transferring the character correction file to the author work station for later updating of models used by said speech recognition engine;
wherein, at a later time, said character correction file is read at said author work station to pass the data contained therein to said speech recognition engine for updating of said models.
Dependent claims: 20, 21, 22, 23, 24, 25
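The character correction file in claim 19 pairs each corrected text with the audio identifier of the component it corrects, so the author work station can later pass both to the recognition engine. The claim does not specify a file format. The sketch below assumes one JSON record per line, and the function names `append_correction` and `read_corrections` are hypothetical.

```python
import json


def append_correction(path, corrected_text, audio_id):
    """Editor side: record a corrected word with its audio identifier."""
    record = {"text": corrected_text, "audio_id": audio_id}
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")


def read_corrections(path):
    """Author side: read all stored corrections, in the order written."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

The records returned by `read_corrections` would then be handed to whatever model-update interface the speech recognition engine provides, which is outside the scope of this sketch.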
26. A data processing arrangement comprising:
data processing apparatus comprising;
input means for receiving recognition data from a speech recognition engine and corresponding audio data, said recognition data including a string of recognized characters and audio identifiers identifying audio components corresponding to character components of the recognized characters;
link means for forming link data linking the audio identifiers to the character component positions in the character string;
storage means for storing said audio data received from said input means, said link data, and said recognized characters; and
display means for displaying the recognized characters; and
an editor work station comprising;
data reading means for obtaining the characters, link data, and audio data from said data processing apparatus;
editor processing means for processing the characters;
editor link means for linking the audio data to the character component position using the link data;
editor display means for displaying the characters being processed;
editor correction means for selecting and correcting any displayed characters which have been incorrectly recognized;
editor audio playback means for playing back any audio component corresponding to the selected characters to aid correction;
editor speech recognition update means for storing the corrected characters and the audio identifier for the audio component corresponding to the corrected character in a character correction file; and
data transfer means for transferring the character correction file to said data processing apparatus for later updating of models used by said speech recognition engine;
said data processing apparatus including correction file reading means for reading said character correction file to pass the data contained therein to said speech recognition engine.
Dependent claims: 27, 28, 29
30. A computer usable medium having computer readable instructions stored therein for causing the processor in a data processing apparatus to process signals defining a string of characters and audio data to store the characters and the audio data, the instructions comprising instructions for
causing the processor to receive the signals from a speech recognition engine;
causing the processor to generate an image of the characters on a display;
causing the processor to store the characters as a file;
causing the processor to selectively disable one of the display and storage of the characters and the speech recognition engine for a period of time; and
causing the processor to store the received audio data during said period of time as an audio message associated with the file.
Dependent claims: 31
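Claim 30 describes temporarily disabling recognition (or the display and storage of characters) and keeping the audio received during that period as an audio message attached to the file. A minimal sketch of that behaviour follows, assuming a hypothetical `DictationSession` class that is fed audio chunks together with any text recognized for them. It is not taken from the specification.

```python
class DictationSession:
    """Toy model of a dictation file with attached audio messages."""

    def __init__(self):
        self.text = []                 # recognized text stored as the file
        self.audio_messages = []       # audio kept while recognition was off
        self.recognition_enabled = True
        self._pending = bytearray()    # audio collected in the current pause

    def set_recognition(self, enabled):
        # When recognition is switched back on, close the pending audio
        # into one message associated with this file.
        if enabled and not self.recognition_enabled and self._pending:
            self.audio_messages.append(bytes(self._pending))
            self._pending = bytearray()
        self.recognition_enabled = enabled

    def feed(self, audio_chunk, recognized=""):
        if self.recognition_enabled:
            self.text.append(recognized)
        else:
            self._pending.extend(audio_chunk)


session = DictationSession()
session.feed(b"a", "hello")
session.set_recognition(False)   # start of the audio-message period
session.feed(b"xy")
session.feed(b"z")
session.set_recognition(True)    # end of the period
```

After this sequence the file text holds only the recognized word, and the audio received during the pause is stored as a single message alongside it.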
Specification