PROCESSING OF AUDIO DATA
First Claim
Patent Images
1. A method for processing audio data, comprising:
- generating a transcript language model based on text data representative of a transcript associated with said audio data;
processing said audio data with a transcription engine to determine at least a set of confidence values for a plurality of language elements in a text output of the transcription engine, the transcription engine using said transcript language model; and
determining whether the text data is associated with said audio data based on said set of confidence values.
1 Assignment
0 Petitions
Accused Products
Abstract
Examples of processing audio data are described. In certain examples, a transcript language model is based on text data representative of a transcript associated with the audio data. The audio data is processed to determine at least a set of confidence values for language elements in a text output of the processing, wherein the processing uses the transcript language model. The set of confidence values enable a determination to be made. The determination relates to whether the text data is associated with said audio data based on said set of confidence values.
-
Citations
15 Claims
-
1. A method for processing audio data, comprising:
-
generating a transcript language model based on text data representative of a transcript associated with said audio data; processing said audio data with a transcription engine to determine at least a set of confidence values for a plurality of language elements in a text output of the transcription engine, the transcription engine using said transcript language model; and determining whether the text data is associated with said audio data based on said set of confidence values. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system processing media data, the media data comprising at least an audio portion, the system comprising:
-
a first component to instruct configuration of a language model based on text data representative of audible language elements within said audio portion; and a second component to instruct conversion of the audio portion of the media data to a text equivalent based on said language model, said conversion outputting a set of confidence values for a plurality of language elements in the text equivalent, wherein the system determines whether the text data is associated with said audio data based on said set of confidence values. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A non-transitory computer-readable storage medium storing instructions that, when executed by one or more processors, cause the one or more processors to:
-
generate a transcript language model based on text data representative of a transcript associated with said audio data; process said audio data with a transcription engine to determine at least a set of confidence values for a plurality of language elements in a text output of the transcription engine, the transcription engine using said transcript language model; and determine whether the text data is associated with said audio data based on said set of confidence values.
-
Specification