Transcript alignment
First Claim
1. A method comprising:
- accepting a search expression;
searching for text occurrences of the search expression in a transcript of an audio recording;
searching for spoken occurrences of the search expression in the audio recording without requiring first searching for the text occurrences of the search expression in the transcript of the audio recording;
compiling results of the searching for spoken occurrences and text occurrences, wherein the compiling includes eliminating duplicate occurrences of a common instance of the search expression according to time alignments of text occurrences and spoken occurrences; and
presenting representations of the compiled results of the searching, including enabling access to portions of the audio recording corresponding to speech occurrences and text occurrences in the results of the searching.
11 Assignments
0 Petitions
Accused Products
Abstract
An approach to alignment of transcripts with recorded audio is tolerant of moderate transcript inaccuracies, untranscribed speech, and significant non-speech noise. In one aspect, a number of search terms are formed from the transcript such that each search term is associated with a location within the transcript. Possible locations of the search terms are then determined in the audio recording. The audio recording and the transcript are then aligned using the possible locations of the search terms. In another aspect a search expression is accepted, and then a search is performed for spoken occurrences of the search expression in an audio recording. This search includes searching for text occurrences of the search expression in a text transcript of the audio recording, and searching for spoken occurrences of the search expression in the audio recording.
-
Citations
15 Claims
-
1. A method comprising:
-
accepting a search expression; searching for text occurrences of the search expression in a transcript of an audio recording; searching for spoken occurrences of the search expression in the audio recording without requiring first searching for the text occurrences of the search expression in the transcript of the audio recording; compiling results of the searching for spoken occurrences and text occurrences, wherein the compiling includes eliminating duplicate occurrences of a common instance of the search expression according to time alignments of text occurrences and spoken occurrences; and presenting representations of the compiled results of the searching, including enabling access to portions of the audio recording corresponding to speech occurrences and text occurrences in the results of the searching. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
-
13. One or more processor readable storage devices having code embodied on said storage devices, said code for programming one or more processors to perform a method comprising:
-
accepting a search expression; searching for text occurrences of the search expression in a transcript of an audio recording; searching for spoken occurrences of the search expression in the audio recording without requiring first searching for the text occurrences of the search expression in the transcript of the audio recording; compiling results of the searching for spoken occurrences and text occurrences, wherein the compiling includes eliminating duplicate occurrences of a common instance of the search expression according to time alignments of text occurrences and spoken occurrences; and presenting representations of the compiled results of the searching, including enabling access to portions of the audio recording corresponding to speech occurrences and text occurrences in the results of the searching. - View Dependent Claims (14, 15)
-
Specification