Transcript alignment
First Claim
1. A method for aligning an audio recording and a transcript comprising:
forming a plurality of search terms from the transcript, each search term associated with a location within the transcript;
determining zero or more putative locations of each of the search terms in a time interval of the audio recording, including for at least some search terms determining multiple putative locations in the time interval of the audio recording; and
after determining the putative locations, aligning the time interval of the audio recording and the transcript using the determined putative locations of the search terms, including, for at least some of the search terms, selecting one of the putative locations of the search term for aligning with the location within the transcript that is associated with the search term.
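The selection step of claim 1 can be illustrated with a short sketch. It assumes the putative locations have already been produced by some word-spotting engine, which is not shown and which the claim does not specify. The names `Hit` and `select_monotonic`, the positive confidence score, and the use of dynamic programming are all assumptions made for illustration, not the method the patent prescribes. The sketch keeps at most one putative location per search term so that the chosen times increase in transcript order and the summed score is as large as possible.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Hit:
    time: float   # putative start time in the audio, in seconds
    score: float  # hypothetical confidence; assumed positive, higher is better


def select_monotonic(hits_per_term: list[list[Hit]]) -> list[Optional[Hit]]:
    """Choose at most one hit per search term (terms given in transcript order)
    so that the chosen times strictly increase and the total score is maximal."""
    # Flatten to (term_index, hit); enumerate keeps nodes ordered by term index.
    nodes = [(i, h) for i, hits in enumerate(hits_per_term) for h in hits]
    result: list[Optional[Hit]] = [None] * len(hits_per_term)
    if not nodes:
        return result

    best = [h.score for _, h in nodes]  # best chain score ending at each node
    prev = [-1] * len(nodes)            # back-pointer to the previous chain node
    for b, (term_b, hit_b) in enumerate(nodes):
        for a in range(b):
            term_a, hit_a = nodes[a]
            # A predecessor must be an earlier term at an earlier audio time.
            if term_a < term_b and hit_a.time < hit_b.time:
                if best[a] + hit_b.score > best[b]:
                    best[b] = best[a] + hit_b.score
                    prev[b] = a

    # Backtrack from the highest-scoring chain end.
    k = max(range(len(nodes)), key=best.__getitem__)
    while k != -1:
        term, hit = nodes[k]
        result[term] = hit
        k = prev[k]
    return result


if __name__ == "__main__":
    hits = [
        [Hit(1.0, 0.9), Hit(50.0, 0.8)],  # term 0: two putative locations
        [Hit(5.0, 0.7)],                  # term 1: one putative location
        [Hit(3.0, 0.95), Hit(9.0, 0.6)],  # term 2: two putative locations
    ]
    for term, hit in enumerate(select_monotonic(hits)):
        print(term, hit)
```

In this example the hit at 3.0 s for the third term has the highest individual score, but it cannot be chained after the second term's hit at 5.0 s, so the sketch prefers the consistent sequence. A term with zero putative locations, or one whose hits all conflict with the best chain, is simply left unaligned (`None`).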
Abstract
An approach to alignment of transcripts with recorded audio is tolerant of moderate transcript inaccuracies, untranscribed speech, and significant non-speech noise. In one aspect, a number of search terms are formed from the transcript such that each search term is associated with a location within the transcript. Possible locations of the search terms are then determined in the audio recording. The audio recording and the transcript are then aligned using the possible locations of the search terms. In another aspect, a search expression is accepted, and then a search is performed for spoken occurrences of the search expression in an audio recording. This search includes searching for text occurrences of the search expression in a text transcript of the audio recording, and searching for spoken occurrences of the search expression in the audio recording.
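The second aspect, searching both the transcript text and the audio, can be sketched as follows. This assumes the transcript has already been aligned, so each word carries a start time, and that audio hit times come from a separate word-spotting engine that is not shown. The function names, the `(word, time)` representation, and the `tol` matching window are hypothetical; the abstract does not say how the two result sets are combined.

```python
def text_occurrences(expr: str, aligned_words: list[tuple[str, float]]) -> list[float]:
    """Start times where the words of `expr` appear consecutively in the
    time-aligned transcript (case-insensitive, whitespace-tokenized)."""
    target = expr.lower().split()
    words = [w.lower() for w, _ in aligned_words]
    n = len(target)
    if n == 0:
        return []
    return [
        aligned_words[i][1]
        for i in range(len(words) - n + 1)
        if words[i:i + n] == target
    ]


def combine(text_times: list[float], audio_times: list[float],
            tol: float = 0.5) -> list[tuple[float, str]]:
    """Merge both result sets. A text hit with an audio hit within `tol`
    seconds is tagged "both"; the rest are tagged "text" or "audio"."""
    unmatched_audio = sorted(audio_times)
    results: list[tuple[float, str]] = []
    for t in sorted(text_times):
        match = next((a for a in unmatched_audio if abs(a - t) <= tol), None)
        if match is not None:
            unmatched_audio.remove(match)  # each audio hit confirms one text hit
            results.append((t, "both"))
        else:
            results.append((t, "text"))
    results.extend((a, "audio") for a in unmatched_audio)
    return sorted(results)


if __name__ == "__main__":
    aligned = [("the", 0.0), ("quick", 0.4), ("brown", 0.8), ("fox", 1.2),
               ("the", 5.0), ("quick", 5.3), ("dog", 5.8)]
    audio_hits = [0.1, 12.0]  # stand-in for word-spotter output
    print(combine(text_occurrences("The quick", aligned), audio_hits))
```

Hits found only in the audio are the interesting case for this aspect: they correspond to speech that the transcript missed or transcribed differently.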
23 Claims
Specification