System and method for the automatic discovery of salient segments in speech transcripts
First Claim
1. A method automatically discovering salient segments in a speech transcript, comprising:
- performing a first segmentation of the speech transcript using a boundary-based process to generate a sequence of first segments, indicative of a temporal proximity of features in the speech;
performing a second segmentation of the first segments for determining a rate of arrival of the features, and for generating a sequence of second segments; and
performing a third segmentation of the second segments using a content-based process to generate a sequence of third segments, to minimize oversegmentation.
3 Assignments
0 Petitions
Accused Products
Abstract
A system and associated method automatically discover salient segments in a speech transcript and focus on the segmentation of an audio/video source into topically cohesive segments based on Automatic Speech Recognition (ASR) transcriptions. The word n-grams are extracted from the speech transcript using a three-phase segmentation algorithm based on the following sequence or combination of boundary-based and content-based methods: a boundary-based method; a rate of arrival of feature method; and a content-based method. In the first two segmentation passes, the temporal proximity and the rate of arrival of features are analyzed to compute an initial segmentation. In the third segmentation pass, changes in the set of content-bearing words used by adjacent segments are detected, to validate the initial segments for merging them, to prevent over-segmentation.
-
Citations
22 Claims
-
1. A method automatically discovering salient segments in a speech transcript, comprising:
-
performing a first segmentation of the speech transcript using a boundary-based process to generate a sequence of first segments, indicative of a temporal proximity of features in the speech;
performing a second segmentation of the first segments for determining a rate of arrival of the features, and for generating a sequence of second segments; and
performing a third segmentation of the second segments using a content-based process to generate a sequence of third segments, to minimize oversegmentation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program for automatically discovering salient segments in a speech transcript, comprising:
-
a first set of program instructions for performing a first segmentation of the speech transcript using a boundary-based process to generate a sequence of first segments, indicative of a temporal proximity of features in the speech;
a second set of program instructions for performing a second segmentation of the first segments for determining a rate of arrival of the features, and for generating a sequence of second segments; and
a third set of program instructions for performing a third segmentation of the second segments using a content-based process to generate a sequence of third segments, to minimize oversegmentation. - View Dependent Claims (10, 11, 12, 13)
-
-
14. A system for automatically discovering salient segments in a speech transcript, comprising:
-
means for performing a first segmentation of the speech transcript using a boundary-based process to generate a sequence of first segments, indicative of a temporal proximity of features in the speech;
means for performing a second segmentation of the first segments for determining a rate of arrival of the features, and for generating a sequence of second segments; and
means for performing a third segmentation of the second segments using a content-based process to generate a sequence of third segments, to minimize oversegmentation. - View Dependent Claims (15, 16, 17, 18, 19, 20)
-
-
21. A method automatically discovering salient segments in a time varying signal, comprising:
-
performing a first segmentation of the time varying signal using a boundary-based process to generate a sequence of first segments, indicative of a temporal proximity of features in the speech;
performing a second segmentation of the first segments for determining a rate of arrival of the features, and for generating a sequence of second segments; and
performing a third segmentation of the second segments using a content-based process to generate a sequence of third segments, to minimize oversegmentation. - View Dependent Claims (22)
-
Specification