Speech segment clustering and ranking
Abstract
A system, method, and apparatus for identifying problematic speech segments is provided. The system includes a clustering module for generating a first cluster of one or more consecutive speech segments if the consecutive speech segments satisfy a predetermined filtering test, and for generating a second cluster comprising at least one different consecutive speech segment selected from the ordered sequence if the at least one different consecutive speech segment satisfies the predetermined filtering test. The system also includes a combining module for combining the first and second clusters as well as the at least one intervening consecutive speech segment to form an aggregated cluster if the aggregated cluster satisfies a predetermined combining criterion. The system can further include a ranking module for ranking aggregated clusters, the ranking reflecting a relative severity of misalignments among problematic speech segments. Once identified, more severely misaligned speech segments can be analyzed more effectively and efficiently.
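The ranking step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how severity is measured, so everything here is an assumption made for illustration: each segment carries a hypothetical alignment score, a cluster is an inclusive (first, last) index pair, and severity is simply the number of segments in the cluster whose score falls below a hypothetical threshold.

```python
from typing import List, Sequence, Tuple

Span = Tuple[int, int]  # (first index, last index), both inclusive


def severity(span: Span, scores: Sequence[float], threshold: float = 0.5) -> int:
    """Assumed severity measure: count of segments scoring below the threshold."""
    first, last = span
    return sum(1 for s in scores[first:last + 1] if s < threshold)


def rank_clusters(spans: Sequence[Span], scores: Sequence[float]) -> List[Span]:
    """Order clusters from most to least severe under the assumed measure."""
    return sorted(spans, key=lambda span: severity(span, scores), reverse=True)


# Hypothetical per-segment alignment scores for an ordered sequence.
example_scores = [0.9, 0.2, 0.3, 0.8, 0.1, 0.9, 0.9, 0.9, 0.4]
ranked = rank_clusters([(8, 8), (1, 4)], example_scores)
```

Under these assumptions the cluster spanning positions 1 to 4 ranks ahead of the single-segment cluster at position 8, so it would be reviewed first.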
21 Claims
1. A method of identifying potentially misaligned speech segments from an ordered sequence of speech segments, the method comprising:
generating a first cluster comprising at least one speech segment selected from the ordered sequence if the at least one speech segment satisfies a predetermined filtering test;
generating a second cluster comprising at least one different speech segment selected from the ordered sequence if the at least one different speech segment satisfies the predetermined filtering test and if there is at least one intervening speech segment occupying a sequential position between the at least one speech segment and the at least one different speech segment, the intervening speech segment failing to satisfy the predetermined filtering test; and
combining the first and second clusters and the at least one intervening speech segment to generate an aggregated cluster if the aggregated cluster satisfies a predetermined combining criterion, the aggregated cluster replacing the first and second clusters.
Dependent claims: 2, 3, 4, 5, 6, 7
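The steps recited in claim 1 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the claim leaves both the filtering test and the combining criterion unspecified, so the sketch assumes a hypothetical per-segment score, a filtering test of "score below 0.5", and a combining criterion of "at least 60% of the segments in the aggregated span satisfy the filtering test". The names `Cluster`, `passes_filter`, `may_combine`, and `combine` are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Cluster:
    first: int  # index of the first segment in the cluster (inclusive)
    last: int   # index of the last segment in the cluster (inclusive)


def passes_filter(score: float, threshold: float = 0.5) -> bool:
    # Assumed filtering test: a low score marks a suspect segment.
    return score < threshold


def initial_clusters(scores: Sequence[float]) -> List[Cluster]:
    """Group each maximal run of consecutive suspect segments into a cluster."""
    clusters: List[Cluster] = []
    run_start: Optional[int] = None
    for i, score in enumerate(scores):
        if passes_filter(score):
            if run_start is None:
                run_start = i
        elif run_start is not None:
            clusters.append(Cluster(run_start, i - 1))
            run_start = None
    if run_start is not None:
        clusters.append(Cluster(run_start, len(scores) - 1))
    return clusters


def may_combine(candidate: Cluster, scores: Sequence[float],
                min_fraction: float = 0.6) -> bool:
    # Assumed combining criterion: enough of the aggregated span is suspect.
    span = scores[candidate.first:candidate.last + 1]
    suspect = sum(1 for s in span if passes_filter(s))
    return suspect / len(span) >= min_fraction


def combine(clusters: Sequence[Cluster], scores: Sequence[float]) -> List[Cluster]:
    """Merge neighbouring clusters, absorbing the segments between them."""
    merged: List[Cluster] = []
    for cluster in clusters:
        if merged:
            candidate = Cluster(merged[-1].first, cluster.last)
            if may_combine(candidate, scores):
                merged[-1] = candidate  # aggregated cluster replaces both
                continue
        merged.append(cluster)
    return merged


# Hypothetical scores for an ordered sequence of nine segments.
scores = [0.9, 0.2, 0.3, 0.8, 0.1, 0.9, 0.9, 0.9, 0.4]
initial = initial_clusters(scores)
aggregated = combine(initial, scores)
```

With these example scores the initial clusters cover positions 1 to 2, 4, and 8. The first two merge across the intervening segment at position 3, because three of the four segments in the aggregated span are suspect; the cluster at position 8 stays separate, because merging it would dilute the span to only half suspect segments.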
8. A system for identifying potentially misaligned speech segments from an ordered sequence of speech segments, the system comprising:
a clustering module for generating a first cluster comprising at least one speech segment selected from the ordered sequence if the at least one speech segment satisfies a predetermined filtering test, and generating a second cluster comprising at least one different speech segment selected from the ordered sequence if the at least one different speech segment satisfies the predetermined filtering test and if there is at least one intervening speech segment occupying a sequential position between the at least one speech segment and the at least one different speech segment, the intervening speech segment failing to satisfy the predetermined filtering test; and
a combining module for combining the first and second clusters and the at least one intervening consecutive speech segment to form an aggregated cluster if the aggregated cluster satisfies a predetermined combining criterion.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer-readable storage medium for use in identifying potentially misaligned speech segments from an ordered sequence of speech segments, the computer-readable storage medium comprising computer instructions for:
generating a first cluster comprising at least one speech segment selected from the ordered sequence if the at least one speech segment satisfies a predetermined filtering test;
generating a second cluster comprising at least one different speech segment selected from the ordered sequence if the at least one different speech segment satisfies the predetermined filtering test and if there is at least one intervening speech segment occupying a sequential position between the at least one speech segment and the at least one different speech segment, the intervening speech segment failing to satisfy the predetermined filtering test; and
combining the first and second clusters and the at least one intervening speech segment to generate an aggregated cluster if the aggregated cluster satisfies a predetermined combining criterion, the aggregated cluster replacing the first and second clusters.
Dependent claims: 16, 17, 18, 19, 20, 21
Specification