Method and system for matching speech data
DCFirst Claim
1. A method for matching speech data used to determine the similarity between an input speech data and a sample speech data, the method comprising:
- segmenting the input speech data into a plurality of input speech frames;
segmenting the sample speech data into a plurality of sample speech frames;
building a matching matrix, wherein each element of the matching matrix corresponds to one of the input speech frames and one of the sample speech frames and indicates a distance value between the corresponding input speech frame and the corresponding sample speech frame;
determining a minimum value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a plurality of minimum distance values of the respective rows of elements of the matching matrix, determining a second least value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a plurality of second least distance values of the respective rows of elements of the matching matrix;
summing up the minimum distance values and the second least distance value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a row score, determined by;
determining a minimum value of the distance values indicated in each column of elements of the matching matrix, thereby obtaining a plurality of another minimum distance values of the respective columns of elements of the matching matrix, determining a second least value of the distance values indicated in each column of elements of the matching matrix, thereby obtaining a plurality of second least distance values of the respective columns of elements of the matching matrix;
summing up the another minimum distance values and the second least value of the indicated distance values in each column of elements of the matching matrix distance values, thereby obtaining a column score, wherein;
calculating a matching score obtained by combining the distance row score and the column score; and
determining whether the input speech data and the sample speech data are similar according to the matching score.
1 Assignment
Litigations
0 Petitions
Accused Products
Abstract
A method and system used to determine the similarity between an input speech data and a sample speech data is provided. First, the input speech data is segmented into a plurality of input speech frames and the sample speech data is segmented into a plurality of sample speech frames. Then, the input speech frames and the sample speech frames are used to build a matching matrix, wherein the matching matrix comprises the distance values between each of the input speech frames and each of the sample speech frames. Next, the distance values are used to calculate a matching score. Finally, the similarity between the input speech data and the sample speech data is determined according to this matching score.
-
Citations
12 Claims
-
1. A method for matching speech data used to determine the similarity between an input speech data and a sample speech data, the method comprising:
-
segmenting the input speech data into a plurality of input speech frames; segmenting the sample speech data into a plurality of sample speech frames; building a matching matrix, wherein each element of the matching matrix corresponds to one of the input speech frames and one of the sample speech frames and indicates a distance value between the corresponding input speech frame and the corresponding sample speech frame; determining a minimum value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a plurality of minimum distance values of the respective rows of elements of the matching matrix, determining a second least value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a plurality of second least distance values of the respective rows of elements of the matching matrix; summing up the minimum distance values and the second least distance value of the distance values indicated in each row of elements of the matching matrix, thereby obtaining a row score, determined by; determining a minimum value of the distance values indicated in each column of elements of the matching matrix, thereby obtaining a plurality of another minimum distance values of the respective columns of elements of the matching matrix, determining a second least value of the distance values indicated in each column of elements of the matching matrix, thereby obtaining a plurality of second least distance values of the respective columns of elements of the matching matrix; summing up the another minimum distance values and the second least value of the indicated distance values in each column of elements of the matching matrix distance values, thereby obtaining a column score, wherein; calculating a matching score obtained by combining the distance row score and the column score; and determining whether the input speech data and the sample speech data are similar according to the matching score. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
Specification