Method of identifying duplicate voice recording
First Claim
1. A method of identifying duplicate voice recording, comprising the steps of:
- a) receiving a plurality of digital voice recordings;
b) selecting one of said plurality of digital voice recordings;
c) segmenting the selected digital voice recording;
d) extracting a pitch value from each segment;
e) estimating a total time that voice appears in the selected digital voice recording;
f) removing pitch values that are less than and equal to a user-definable value;
g) identifying unique pitch values in the result of step (f);
h) determining the frequency of occurrence of the unique pitch values;
i) normalizing the result of step (h) so that the frequencies of occurrence are greater than zero and less than one;
j) determining an average pitch value from the pitch values remaining after step (f);
k) determining the distribution percentiles of the result of step (h);
l) if additional digital voice recordings are to be processed then returning to step (b), otherwise proceeding to the next step;
m) comparing the results of steps (e), (j), and (k) for each digital voice recording processed; and
n) declaring the digital voice recordings duplicates that compared to within a user-definable threshold for each of the results of steps (e), (j), and (k).
1 Assignment
0 Petitions
Accused Products
Abstract
A method of identifying duplicate voice recording by receiving digital voice recordings, selecting one of the recordings; segmenting the selected recording, extracting a pitch value per segment, estimating a total time that voice appears in the recording, removing pitch values that are less than and equal to a user-definable value, identifying unique pitch values, determining the frequency of occurrence of the unique pitch values, normalizing the frequencies of occurrence, determining an average pitch value, determining the distribution percentiles of the frequencies of occurrence, returning to the second step if additional recordings are to be processed, otherwise comparing the total voice time, average pitch value, and distribution percentiles for each recording processed, and declaring the recordings duplicates that compared to within a user-definable threshold for total voice time, average pitch value, and distribution percentiles.
-
Citations
17 Claims
-
1. A method of identifying duplicate voice recording, comprising the steps of:
-
a) receiving a plurality of digital voice recordings; b) selecting one of said plurality of digital voice recordings; c) segmenting the selected digital voice recording; d) extracting a pitch value from each segment; e) estimating a total time that voice appears in the selected digital voice recording; f) removing pitch values that are less than and equal to a user-definable value; g) identifying unique pitch values in the result of step (f); h) determining the frequency of occurrence of the unique pitch values; i) normalizing the result of step (h) so that the frequencies of occurrence are greater than zero and less than one; j) determining an average pitch value from the pitch values remaining after step (f); k) determining the distribution percentiles of the result of step (h); l) if additional digital voice recordings are to be processed then returning to step (b), otherwise proceeding to the next step; m) comparing the results of steps (e), (j), and (k) for each digital voice recording processed; and n) declaring the digital voice recordings duplicates that compared to within a user-definable threshold for each of the results of steps (e), (j), and (k). - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
Specification