Method and apparatus for automatically discriminating information bearing audio segments and background noise audio segments
First Claim
1. A method of discriminating information bearing audio segments and background noise audio segments comprising:
- for each kth sample in a series of samples, testing if a Boolean test;
((SK−
1>
−
h1 AND SK<
h2) OR (SK−
1<
h3 AND SK>
−
h4)) where, SK is a kth audio sample, SK−
1 is a (k−
1)th sample that precedes the kth audio sample, h1 is a first, positive valued predetermined threshold, h2 is a second positive valued predetermined threshold, h3 is a third positive valued predetermined threshold, and h4 is a fourth positive valued predetermined threshold, is met, and if so, incrementing a count;
after a predetermined number of samples, inputting the count into a decision function; and
evaluating the decision function to determine if the audio segment is more likely to be background noise or information bearing audio.
1 Assignment
0 Petitions
Accused Products
Abstract
A system (100) for automatically discriminating information bearing audio segments and mere background noise segments processes digitized audio to extract two discriminants between information bearing audio and mere background audio that have a relatively low correlation. One discriminant is based on the rate (relative to the sample rate) at which a specified Boolean test involving sample values is met. Another possible discriminant is based on the variance of time-frequency magnitudes in a number of time windows and frequency bands. The two discriminants are suitably used as the independent variables of probability density functions that model information bearing audio and background noise audio.
41 Citations
24 Claims
-
1. A method of discriminating information bearing audio segments and background noise audio segments comprising:
-
for each kth sample in a series of samples, testing if a Boolean test;
((SK−
1>
−
h1 AND SK<
h2) OR (SK−
1<
h3 AND SK>
−
h4))where, SK is a kth audio sample, SK−
1 is a (k−
1)th sample that precedes the kth audio sample,h1 is a first, positive valued predetermined threshold, h2 is a second positive valued predetermined threshold, h3 is a third positive valued predetermined threshold, and h4 is a fourth positive valued predetermined threshold, is met, and if so, incrementing a count;
after a predetermined number of samples, inputting the count into a decision function; and
evaluating the decision function to determine if the audio segment is more likely to be background noise or information bearing audio. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. An apparatus for discriminating information bearing audio segments and background noise audio segments, the apparatus comprising:
-
a Boolean tester for applying a Boolean test;
((SK−
1>
−
h1 AND SK<
h2) OR (SK−
1<
h3 AND SK>
−
h4))where, SK is a kth audio sample, SK−
1 is a (k−
1)th sample that precedes the kth sample, andh1 is a first positive valued predetermined threshold, h2 is a second positive valued predetermined threshold, h3 is a third positive valued predetermined threshold, h4 is a fourth positive valued predetermined threshold, to each kth sample in a series of samples; and
a summer for summing, over a predetermined number of samples, a number of times that the Boolean tester produces a positive result and outputting a sum;
a decision function evaluator for receiving the sum as input and evaluating a decision function. - View Dependent Claims (9, 10)
-
-
11. An apparatus for discriminating information bearing audio segments and background noise audio segments, the apparatus comprising:
-
a processor;
a memory for storing programming instructions, said memory coupled to said processor, wherein said processor is programmed by said programming instructions to;
test whether a Boolean test;
((SK−
1>
−
h1 AND SK<
h2) OR (SK−
1<
h3 AND SK>
−
h4))where, SK is a kth audio sample, SK−
1 is a (k−
1)th sample that precedes the kth sample, andh1 is a first positive valued predetermined threshold, h2 is a second positive valued predetermined threshold, h3 is a third positive valued predetermined threshold, h4 is a fourth positive valued predetermined threshold, is met for each kth sample in a series of samples, and if so, increment a count;
after a predetermined number of samples, input the count into a decision function; and
evaluate the decision function to determine if the audio segment is more likely to be background noise or information bearing audio. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A computer readable medium storing programming instructions for discriminating information bearing audio segments and background noise audio segments, including programming instructions for:
-
for each kth sample in a series of samples, testing if a Boolean test;
((SK−
1>
−
h1 AND SK<
h2) OR (SK−
1<
h3 AND SK>
−
h4))where, SK is a kth audio sample, SK−
1 is a (k−
1)th sample that precedes the kth audio sample,h1 is a first positive valued predetermined threshold, h2 is a second positive valued predetermined threshold, h3 is a third positive valued predetermined threshold, and h4 is a fourth positive valued predetermined threshold, is met, and if so, incrementing a count;
after a predetermined number of samples, inputting the count into a decision function; and
evaluating the decision function to determine if the audio segment is more likely to be background noise or information bearing audio. - View Dependent Claims (19, 20, 21, 22, 23, 24)
-
Specification