Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding
First Claim
Patent Images
1. A system for voice activity detection (VAD) within a digitally encoded bitstream, the system comprising:
- a parameter extraction module configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and
a VAD classifier configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.
2 Assignments
0 Petitions
Accused Products
Abstract
A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.
-
Citations
20 Claims
-
1. A system for voice activity detection (VAD) within a digitally encoded bitstream, the system comprising:
-
a parameter extraction module configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and a VAD classifier configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method for voice activity detection implemented as a plurality of computer processes executing on at least one hardware processor, the method comprising:
-
extracting parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and evaluating each coded frame with a voice activity detection (VAD) classifier operating based on bitstream coding parameter classification features with input of the digitally encoded bitstream to output a VAD decision whether or not speech is present in one or more of the coded frames. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer program product implemented in a tangible computer readable storage medium for voice activity detection, the product comprising:
-
program code for extracting parameters from a sequence of coded frames from a digitally encoded bitstream containing speech; and program code for evaluating each coded frame with a voice activity detection (VAD) classifier operating based on bitstream coding parameter classification features with input of the digitally encoded bitstream to output a VAD decision whether or not speech is present in one or more of the coded frames. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification