Method and apparatus for transmitting speech activity in distributed voice recognition systems
Abstract
A system and method for transmitting speech activity in a distributed voice recognition system. The distributed voice recognition system includes a local VR engine in a subscriber unit and a server VR engine on a server. The local VR engine comprises an advanced feature extraction (AFE) module that extracts features from a speech signal, and a voice activity detection (VAD) module that detects voice activity within the speech signal. The combined results from the VAD module and the feature extraction module are provided in an efficient manner to a remote device, such as a server, in the form of advanced front end features, thereby enabling the server to process speech segments free of silence regions. Various aspects of efficient speech segment transmission are disclosed.
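The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the energy-threshold VAD, the two toy features (log energy and zero-crossing rate), and every name used (`FrontEndFrame`, `build_front_end_data`, `frame_len`, `threshold`) are assumptions introduced here for illustration. Each frame passes through a voice activity check and a feature extractor, and only voiced frames are kept, each tagged with its frame index so a receiver could tell where silence was removed.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


@dataclass
class FrontEndFrame:
    """One unit of (hypothetical) advanced front end data."""
    index: int             # position of the frame in the original signal
    features: List[float]  # extracted features for this frame
    voice_active: bool     # VAD decision carried alongside the features


def split_frames(signal: Sequence[float], frame_len: int) -> List[Sequence[float]]:
    """Cut the signal into non-overlapping frames, dropping any short tail."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of a frame."""
    return sum(x * x for x in frame) / len(frame)


def detect_voice_activity(frame: Sequence[float], threshold: float) -> bool:
    """Assumed VAD: a frame counts as speech if its energy exceeds a threshold."""
    return frame_energy(frame) > threshold


def extract_features(frame: Sequence[float]) -> List[float]:
    """Assumed features: log energy and zero-crossing rate (stand-ins for AFE output)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    return [math.log(frame_energy(frame) + 1e-10), crossings / (len(frame) - 1)]


def build_front_end_data(signal: Sequence[float],
                         frame_len: int = 4,
                         threshold: float = 0.01) -> List[FrontEndFrame]:
    """Combine VAD decisions and features, keeping only voiced frames."""
    result = []
    for index, frame in enumerate(split_frames(signal, frame_len)):
        if detect_voice_activity(frame, threshold):
            result.append(FrontEndFrame(index, extract_features(frame), True))
    return result
```

In this sketch the silent frames are dropped on the subscriber side, which is one possible reading of "free of silence regions"; a real system could instead send every frame with its VAD flag and let the server discard silence.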
32 Claims
1. A method of providing detected voice activity information associated with a speech signal to a remote device, comprising:
assembling detected voice activity information related to said speech signal;
identifying feature extraction information related to said speech signal;
selectively utilizing said detected voice activity information and said feature extraction information to form advanced front end data; and
providing the advanced front end data comprising detected voice activity information to the remote device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. An apparatus for transmitting speech activity, comprising:
a voice activity detector;
a feature extractor operating substantially in parallel to the voice activity detector;
a transmitter; and
a receiving device;
wherein the feature extractor and voice activity detector operate to extract features from speech and detect voice activity information from speech and selectively utilize extracted features and detected voice activity information to form advanced front end data.
Dependent claims: 13, 14, 15, 16, 17, 18, 19, 20, 21
22. A method of transmitting speech data to a remote device, comprising:
extracting voice activity data from the speech data;
identifying feature extraction data from the speech data; and
selectively transmitting information related to said voice activity data and said feature extraction data in the form of advanced front end data to the remote device.
Dependent claims: 23, 24, 25, 26, 27, 28, 29, 30, 31, 32
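As an illustration of the selective transmission step recited in claim 22, the following sketch packs per-frame voice activity flags and features into a byte payload and decodes it on the receiving side. The wire layout (a 16-bit frame count, then for each frame a 32-bit index, an 8-bit VAD flag, an 8-bit feature count, and 32-bit floats) is purely hypothetical and is not taken from the patent, as are the names `encode_frames`, `decode_frames`, and `send_silence`.

```python
import struct
from typing import List, Tuple

# (frame index, VAD flag, feature values); a hypothetical in-memory form
Frame = Tuple[int, bool, List[float]]


def encode_frames(frames: List[Frame], send_silence: bool = False) -> bytes:
    """Serialize frames in network byte order, skipping silent frames by default."""
    selected = [f for f in frames if f[1] or send_silence]
    payload = struct.pack("!H", len(selected))  # number of frames that follow
    for index, active, features in selected:
        payload += struct.pack("!IBB", index, int(active), len(features))
        payload += struct.pack(f"!{len(features)}f", *features)
    return payload


def decode_frames(payload: bytes) -> List[Frame]:
    """Inverse of encode_frames, as a receiving device might apply it."""
    (count,) = struct.unpack_from("!H", payload, 0)
    offset = 2
    frames: List[Frame] = []
    for _ in range(count):
        index, active, n_features = struct.unpack_from("!IBB", payload, offset)
        offset += 6  # 4-byte index + 1-byte flag + 1-byte feature count
        features = list(struct.unpack_from(f"!{n_features}f", payload, offset))
        offset += 4 * n_features
        frames.append((index, bool(active), features))
    return frames
```

Because each transmitted frame carries its original index, the receiver can reconstruct where silence was omitted without receiving the silent frames themselves. Passing `send_silence=True` sends every frame instead, leaving the VAD flag as the only silence marker.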
Specification