Timing of speech recognition over lossy transmission systems
First Claim
1. A method of recognizing speech, the method causing a computing device to perform steps comprising:
- generating via the computing device from received packets associated with input speech a speech vector;
identifying via the computing device features in the speech vector associated with corrupt data;
comparing via the computing device the speech vector to stored recognition models based on non-corrupt features in the speech vector to generate a first result or a second result;
recognizing via the computing device the speech input if the comparison generates the first result; and
requesting via the computing device a restoring mission of at least one packet if the comparison generates the second result.
4 Assignments
0 Petitions
Accused Products
Abstract
Recognizing a stream of speech received as speech vectors over a lossy communications link includes constructing for a speech recognizer a series of speech vectors from packets received over a lossy packetized transmission link, wherein some of the packets associated with each speech vector are lost or corrupted during transmission. Each constructed speech vector is multi-dimensional and includes associated features. After waiting for a predetermined time, speech vectors are generated and potentially corrupted features within the speech vector are indicated to the speech recognizer when present. Speech recognition is attempted at the speech recognizer on the speech vectors when corrupted features are present. This recognition may be based only on certain or valid features within each speech vector. Retransmission of a missing or corrupted packet is requested when corrupted values are indicated by the indicating step and when the attempted recognition step fails.
-
Citations
18 Claims
-
1. A method of recognizing speech, the method causing a computing device to perform steps comprising:
-
generating via the computing device from received packets associated with input speech a speech vector; identifying via the computing device features in the speech vector associated with corrupt data; comparing via the computing device the speech vector to stored recognition models based on non-corrupt features in the speech vector to generate a first result or a second result; recognizing via the computing device the speech input if the comparison generates the first result; and requesting via the computing device a restoring mission of at least one packet if the comparison generates the second result. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for recognizing speech, the system comprising:
-
a processor; a module configured to control the processor to generate from received packets associated with input speech a speech vector; a module configured to identify features in the speech vector associated with corrupt data; a module configured to compare the speech vector to stored recognition models based on non-corrupt features in the speech vector to generate a first result or a second result; a module configured, if the comparison generates the first result, to recognize the speech input; and a module configured, if the comparison generates the second result to request a restoring mission of at least one packet. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A tangible computer readable medium storing a computer program for having instructions for controlling a computing device to recognize speech, the instructions comprising:
-
generating via a processor from received packets associated with input speech a speech vector; identifying features in the speech vector associated with corrupt data; comparing the speech vector to stored recognition models based on non-corrupt features in the speech vector to generate a first result or a second result; recognizing the speech input if the comparison generates the first result; and requesting a restoring mission of at least one packet if the comparison generates the second result. - View Dependent Claims (14, 15, 16, 17, 18)
-
Specification