SEARCH OPTIMIZATION FOR CONTINUOUS SPEECH RECOGNITION
First Claim
1. A method for continuous speech recognition comprising:
- incorporating semantic information during searching by a continuous speech recognizer.
6 Assignments
0 Petitions
Accused Products
Abstract
A system and method for continuous speech recognition (CSR) is optimized to reduce processing time for connected word grammars bounded by semantically null words. The savings, which reduce processing time both during the forward and the backward passes of the search, as well as during rescoring, are achieved by performing only the minimal amount of computation required to produce an exact N-best list of semantically meaningful words (N-best list of salient words). This departs from the standard Spoken Language System modeling which any notion of meaning is handled by the Natural Language Understanding (NLU) component. By expanding the task of the recognizer component from a simple acoustic match to allow semantic information to be fed to the recognizer, significant processing time savings are achieved, and make it possible to run an increased number of speech recognition channels in parallel for improved performance, which may enhance users perception of value and quality of service.
-
Citations
18 Claims
-
1. A method for continuous speech recognition comprising:
incorporating semantic information during searching by a continuous speech recognizer. - View Dependent Claims (2, 14)
-
3. A method for continuous speech recognition
providing speech input to a continuous speech recognizer, providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information, performing recognition of speech input using semantic information to eliminate semantically null words from the N-best list of words and restrict searching to an N-best list of salient words, and performing word matching to output from the speech recognizer the N-best salient word sequences.
-
6. A method according to 5 wherein right-most semantically null networks are not computed.
-
7. A method according to 5 wherein some semantically salient words are not backward-scored.
- 8. A method according to 5 wherein an N-best list of only salient words is rescored instead of a true N-best list.
-
10. Software on a machine readable medium for performing a method of continuous speech recognition comprising:
incorporating semantic information during searching by a continuous speech recognizer. - View Dependent Claims (11)
-
12. Software on a machine readable medium for performing a method for continuous speech recognition
providing speech input to a continuous speech recognizer, providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information, performing recognition of speech input using semantic information to eliminate semantically null words from the N-best list of words and restrict searching to an N-best list of salient words,
-
13. A system for continuous speech recognition comprising:
means for incorporating semantic information during searching by a continuous speech recognizer.
-
15. A system for continuous speech recognition
comprising a continuous speech recognizer, input means for providing speech input to the continuous speech recognizer, means for providing to the continuous speech recognizer an acoustic model comprising a set of Hidden Markov Models, and a language model comprising both grammar and semantic information, the continuous speech recognizer comprising means for performing recognition of speech input using the semantic information for eliminating semantically null words from the N-best list of words and thereby restricting searching to an N-best list of salient words, and performing word matching to output the N-best salient word sequences.
-
17. A spoken language processing system for speech recognition comprising:
-
a continuous speech recognition component (CSR) a natural language understanding component (NLU) means for providing speech input to the CSR, means for providing acoustic-phonetic knowledge to the CSR comprising a set of Hidden Markov Models;
means for providing language knowledge comprising grammar and statistical models to the CSR, and means for providing semantic knowledge the NLU, and means for providing semantic knowledge to the CSR, the CSR being operable for searching using the semantic knowledge to constrain the search to an N-best list of salient words, and perform word matching to output N-best list of salient words to the NLU for interpretation of meaning.
-
-
18. A method for continuous speech recognition using a spoken language system comprising a continuous speech recognition component (CSR) linked to a natural language understanding component (NLU)
providing speech input to the CSR providing acoustic-phonetic knowledge to the CSR comprising a set of Hidden Markov Models; -
providing language knowledge comprising grammar and statistical models to the CSR;
providing language knowledge semantic knowledge to the CSR;
performing searching with the CSR using the semantic knowledge to constrain the search to an N-best list of salient words comprising semantically meaningful words of the N-best list of words, and performing word matching to output the N-best salient word sequences to the NLU.
-
Specification