DISCRIMINATIVE TRAINING OF AUTOMATIC SPEECH RECOGNITION MODELS WITH NATURAL LANGUAGE PROCESSING DICTIONARY FOR SPOKEN LANGUAGE PROCESSING
First Claim
Patent Images
1. A method for language processing, comprising:
- training one or more automatic speech recognition models using an automatic speech recognition dictionary;
determining a set of N automatic speech recognition hypotheses that characterize a spoken input, based on the one or more automatic speech recognition models, using a processor;
selecting a hypothesis from the set of N automatic speech recognition hypotheses using a discriminative language model and a list of relevant words according to an error rate of relevant words in each hypothesis; and
performing natural language processing on the best hypothesis.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods and systems for language processing includes training one or more automatic speech recognition models using an automatic speech recognition dictionary. A set of N automatic speech recognition hypotheses for an input is determined, based on the one or more automatic speech recognition models, using a processor. A best hypothesis is selected using a discriminative language model and a list of relevant words. Natural language processing is performed on the best hypothesis.
21 Citations
20 Claims
-
1. A method for language processing, comprising:
-
training one or more automatic speech recognition models using an automatic speech recognition dictionary; determining a set of N automatic speech recognition hypotheses that characterize a spoken input, based on the one or more automatic speech recognition models, using a processor; selecting a hypothesis from the set of N automatic speech recognition hypotheses using a discriminative language model and a list of relevant words according to an error rate of relevant words in each hypothesis; and performing natural language processing on the best hypothesis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for language processing, comprising:
-
training one or more automatic speech recognition models using an automatic speech recognition dictionary; determining a set of N automatic speech recognition hypotheses that characterize a spoken input, based on the one or more automatic speech recognition models, using a processor, comprising; concatenating all words in the natural language processing training data to generate raw natural language processing text; tokenizing the raw natural language processing text using the automatic speech recognition dictionary; collecting tokenized words that appear more than a threshold number of times; and adding entries of a natural language processing dictionary to the collected tokenized words to form the relevant word list; selecting a hypothesis from the set of N automatic speech recognition hypotheses using a discriminative language model and a list of relevant words according to an error rate of relevant words in each hypothesis, comprising; determining a relevant word error rate for each hypothesis that considers only words that are on the relevant word list using the discriminative language model; and selecting the hypothesis having the lowest relevant word error rate; and performing natural language processing on the best hypothesis.
-
-
12. A system for language processing, comprising:
-
an automatic speech recognition module comprising a processor configured to train one or more automatic speech recognition models using an automatic speech recognition dictionary, to determine a set of N automatic speech recognition hypotheses that characterize a spoken input based on the one or more automatic speech recognition models, and to select a hypothesis from the set of N automatic speech recognition hypotheses using a discriminative language model and a list of relevant words according to an error rate of relevant words in each hypothesis; and a natural language processing module configured to perform natural language processing on the best hypothesis. - View Dependent Claims (13, 14, 15, 16, 17, 18, 19, 20)
-
Specification