LANGUAGE MODELING IN SPEECH RECOGNITION
First Claim
1. A computer-implemented method, comprising:
- providing a training set of text samples to a semantic parser that associates text samples with domains;
obtaining data that indicates associations determined by the semantic parser between at least some of the text samples of the training set and one or more domains;
generating a first subset of text samples that the semantic parser has associated with a first of the one or more domains;
generating a first language model for the first of the one or more domains using the first subset of text samples that the semantic parser has associated with the first of the one or more domains; and
performing speech recognition on an utterance using the first language model for the first of the one or more domains.
2 Assignments
0 Petitions
Accused Products
Abstract
Some implementations include a computer-implemented method. The method can include providing a training set of text samples to a semantic parser that associates text samples with actions. The method can include obtaining, for each of one or more of the text samples of the training set, data that indicates one or more domains that the semantic parser has associated with the text sample. For each of one or more domains, a subset of the text samples of the training set can be generated that the semantic parser has associated with the domain. Using the subset of text samples associated with the domain, a language model can be generated for one or more of the domain. Speech recognition can be performed on an utterance using the one or more language models that are generated for the one or more of the domains.
-
Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
providing a training set of text samples to a semantic parser that associates text samples with domains; obtaining data that indicates associations determined by the semantic parser between at least some of the text samples of the training set and one or more domains; generating a first subset of text samples that the semantic parser has associated with a first of the one or more domains; generating a first language model for the first of the one or more domains using the first subset of text samples that the semantic parser has associated with the first of the one or more domains; and performing speech recognition on an utterance using the first language model for the first of the one or more domains. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. One or more computer-readable storage devices having instructions stored thereon that, when executed by one or more computers, cause the one or more computers to perform operations comprising:
-
providing a training set of text samples to a semantic parser that associates text samples with domains; obtaining data that indicates associations determined by the semantic parser between at least some of the text samples of the training set and one or more domains; generating a first subset of text samples that the semantic parser has associated with a first of the one or more domains; generating a first language model for the first of the one or more domains using the first subset of text samples that the semantic parser has associated with the first of the one or more domains; and performing speech recognition on an utterance using the first language model for the first of the one or more domains. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A system comprising:
one or more computers configured to provide; a repository of training data that includes a plurality of text samples in a natural language; a semantic parser configured to process a set of text samples from the plurality of text samples to determine, for each text sample in the set of text samples, a domain associated with the text sample; a training set manager configured to generate subsets of text samples that correspond to respective domains, wherein each subset of text samples includes text samples that the semantic parser has associated with the domain that corresponds to the subset of text samples; a language modeling engine configured to generate a respective language model for each of the subsets of text samples; and a speech recognizer configured to receive an utterance and to recognize the utterance using one or more of the language models that are generated for each of the subsets of text samples.
Specification