Generating language models
First Claim
1. A method performed by one or more computers, the method comprising:
- accessing data indicating a set of classes that each represent a different level of specificity of a same, particular semantic concept, wherein the classes respectively correspond to different types of words or phrases, and each class includes multiple words or phrases of the corresponding type for the class;
identifying a language sequence including a particular word or phrase that corresponds to the particular semantic concept;
generating a first modified language sequence by replacing the particular word or phrase with a symbol representing the first class;
generating a second modified language sequence by replacing the particular word or phrase with a symbol representing at least one the second classes;
generating a first language model in which a single first class from the set of classes represents the particular semantic concept, the first language model being trained using the first modified language sequence;
generating a second language model in which multiple second classes from the set of classes represent the particular semantic concept at a greater level of specificity than the first class, each of the second classes being different from the first class, the second language model being trained using the second modified language sequence; and
selecting the first class or the multiple second classes based on output of the first language model and output of the second language model.
2 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating language models. In some implementations, data is accessed that indicates a set of classes corresponding to a concept. A first language model is generated in which a first class represents the concept. A second language model is generated in which second classes represent the concept. Output of the first language model and the second language model is obtained, and the outputs are evaluated. A class from the set of classes is selected based on evaluating the output of the first language model and the output of the second language model. In some implementations, the first class and the second class are selected from a parse tree or other data that indicates relationships among the classes in the set of classes.
231 Citations
20 Claims
-
1. A method performed by one or more computers, the method comprising:
-
accessing data indicating a set of classes that each represent a different level of specificity of a same, particular semantic concept, wherein the classes respectively correspond to different types of words or phrases, and each class includes multiple words or phrases of the corresponding type for the class; identifying a language sequence including a particular word or phrase that corresponds to the particular semantic concept; generating a first modified language sequence by replacing the particular word or phrase with a symbol representing the first class; generating a second modified language sequence by replacing the particular word or phrase with a symbol representing at least one the second classes; generating a first language model in which a single first class from the set of classes represents the particular semantic concept, the first language model being trained using the first modified language sequence; generating a second language model in which multiple second classes from the set of classes represent the particular semantic concept at a greater level of specificity than the first class, each of the second classes being different from the first class, the second language model being trained using the second modified language sequence; and selecting the first class or the multiple second classes based on output of the first language model and output of the second language model. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
one or more computers; and one or more storage devices storing instructions that are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising; accessing data indicating a set of classes that each represent a different level of specificity of a same, particular semantic concept, wherein the classes respectively correspond to different types of words or phrases, and each class includes multiple words or phrases of the corresponding type for the class; identifying a language sequence including a particular word or phrase that corresponds to the particular semantic concept; generating a first modified language sequence by replacing the particular word or phrase with a symbol representing the first class; generating a second modified language sequence by replacing the particular word or phrase with a symbol representing at least one the second classes; generating a first language model in which a single first class from the set of classes represents the particular semantic concept, the first language model being trained using the first modified language sequence; generating a second language model in which multiple second classes from the set of classes represent the particular semantic concept at a greater level of specificity than the first class, each of the second classes being different from the first class, the second language model being trained using the second modified language sequence; and selecting the first class or the multiple second classes based on output of the first language model and output of the second language model.
-
-
16. A non-transitory computer-readable storage device storing instructions that, when executed by a computer, cause the computer to perform operations comprising:
-
accessing data indicating a set of classes that each represent a different level of specificity of a same, particular semantic concept, wherein the classes respectively correspond to different types of words or phrases, and each class includes multiple words or phrases of the corresponding type for the class; identifying a language sequence including a particular word or phrase that corresponds to the particular semantic concept; generating a first modified language sequence by replacing the particular word or phrase with a symbol representing the first class; generating a second modified language sequence by replacing the particular word or phrase with a symbol representing at least one the second classes; generating a first language model in which a single first class from the set of classes represents the particular semantic concept, the first language model being trained using the first modified language sequence; generating a second language model in which multiple second classes from the set of classes represent the particular semantic concept at a greater level of specificity than the first class, each of the second classes being different from the first class, the second language model being trained using the second modified language sequence; and selecting the first class or the multiple second classes based on output of the first language model and output of the second language model. - View Dependent Claims (17, 18, 19, 20)
-
Specification