Supervised automatic text generation based on word classes for language modeling
Abstract
A system and method are provided that randomly generate text with a given structure. The structure is taken from a number of learning examples and is captured by classifying their words and defining the relationships between word classes in a given language. The text generated with this procedure is intended to replicate the information given by the original learning examples. The resulting text may be used to better model the structure of a language in a stochastic language model.
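The abstract's idea of capturing structure through word classes and the relationships between them can be sketched as follows. This is a minimal illustration, assuming a hand-written lexicon and bigrams (N = 2); `LEXICON`, the class labels, and `class_ngrams` are hypothetical names chosen for the example, not taken from the patent.

```python
from collections import Counter

# Hypothetical word -> word class lexicon (an assumption for illustration only).
LEXICON = {
    "the": "DET", "a": "DET",
    "dog": "NOUN", "cat": "NOUN", "ball": "NOUN",
    "chases": "VERB", "sees": "VERB",
}

def class_ngrams(sentences, n=2):
    """Count the class N-grams observed in the learning examples.

    Words missing from LEXICON raise KeyError; a real classifier would
    need a strategy for unknown words.
    """
    counts = Counter()
    for sentence in sentences:
        classes = [LEXICON[w] for w in sentence.lower().split()]
        for i in range(len(classes) - n + 1):
            counts[tuple(classes[i:i + n])] += 1
    return counts

examples = ["the dog chases a cat", "a cat sees the ball"]
valid = class_ngrams(examples)
```

Any class sequence whose N-grams all appear in `valid` shares the structure of the learning examples, even if its words never occurred together.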
22 Claims
1. A method for use in supervised automatic text generation comprising:
inputting an original text of a language;
analyzing the original text to provide word classes;
generating an N-gram of valid word classes using the word classes;
generating a string of random word classes;
validating a group of word classes in the string of random word classes using the N-gram of the valid word classes;
providing a word for words in classes;
substituting the word in the group of word classes; and
outputting new text in the language.
Dependent claims: 2, 3, 4, 5, 6.
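The steps of claim 1 can be sketched end to end as below. This is one possible reading, not the patent's implementation: it assumes bigrams (N = 2), a given word-to-class lexicon, and a simple accept/reject check on the whole random class string. All names (`LEXICON`, `analyze`, `valid_ngrams`, `is_valid`, `generate`) are hypothetical.

```python
import random

# Hypothetical word -> word class lexicon (assumed to be available).
LEXICON = {
    "the": "DET", "a": "DET",
    "dog": "NOUN", "cat": "NOUN",
    "chases": "VERB", "sees": "VERB",
}

def analyze(text, lexicon):
    """Map each word of the original text to its word class."""
    return [lexicon[w] for w in text.lower().split()]

def valid_ngrams(classes, n=2):
    """Collect the set of class N-grams seen in the original text."""
    return {tuple(classes[i:i + n]) for i in range(len(classes) - n + 1)}

def is_valid(candidate, ngrams, n=2):
    """Accept a class string only if every N-gram in it is a valid one."""
    return all(tuple(candidate[i:i + n]) in ngrams
               for i in range(len(candidate) - n + 1))

def generate(text, lexicon, length=5, n=2, seed=0, max_tries=10_000):
    """Return new text with the class structure of `text`, or None on failure."""
    rng = random.Random(seed)
    classes = analyze(text, lexicon)
    ngrams = valid_ngrams(classes, n)
    inventory = sorted(set(classes))
    # Group the lexicon's words by class so one can be drawn per class slot.
    words_in_class = {}
    for word, cls in lexicon.items():
        words_in_class.setdefault(cls, []).append(word)
    for _ in range(max_tries):
        # String of random word classes, then validation against the N-grams.
        candidate = [rng.choice(inventory) for _ in range(length)]
        if is_valid(candidate, ngrams, n):
            # Substitute a word of the matching class for each class slot.
            return " ".join(rng.choice(words_in_class[c]) for c in candidate)
    return None

new_text = generate("the dog chases a cat", LEXICON)
```

Rejecting whole strings is wasteful for long outputs; it is used here only because it keeps the correspondence with the claimed steps easy to follow.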

7. A method for use in supervised automatic text generation comprising:
providing an N-gram of valid word classes;
generating a string of random word classes;
validating a group of word classes in the string of random word classes using the N-gram of valid word classes;
providing a word in a language for words in classes;
substituting the word in the class in the group of word classes; and
outputting new text in the language.
Dependent claims: 8, 9, 10.
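Claim 7 starts from an N-gram that is already provided and validates groups within a longer random class string. One way to picture this, offered as an assumption rather than as the patented procedure, is to scan the random string and keep each maximal run whose bigrams are all valid. `VALID`, `WORDS`, `valid_groups`, and the `min_len` threshold are hypothetical.

```python
import random

# Hypothetical, pre-supplied valid class bigrams and words per class.
VALID = {("DET", "NOUN"), ("NOUN", "VERB"), ("VERB", "DET")}
WORDS = {"DET": ["the", "a"], "NOUN": ["dog", "cat"], "VERB": ["sees", "chases"]}

def valid_groups(class_string, valid, min_len=3):
    """Split a class string into maximal runs whose bigrams are all valid."""
    groups, start = [], 0
    for i in range(1, len(class_string) + 1):
        at_end = i == len(class_string)
        # Close the current run at the end or at the first invalid bigram.
        if at_end or (class_string[i - 1], class_string[i]) not in valid:
            if i - start >= min_len:
                groups.append(class_string[start:i])
            start = i
    return groups

rng = random.Random(0)
class_string = [rng.choice(sorted(WORDS)) for _ in range(200)]
for group in valid_groups(class_string, VALID):
    # Substitute one word of the matching class for each class in the group.
    print(" ".join(rng.choice(WORDS[c]) for c in group))
```

Compared with rejecting an entire string, this reuses every valid stretch of a single random draw, at the cost of producing groups of varying length.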

11. A method for use in supervised automatic text generation comprising:
inputting an original text of a language;
analyzing the original text to provide word classes;
generating an N-gram of valid word classes using the word classes;
generating a string of random word classes;
validating a group of word classes in the string of random word classes using the N-gram of valid word classes;
providing a word for words in classes;
substituting the word in a word class in the group of word classes; and
outputting new text in the language.

12. In a supervised automatic text generation system, a system comprising:
an input for inputting an original text of a language;
a text analyzer for analyzing original text to provide word classes;
an N-gram generator for generating an N-gram of valid word classes using the word classes;
a random class generator for generating a string of random word classes;
a validating mechanism for validating a group of word classes in the string of random word classes using the N-gram of the valid word classes;
a word in classes mechanism for providing a word for words in classes;
a substitution mechanism for substituting the word in the group of word classes; and
an output for outputting new text in the language.
Dependent claims: 13, 14, 15, 16, 17.

18. In a supervised automatic text generation system, a system comprising:
a random class generator for generating a string of random word classes;
a validating mechanism for validating a group of word classes in the string of random word classes using an N-gram of valid word classes;
a word in classes mechanism for providing a word in a language for words in classes;
a substitution mechanism substituting the word in the group of word classes; and
an output for outputting new text in the language.
Dependent claims: 19, 20, 21.

22. In a supervised automatic text generation system, a system comprising:
an input for inputting original text of a language;
a text analyzer for analyzing the original text to provide word classes;
an N-gram generator for generating an N-gram of valid word classes using the word classes;
a random class generator for generating a string of random word classes;
a validating mechanism for validating a group of word classes in the string of random word classes using the N-gram of valid word classes;
a word in classes mechanism for providing a word for words in classes;
a substitution mechanism for substituting the word in a word class in the group of word classes; and
an output outputting new text in the language.

Specification