×

Method and apparatus for generation of text documents

  • US 20060155530A1
  • Filed: 12/14/2005
  • Published: 07/13/2006
  • Est. Priority Date: 12/14/2004
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for the generation of text documents, comprising the steps of:

  • (a) collecting a set of text documents as training documents and selecting a language model including model parameters (21);

    (b) training of the language model by using the training documents and the model parameters (22);

    (c) generating new documents (24) by using said probabilities and by using additional words beyond the words contained in the training documents, the new documents comprising the same distribution of their length as the training documents; and

    (d) determine if the deviations of the word frequency as a function of the word rank (Zipf'"'"'s law) and the growths of the vocabulary as a function of the number of terms (Heap'"'"'s law) are below user defined thresholds (42, 66) and accepting only new documents which fulfil this condition.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×