Retrieval of Documents Using Language Models

  • US 20080059187A1
  • Filed: 08/30/2007
  • Published: 03/06/2008
  • Est. Priority Date: 08/31/2006
  • Status: Active Grant
First Claim

1. A method of modeling documents comprising:

  • receiving a plurality of documents;
  • for each of the plurality of documents:
    • tokenizing text included in the document;
    • building a language model, including:
      • identifying paragraph boundaries in the tokenized text;
      • identifying word pairs in the paragraphs;
      • calculating the frequency of the word pairs in the paragraphs;
      • adding the word pairs and corresponding frequency information to the language model.
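The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and the claim leaves several details open, so the following are assumptions made only for illustration: a paragraph boundary is a blank line, a "word pair" is two adjacent tokens within one paragraph, "frequency" is a raw count, and the language model is a per-document `Counter`. The names `PARAGRAPH_MARK`, `tokenize`, `split_paragraphs`, `build_language_model`, and `model_documents` are hypothetical.

```python
import re
from collections import Counter

# Hypothetical boundary token; the claim does not say how boundaries are marked.
PARAGRAPH_MARK = "<P>"


def tokenize(text):
    """Return lowercase word tokens, with PARAGRAPH_MARK at each blank line."""
    tokens = []
    # The capturing group keeps the blank-line separators in the split result.
    for piece in re.split(r"(\n\s*\n)", text):
        if re.fullmatch(r"\n\s*\n", piece):
            tokens.append(PARAGRAPH_MARK)
        else:
            tokens.extend(re.findall(r"[a-z0-9']+", piece.lower()))
    return tokens


def split_paragraphs(tokens):
    """Identify paragraph boundaries in the token stream and group the tokens."""
    paragraphs, current = [], []
    for tok in tokens:
        if tok == PARAGRAPH_MARK:
            if current:
                paragraphs.append(current)
            current = []
        else:
            current.append(tok)
    if current:
        paragraphs.append(current)
    return paragraphs


def build_language_model(text):
    """Count word pairs per paragraph and add them to one model per document."""
    model = Counter()
    for paragraph in split_paragraphs(tokenize(text)):
        # Assumption: a word pair is two adjacent tokens in the same paragraph,
        # so pairs never span a paragraph boundary.
        model.update(zip(paragraph, paragraph[1:]))
    return model


def model_documents(documents):
    """Build one language model for each document in a {doc_id: text} mapping."""
    return {doc_id: build_language_model(text) for doc_id, text in documents.items()}


if __name__ == "__main__":
    docs = {"d1": "The cat sat.\n\nThe cat ran."}
    models = model_documents(docs)
    print(models["d1"].most_common())
```

Because pairs are formed inside each paragraph separately, the last word of one paragraph is never paired with the first word of the next. A different reading of "word pairs" (for example, any two words co-occurring in a paragraph) would change only the line that builds the pairs.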

  • 11 Assignments