Phrase-based generation of document descriptions

  • US 7,584,175 B2
  • Filed: 07/26/2004
  • Issued: 09/01/2009
  • Est. Priority Date: 07/26/2004
  • Status: Active Grant
First Claim

1. A method of automatically generating a description of a document, the method comprising:

    retrieving a document in response to a query, the query comprising a query phrase, the document including a plurality of sentences;

    calculating, by operation of a processor adapted to manipulate data within a computer system, for sentences of the document, a first count that includes a measure of a number of instances in which the query phrase occurs in the sentences;

    calculating, by operation of a processor adapted to manipulate data within a computer system, for sentences of the document, a second count that includes a measure of a number of instances in which any of one or more related phrases of the query phrase occurs in the sentences;

    calculating, by operation of a processor adapted to manipulate data within a computer system, for sentences of the document, a third count that includes a measure of a number of instances in which any of one or more phrase extensions of the query phrase occurs in the sentences, wherein a phrase extension is a super-sequence that begins with the query phrase;

    selecting one or more of the sentences of the document based on their respective first, second and third counts; and

    forming a description of the document from the selected sentences, wherein a phrase g_j is a related phrase of another phrase g_k where an information gain of g_j with respect to g_k exceeds a predetermined threshold, the information gain being a function of both actual and expected co-occurrence rates of g_j and g_k.
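
The steps of the claim can be illustrated with a minimal Python sketch. It is not the patented implementation, and several choices in it are assumptions made only for illustration: the information gain is taken to be the ratio of actual to expected co-occurrence rate (the claim says only that it is a function of both), the threshold of 1.5 is arbitrary, phrases are matched as case-insensitive whole words, and sentences are ranked lexicographically by their (first, second, third) counts. All function and parameter names are hypothetical.

```python
import re


def information_gain(actual_rate, expected_rate):
    """Assumed form: ratio of actual to expected co-occurrence rate."""
    if expected_rate <= 0:
        return 0.0
    return actual_rate / expected_rate


def is_related(actual_rate, expected_rate, threshold=1.5):
    """A phrase is related when its information gain exceeds the threshold.

    The default threshold is an illustrative value, not one from the claim.
    """
    return information_gain(actual_rate, expected_rate) > threshold


def count_phrase(sentence, phrase):
    """Count case-insensitive, whole-word occurrences of phrase in sentence."""
    pattern = r"\b" + re.escape(phrase) + r"\b"
    return len(re.findall(pattern, sentence, flags=re.IGNORECASE))


def sentence_counts(sentence, query_phrase, related_phrases, extensions):
    """Return the (first, second, third) counts for one sentence.

    Note: an occurrence of a phrase extension also contains the query
    phrase, so it contributes to the first count as well in this sketch.
    """
    first = count_phrase(sentence, query_phrase)
    second = sum(count_phrase(sentence, p) for p in related_phrases)
    third = sum(count_phrase(sentence, p) for p in extensions)
    return first, second, third


def describe(sentences, query_phrase, related_phrases, extensions,
             max_sentences=2):
    """Select the best-scoring sentences and join them into a description."""
    # Keep only true extensions: super-sequences that begin with the query phrase.
    prefix = query_phrase.lower() + " "
    valid_ext = [e for e in extensions if e.lower().startswith(prefix)]

    scored = []
    for index, sentence in enumerate(sentences):
        counts = sentence_counts(sentence, query_phrase,
                                 related_phrases, valid_ext)
        scored.append((counts, index, sentence))

    # Assumed ranking rule: compare counts lexicographically, highest first.
    ranked = sorted(scored, key=lambda item: item[0], reverse=True)
    # Drop sentences with no matches at all, then keep the top few.
    top = [item for item in ranked if any(item[0])][:max_sentences]
    # Present the selected sentences in their original document order.
    top.sort(key=lambda item: item[1])
    return " ".join(item[2] for item in top)
```

For example, calling `describe` with the query phrase "Australian Shepherd", related phrases such as "blue merle", and the extension "Australian Shepherd club" would favor sentences that contain the query phrase itself, using the related-phrase and extension counts to break ties. A weighted sum of the three counts would be an equally valid reading of "based on their respective first, second and third counts".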
