Evaluating distinctiveness of document

  • US 20040006736A1
  • Filed: 06/13/2003
  • Published: 01/08/2004
  • Est. Priority Date: 07/04/2002
  • Status: Active Grant
First Claim

1. A method of evaluating a degree of distinctiveness of each document segment contained in a target document including at least one document segment with respect to a comparison document including at least one document segment, and identifying a distinctive document segment, the method comprising:

    (a) identifying a respective document segment vector for each document segment contained in the comparison document and the target document, each document segment vector having component values associated with occurring frequencies of terms occurring in its respective document segment;

    (b) computing squared sum matrices respectively corresponding to the comparison document and the target document, from said document segment vectors;

    (c) computing a predetermined number of orders of topic difference factor vectors of the target document from said squared sum matrices corresponding to the comparison document and the target document;

    (d) computing respective degrees of distinctiveness of said respective orders and a total degree of distinctiveness for each document segment of the target document, from said corresponding document segment vector and said topic difference factor vectors of said respective orders; and

    (e) identifying a distinctive document segment in the target document, on the basis of the degrees of distinctiveness of said respective orders or on the basis of the total degree of distinctiveness thereof.
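
Steps (a) through (e) can be illustrated with a small numerical sketch. The claim does not say how the topic difference factor vectors are derived from the two squared sum matrices, so the code below makes several assumptions of its own: the squared sum matrix is taken to be the sum of outer products of the segment vectors, the factor vectors are taken to be the leading generalized eigenvectors of the target matrix relative to the comparison matrix, a small ridge term keeps the comparison matrix invertible, and the degree of distinctiveness of each order is the squared projection of a segment vector onto that factor. All function names, the `ridge` parameter, and the whitespace tokenizer are hypothetical and chosen for illustration only.

```python
import numpy as np


def segment_vectors(segments, vocab):
    """Step (a): one term-frequency vector per document segment."""
    index = {term: i for i, term in enumerate(vocab)}
    vecs = np.zeros((len(segments), len(vocab)))
    for row, segment in enumerate(segments):
        for term in segment.lower().split():  # naive tokenizer (assumption)
            if term in index:
                vecs[row, index[term]] += 1.0
    return vecs


def squared_sum_matrix(vecs):
    """Step (b): sum of outer products d d^T over all segments (assumption)."""
    return vecs.T @ vecs


def topic_difference_factors(s_target, s_comparison, orders, ridge=1e-3):
    """Step (c): leading solutions of S_T a = lambda (S_C + ridge*I) a.

    Assumed reading of the claim, not a formula taken from it.
    """
    dim = s_comparison.shape[0]
    chol = np.linalg.cholesky(s_comparison + ridge * np.eye(dim))
    # Whiten the target matrix: L^{-1} S_T L^{-T}
    half = np.linalg.solve(chol, s_target)
    whitened = np.linalg.solve(chol, half.T).T
    whitened = (whitened + whitened.T) / 2.0  # remove rounding asymmetry
    eigvals, eigvecs = np.linalg.eigh(whitened)
    top = np.argsort(eigvals)[::-1][:orders]
    # Map eigenvectors back to term space: a = L^{-T} v
    return np.linalg.solve(chol.T, eigvecs[:, top])


def distinctiveness(target_vecs, factors):
    """Step (d): per-order squared projections and their total."""
    per_order = (target_vecs @ factors) ** 2
    return per_order, per_order.sum(axis=1)


def most_distinctive_segment(target, comparison, vocab, orders=2):
    """Step (e): index of the target segment with the largest total score."""
    t_vecs = segment_vectors(target, vocab)
    c_vecs = segment_vectors(comparison, vocab)
    factors = topic_difference_factors(
        squared_sum_matrix(t_vecs), squared_sum_matrix(c_vecs), orders
    )
    per_order, total = distinctiveness(t_vecs, factors)
    return int(np.argmax(total)), per_order, total


if __name__ == "__main__":
    vocab = ["cat", "dog", "rocket"]
    comparison = ["cat dog", "dog cat cat"]
    target = ["cat dog", "rocket rocket cat", "dog dog"]
    best, per_order, total = most_distinctive_segment(target, comparison, vocab)
    print("most distinctive target segment:", best)
    print("total degrees of distinctiveness:", total)
```

In this toy input the term "rocket" appears only in the target document, so the segment that contains it receives by far the largest total score. Selecting by the total is only one of the two options in step (e); selecting on the per-order scores would use the `per_order` array instead.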
