×

Method and means of matching documents based on text genre

  • US 6,178,417 B1
  • Filed: 06/29/1998
  • Issued: 01/23/2001
  • Est. Priority Date: 06/29/1998
  • Status: Expired due to Term
First Claim
Patent Images

1. A computer implemented method of developing text genre from a collection of documents, the method comprising the steps of:

  • (a) extracting at least one key string from one document;

    (b) extracting at least one key string from another document;

    (c) forming a sequence of matching strings therefrom which preserve reading order;

    (d) using a confusion class for each character of each extracted string;

    (e) finding the longest common subsequence of matching strings to form an initial estimate of text genre; and

    (f) repeating steps (b) to (e) until a definition of the text genre is developed that captures the spatial structure of key strings as an LCS (longest common sequence) of matching key string sequences.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×