×

Method for standardizing phrasing in a document

  • US 6,098,034 A
  • Filed: 03/18/1996
  • Issued: 08/01/2000
  • Est. Priority Date: 03/18/1996
  • Status: Expired due to Term
First Claim
Patent Images

1. A method of extracting phrases in a document, which comprise the steps of:

  • extracting phrases of a document to automatically create a preliminary list of extracted phrases;

    filtering the preliminary list of extracted phrases to create a final list of extracted phrases;

    extracting candidate phrases of the document which are similar to extracting phrases contained in the final list of extracted phrases;

    confirming whether a candidate phrase of the document is sufficiently proximate to the extracted phrase to constitute an approximate phrase by calculating an edit distance of the candidate phrases based on two distinct cost functions, a first one relating to a semantic significance and role of a text of the document, and a second one elating to operations performed on the text of the document; and

    computing a phrase substitution to determine the appropriate conformation of one of the extracted phrase to the approximate phrase and the approximate phrase to the extracted phrase.

View all claims
  • 6 Assignments
Timeline View
Assignment View
    ×
    ×