
Code, system and method for representing a natural-language text in a form suitable for text manipulation

  • US 7,386,442 B2
  • Filed: 07/01/2003
  • Issued: 06/10/2008
  • Est. Priority Date: 07/03/2002
  • Status: Expired due to Fees
First Claim

1. A computer-executed method for representing a natural-language document in a vector form suitable for text manipulation operations, comprising
(a) for each of a plurality of terms selected from one of (i) non-generic words in the document, (ii) proximately arranged word groups in the document, and (iii) a combination of (i) and (ii), determining a selectivity value calculated as the frequency of occurrence of said each term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively, and
(b) representing the document as a vector of terms, where the coefficient assigned to each term is a function of the selectivity value determined for said each term.
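
The two steps of the claim can be illustrated with a minimal sketch. It is not the patented implementation, and several choices in it are assumptions made only for illustration: terms are limited to single non-generic words (option (i)), "frequency of occurrence" is taken to be the fraction of texts in a library that contain the term, the ratio is protected from division by zero with a hypothetical `floor` parameter, and the coefficient is simply the selectivity value itself. The names `GENERIC_WORDS`, `doc_frequency`, `selectivity` and `document_vector` are likewise hypothetical.

```python
import re

# Assumed stop list standing in for "generic words"; not taken from the patent.
GENERIC_WORDS = {"a", "an", "the", "of", "in", "and", "is", "to", "for"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def doc_frequency(term, library):
    """Fraction of texts in `library` that contain `term` (assumed frequency measure)."""
    return sum(term in set(tokenize(text)) for text in library) / len(library)


def selectivity(term, field_library, other_library, floor=0.01):
    """Step (a): frequency in one field relative to frequency in another field.

    `floor` is an assumed smoothing value that avoids division by zero.
    """
    in_field = doc_frequency(term, field_library)
    elsewhere = max(doc_frequency(term, other_library), floor)
    return in_field / elsewhere


def document_vector(document, field_library, other_library):
    """Step (b): map each non-generic word to a coefficient.

    Here the coefficient is the selectivity value itself (an assumed choice of function).
    """
    terms = {w for w in tokenize(document) if w not in GENERIC_WORDS}
    return {t: selectivity(t, field_library, other_library) for t in sorted(terms)}


# Toy libraries, invented purely for this example.
optics_library = ["the laser emits a coherent beam", "a laser diode and a lens"]
other_library = ["the patient received a dose", "a beam of wood supports the roof"]

vector = document_vector("the laser beam", optics_library, other_library)
print(vector)
```

In this toy setup a word such as "laser", which appears only in the optics library, receives a much larger coefficient than "beam", which is equally common in both libraries, so field-specific terms dominate the vector.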
