×

METHOD FOR DOMAIN IDENTIFICATION OF DOCUMENTS IN A DOCUMENT DATABASE

  • US 20060206483A1
  • Filed: 05/05/2006
  • Published: 09/14/2006
  • Est. Priority Date: 10/27/2004
  • Status: Active Grant
First Claim
Patent Images

1. A method for processing a plurality of documents in a document database comprising:

  • determining vocabulary words for each document of the plurality thereof;

    determining a respective relevancy for each vocabulary word based upon occurrences thereof in the plurality of documents;

    determining similarities between the plurality of documents based upon the vocabulary words and their respective relevancies; and

    determining at least one domain identification for documents based upon the determined similarities.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×