Method and system for discovering significant subsets in collection of documents

  • US 7,360,686 B2
  • Filed: 05/11/2005
  • Issued: 04/22/2008
  • Est. Priority Date: 05/11/2005
  • Status: Expired due to Fees
First Claim

1. A method of discovering a subset in a collection of documents, comprising:

  • identifying a set of documents from a plurality of documents based on a likelihood that documents in said set of documents carry an instance of information that is characteristic to the documents in said set of documents;

    analyzing a first document in the collection of documents to determine a characteristic feature of said first document;

    generating a profile of said first document based on said characteristic feature; and

    comparing a subsequent document in the collection of documents to said profile,

    wherein said set of documents comprises a cluster of documents in said plurality of documents, and

    wherein when said subsequent document matches said profile, said subsequent document is included in said set of documents and a next subsequent document is compared at least to said subsequent document.
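
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not say what a "characteristic feature" is or how a match is decided, so the token-set feature, the Jaccard similarity test, the `threshold` and `top_k` parameters, and every function name below are assumptions chosen only for illustration.

```python
from collections import Counter


def characteristic_feature(text, top_k=5):
    """Hypothetical feature: the set of the top_k most frequent lowercase tokens."""
    counts = Counter(text.lower().split())
    return frozenset(token for token, _ in counts.most_common(top_k))


def jaccard(a, b):
    """Jaccard similarity of two sets (assumed stand-in for 'matches said profile')."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def discover_subset(documents, threshold=0.5, top_k=5):
    """Return indices of documents that cluster with the first document.

    The first document is analyzed and its feature becomes the profile.
    Each later document is compared to the profile and to the most
    recently included document; on a match it joins the set and becomes
    the document that the next one is also compared against.
    """
    if not documents:
        return []

    profile = characteristic_feature(documents[0], top_k)
    cluster = [0]
    references = [profile]  # features a new document is compared against

    for index in range(1, len(documents)):
        feature = characteristic_feature(documents[index], top_k)
        if any(jaccard(feature, ref) >= threshold for ref in references):
            cluster.append(index)
            # Keep the original profile and add the newly included document.
            references = [profile, feature]
    return cluster


docs = [
    "invoice total amount due",
    "invoice total amount paid",
    "weather report sunny today",
    "invoice amount paid late",
]
result = discover_subset(docs)
```

In this example the fourth document shares too few tokens with the first document's profile to match it directly, but it matches the second document, which was included earlier. That is the chaining behavior described in the final "wherein" clause.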

  • 1 Assignment