×

Information retrieval systems with duplicate document detection and presentation functions

  • US 20060041597A1
  • Filed: 05/05/2005
  • Published: 02/23/2006
  • Est. Priority Date: 08/23/2004
  • Status: Active Grant
First Claim
Patent Images

1. An information-retrieval system comprising:

  • a plurality of databases; and

    one or more servers for facilitating client access to the plurality of databases over a network, with each of the servers including at least one of;

    signature-generation means for generating a plurality of document signatures, with each document signature based on a plurality of features from a corresponding document in one or more of the databases;

    query-definition means for defining a query and selecting an option related to identification of search-result documents that include content duplicative of one or more other search-result documents;

    duplicate-determination means for determining, based on a subset of the document signatures, whether one or more documents within results of the query include content duplicative of content in one or more other documents within the results;

    means for controlling display of results of the query based on the selected option, with at least one of the displayed results indicated as including content duplicative of content in one or more other documents within the results; and

    means for controlling output of results of the query to a printer or email transmission device, based on user selected options related to output of documents that include content duplicative of content of one or more other documents within the results.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×