×

Semi-automatic index term augmentation in document retrieval

  • US 9,275,130 B2
  • Filed: 10/24/2013
  • Issued: 03/01/2016
  • Est. Priority Date: 03/31/1999
  • Status: Expired due to Term
First Claim
Patent Images

1. A non-transitory machine-readable medium having executable instructions to cause one or more processing units to perform a method to assign terms to a first document, the method comprising:

  • selecting the first document;

    generating a query containing at least one term, from the selected first document;

    applying the generated query to a plurality of documents to define a subset of the plurality of documents, wherein the defined subset of the plurality of documents constitutes those documents of the plurality of documents that contain the at least one term and meet a predetermined threshold of query relevance;

    determining additional terms based on the defined subset of the plurality of documents, including;

    determining a co-occurrence metric of the at least one term from the selected first document with each term of the defined subset of the plurality of documents,determining a frequency score using the determined co-occurrence metric for each term of the defined subset of the plurality of documents, andselecting a subset of terms of the defined subset of the plurality of documents based on the determined frequency score for each term;

    assigning the selected subset of terms to the selected first document; and

    storing the selected first document in a storage system that is remotely accessible via a network.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×