System and method for content-based object ranking to facilitate information lifecycle management

  • US 7,996,409 B2
  • Filed: 12/28/2006
  • Issued: 08/09/2011
  • Est. Priority Date: 12/28/2006
  • Status: Expired due to Fees
First Claim

1. A method to manage objects in an information lifecycle management system, comprising:

  • retrieving objects from the information lifecycle management system;

    determining a score for each of the retrieved objects based on a score of at least one feature within respective ones of each of the retrieved objects, comprising:

    associating a valuation of the at least one feature with the score of the at least one feature, wherein the valuation of the at least one feature is based on a statistical importance of the feature;

    associating the score of the at least one feature with an importance of the at least one feature within a particular one of the retrieved objects and the importance of the feature within a population of objects to be managed in the information lifecycle management system;

    correlating the score for each of the retrieved objects to a value for each of the retrieved objects;

    determining a number of scores of the at least one feature to be included in the score for each of the retrieved objects;

    the determining of a number of scores of the at least one feature comprising at least one of the following steps:

    selecting a single feature having a maximum score for a selected retrieved object;

    selecting a particular number of top scoring features; and

    identifying a number of scores with Kullback-Leibler divergence;

    summing a predetermined number of top scores of features in each of the retrieved objects;

    determining a first set of statistics for each of the at least one feature based on information content associated with respective ones of each of the retrieved objects;

    determining a second set of statistics for each of the at least one feature based on information content associated with the population of objects to be managed in the information lifecycle management system;

    determining the score for each feature based on the first set of statistics and the second set of statistics; and

    determining the score for each of the retrieved objects based on the score for each feature in respective ones of each of the retrieved objects;

    managing each of the retrieved objects based on the score for each of the retrieved objects; and

    the managing each of the retrieved objects comprises:

    managing each of the retrieved objects in an information lifecycle engine;

    determining from the score for each of the retrieved objects a particular type of storage device that each of the retrieved objects is associated with;

    storing the retrieved object in tier one storage when a score for the retrieved object is above a specified value;

    storing the retrieved object in a lower tier storage when the score for the retrieved object is below the specified value;

    determining from the score for each of the retrieved objects how many copies of each of the retrieved objects to store and where to store each of the retrieved objects;

    retrieving each of the objects from the information lifecycle management system at a priority based on the score for each of the retrieved objects;

    retrieving each of the objects from the information lifecycle management system in an order based on the score for each of the retrieved objects;

    preferentially managing higher scored retrieved objects;

    determining if it is time to re-evaluate a retrieved object;

    identifying retrieved objects to re-evaluate;

    reassigning a score to a retrieved object to be re-evaluated;

    managing a re-evaluated retrieved object based on a reassigned score, comprising:

    moving a re-evaluated retrieved object from tier 2 storage to tier 1 storage when a reassigned score increases; and

    moving a re-evaluated retrieved object from tier 1 storage to tier 2 storage when a reassigned score decreases;

    wherein the at least one feature within each of the retrieved objects occur in the metadata associated with each of the retrieved objects, and the retrieved object is one of a file, a document, a record, a table, or a database.
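
The scoring and tiering steps recited in claim 1 can be illustrated with a minimal Python sketch. This is not the patented implementation. It assumes a TF-IDF-style weight as a stand-in for the unspecified "statistical importance" of a feature, and the names `feature_scores`, `object_score`, `assign_tier`, `top_k`, and `threshold` are hypothetical. Objects are modeled as lists of metadata features, and only the "particular number of top scoring features" option is shown.

```python
import math
from collections import Counter


def feature_scores(obj_features, population):
    """Score each feature of one object against a population of objects.

    Assumption: a TF-IDF-style weight stands in for the claim's
    "statistical importance"; the claim does not fix a formula.
    """
    counts = Counter(obj_features)
    n_objects = len(population)
    scores = {}
    for feature, count in counts.items():
        # First set of statistics: frequency of the feature within this object.
        local = count / len(obj_features)
        # Second set of statistics: number of population objects containing it.
        doc_freq = sum(1 for other in population if feature in other)
        # Smoothed inverse document frequency: rarer features weigh more.
        scores[feature] = local * math.log((1 + n_objects) / (1 + doc_freq))
    return scores


def object_score(obj_features, population, top_k=3):
    """Sum the top_k feature scores (hypothetical parameter) for one object."""
    scores = feature_scores(obj_features, population)
    return sum(sorted(scores.values(), reverse=True)[:top_k])


def assign_tier(score, threshold):
    """Map a score to a storage tier.

    The claim covers scores above and below the specified value;
    sending an exactly equal score to tier 2 is an assumption.
    """
    return "tier1" if score > threshold else "tier2"
```

Re-evaluation, as recited in the claim, would amount to calling `object_score` again at a later time and moving the object if `assign_tier` returns a different tier for the reassigned score.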
