ID persistence through normalization

  • US 7,991,797 B2
  • Filed: 02/17/2006
  • Issued: 08/02/2011
  • Est. Priority Date: 02/17/2006
  • Status: Active Grant
First Claim

1. A computer-implemented method for maintaining object ID persistence in a collection of data, comprising:

  • at a computer system including one or more processors and memory storing one or more programs, the one or more processors executing the one or more programs to perform the operations of:

    selecting a first object from the collection of data having a first object ID, wherein a first fact comprising an associated object ID is associated with the first object, the collection of data includes a plurality of objects and a plurality of facts associated with the objects, each fact comprises an attribute-value pair, and the plurality of facts are extracted from a plurality of web documents;

    selecting a second object from the collection of data having a second object ID;

    performing a heuristic comparison on the first object and the second object to determine if the first object and the second object refer to a same entity;

    responsive to determining that the first object and the second object refer to the same entity, associating with the first object a forwarding reference to the second object, so that the second object can be referenced using the first object ID;

    dissociating the first fact from the first object; and

    associating the first fact with the second object by setting the associated object ID of the first fact to the second object ID, so that the first fact is merged with facts for the second object; and

    responsive to receiving an external reference to the first object, identifying that the first object includes a forwarding reference to the second object; and

    retrieving the second object.
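
The steps recited in the claim can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the names `Fact`, `DataObject`, `Repository`, `merge`, and `resolve` are hypothetical, and the claim does not specify the heuristic comparison, so a simple match on a shared "name" value stands in for it here.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set


@dataclass
class Fact:
    object_id: str  # the "associated object ID" of the claim
    attribute: str  # attribute half of the attribute-value pair
    value: str      # value half of the attribute-value pair


@dataclass
class DataObject:
    object_id: str
    forward_to: Optional[str] = None  # forwarding reference, set on merge


class Repository:
    """Hypothetical in-memory collection of objects and their facts."""

    def __init__(self) -> None:
        self.objects: Dict[str, DataObject] = {}
        self.facts: List[Fact] = []

    def add_fact(self, object_id: str, attribute: str, value: str) -> None:
        # Create the object on first use, then record the fact against it.
        self.objects.setdefault(object_id, DataObject(object_id))
        self.facts.append(Fact(object_id, attribute, value))

    def facts_for(self, object_id: str) -> List[Fact]:
        return [f for f in self.facts if f.object_id == object_id]

    def _names(self, object_id: str) -> Set[str]:
        return {f.value.strip().lower()
                for f in self.facts_for(object_id) if f.attribute == "name"}

    def same_entity(self, first_id: str, second_id: str) -> bool:
        # Placeholder heuristic (an assumption, not from the claim):
        # two objects match if they share a case-insensitive "name" value.
        return bool(self._names(first_id) & self._names(second_id))

    def merge(self, first_id: str, second_id: str) -> bool:
        if first_id == second_id or not self.same_entity(first_id, second_id):
            return False
        # Leave a forwarding reference so the first object ID stays valid.
        self.objects[first_id].forward_to = second_id
        # Re-point each fact of the first object at the second object ID.
        for fact in self.facts_for(first_id):
            fact.object_id = second_id
        return True

    def resolve(self, object_id: str) -> DataObject:
        # Follow forwarding references; the seen set guards against cycles.
        seen: Set[str] = set()
        obj = self.objects[object_id]
        while obj.forward_to is not None and obj.object_id not in seen:
            seen.add(obj.object_id)
            obj = self.objects[obj.forward_to]
        return obj


repo = Repository()
repo.add_fact("id1", "name", "Example Corp")
repo.add_fact("id1", "founded", "1999")
repo.add_fact("id2", "name", "example corp")
repo.merge("id1", "id2")
print(repo.resolve("id1").object_id)  # prints "id2"
```

In this sketch the first object is kept as a stub that carries only the forwarding reference, so an external reference that still uses the old ID resolves to the merged object rather than failing.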
