Method and system for identifying object information
First Claim
1. A method in a computer system for identifying an object of an information page, the method comprising:
- identifying an object block of the information page representing the object;
identifying object elements of the identified object block;
classifying the object block as an object type, the object type having attributes;
labeling the identified object elements as attributes of the object type;
providing an object data store having objects of the object type;
determining whether the object matches an object of the object data store based on the labeling of the object elements;
when the determining indicates a match, updating the matching object of the object data store based on the labeling of the object elements; and
when the determining does not indicate a match, adding an object to the object data store based on the labeling of the object elements.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for identifying object information of an information page is provided. An information extraction system identifies the object blocks of an information page. The extraction system classifies the object blocks into object types. Each object type has associated attributes that define a schema for the information of the object type. The extraction system identifies object elements within an object block that may represent an attribute value for the object. After the object elements are identified, the extraction system attempts to identify which object elements correspond to which attributes of the object type in a process referred to as “labeling.” The extraction system uses an algorithm to determine the confidence that a certain object element corresponds to a certain attribute. The extraction system then selects the set of labels with the highest confidence as being the labels for the object elements.
34 Citations
18 Claims
-
1. A method in a computer system for identifying an object of an information page, the method comprising:
-
identifying an object block of the information page representing the object; identifying object elements of the identified object block; classifying the object block as an object type, the object type having attributes; labeling the identified object elements as attributes of the object type; providing an object data store having objects of the object type; determining whether the object matches an object of the object data store based on the labeling of the object elements; when the determining indicates a match, updating the matching object of the object data store based on the labeling of the object elements; and when the determining does not indicate a match, adding an object to the object data store based on the labeling of the object elements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A computer-readable medium containing computer-executable instructions for controlling a computer system to identify an object of an object block, by a method comprising:
-
identifying object elements of the object block, the object block having an object type with attributes; labeling the identified object elements as attributes of the object type, the labeling including comparing the identified object elements to attribute values of objects of an object store; determining whether the object matches an object of the object data store based on the labeling of the object elements; when the determining indicates a match, updating the matching object of the object data store based on the labeling of the object elements; and when the determining does not indicate a match, adding an object to the object data store based on the labeling of the object elements. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A computer-readable medium containing computer-executable instructions for controlling a computer system to label object elements of an object block, by a method comprising:
-
providing an object data store of objects having attributes; determining labels for the object elements by comparing the object elements to attribute values of objects of the object store and generating constraints based on confidence that an object element should be labeled as an attribute; determining whether the object of the object block matches an object of the object data store based on the determined labels of the object elements; when the determining indicates a match, updating the matching object of the object data store based on the determined labels of the object elements; and when the determining does not indicate a match, adding an object to the object data store based on the determined labels of the object elements. - View Dependent Claims (17, 18)
-
Specification