Method, system and computer product for classifying web content nodes based on relationship scores derived from mapping content nodes, topical seed nodes and evaluation nodes
First Claim
Patent Images
1. A method comprising:
- assigning one or more source score values to one or more seed nodes included in a seed set of nodes, wherein the seed set of nodes is representative of a topic;
deriving a destination score value for a first web node based on a mapping of a reachability relationship between one or more seed nodes and the first web node;
deriving a source score value for the first web node based on a mapping of a reachability relationship between the first web node and one or more evaluation nodes, each of the one or more evaluation nodes having, respectively, a derived destination score;
determining, based at least in part on at least one of the destination score value of the first web node and the source score value of the first web node, that additional processing should be performed on the first web node;
determining that additional processing should not be performed on a second web node; and
classifying the content of the first web node.
4 Assignments
0 Petitions
Accused Products
Abstract
Determining the relevance of a web node is disclosed. A seed score value of a first type is assigned to a seed set of nodes. A score value of a second type is derived for the web node based on a mapping of a reachability relationship between one or more seed nodes and the web node. A score value of the first type is derived for the web node based on a mapping of a reachability relationship between the web node and one or more evaluation nodes having derived weight values of the second type. Content analysis is performed.
36 Citations
14 Claims
-
1. A method comprising:
-
assigning one or more source score values to one or more seed nodes included in a seed set of nodes, wherein the seed set of nodes is representative of a topic; deriving a destination score value for a first web node based on a mapping of a reachability relationship between one or more seed nodes and the first web node; deriving a source score value for the first web node based on a mapping of a reachability relationship between the first web node and one or more evaluation nodes, each of the one or more evaluation nodes having, respectively, a derived destination score; determining, based at least in part on at least one of the destination score value of the first web node and the source score value of the first web node, that additional processing should be performed on the first web node; determining that additional processing should not be performed on a second web node; and classifying the content of the first web node. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system comprising:
-
a processor configured to; assign one or more source score values to one or more seed nodes included in a seed set of nodes, wherein the seed set of nodes is representative of a topic; derive a destination score value for a first web node based on a mapping of a reachability relationship between one or more seed nodes and the first web node; derive a source score value for the first web node based on a mapping of a reachability relationship between the first web node and one or more evaluation nodes, each of the one or more evaluation nodes having, respectively, a derived destination score; determine, based at least in part on at least one of the destination score value of the first web node and the source score value of the first web node, that additional processing should be performed on the first web node; determine that additional processing should not be performed on a second web node; and classify the content of the first web node; and a memory coupled with the processor, wherein the memory provides the processor with instructions. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A computer program product embodied in a computer readable medium and comprising computer instructions for:
-
assigning one or more source score values to one or more seed nodes included in a seed set of nodes, wherein the seed set of nodes is representative of a topic; deriving a destination score value for a first web node based on a mapping of a reachability relationship between one or more seed nodes and the first web node; deriving a source score value for the first web node based on a mapping of a reachability relationship between the first web node and one or more evaluation nodes, each of the one or more evaluation nodes having, respectively, a derived destination score; determining, based at least in part on at least one of the destination score value of the first web node and the source score value of the first web node, that additional processing should be performed on the first web node; determining that additional processing should not be performed on a second web node; and classifying the content of the first web node. - View Dependent Claims (14)
-
Specification