Content category scoring for nodes in a linked database
First Claim
1. An information retrieval system comprising a processor for executing at least one program module for scoring nodes in a linked database based on a plurality of content categories, said nodes including linking nodes and linked nodes, wherein at least one of the linking nodes is a seed node, the seed node having an assigned seed node score for each content category, said at least one program module being stored in a system memory and comprising computer-executable instructions for:
- identifying a plurality of linking nodes in the linked database and assigning to each linking node a linking node score for each content category, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category;
identifying a plurality of linked nodes in the linked database, the linked nodes being linked to by at least one of the linking nodes;
assigning to each linked node a content category score for each content category, wherein the content category score for each content category is based on the linking node score for said content category of each linking node that links to the linked node;
identifying at least one unique keyword in the seed node;
for each identified unique keyword, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of unique keywords in the seed node;
determining that a selected content category score for a selected linked node in a selected content category does not exceed a threshold value; and
replacing the selected content category score with a leaf node score that is based on an average of the keyword scores in the selected content category assigned to any of the unique keywords contained in the selected node.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems, methods and computer program products are provided for assigning content category scores to nodes of a linked database. The nodes of the linked database include linking nodes and linked nodes. Each linking node is assigned a linking node score for each content category. The linking node score for each content category represents a degree of relevancy of the linking node to the content category. Each of the linked nodes is linked to by at least one of the linking nodes. Each linked node is assigned a content category score for each content category. The content category score for each content category is based on the linking node score for said content category of each linking node that links to the linked node.
8 Citations
15 Claims
-
1. An information retrieval system comprising a processor for executing at least one program module for scoring nodes in a linked database based on a plurality of content categories, said nodes including linking nodes and linked nodes, wherein at least one of the linking nodes is a seed node, the seed node having an assigned seed node score for each content category, said at least one program module being stored in a system memory and comprising computer-executable instructions for:
-
identifying a plurality of linking nodes in the linked database and assigning to each linking node a linking node score for each content category, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; identifying a plurality of linked nodes in the linked database, the linked nodes being linked to by at least one of the linking nodes; assigning to each linked node a content category score for each content category, wherein the content category score for each content category is based on the linking node score for said content category of each linking node that links to the linked node; identifying at least one unique keyword in the seed node; for each identified unique keyword, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of unique keywords in the seed node; determining that a selected content category score for a selected linked node in a selected content category does not exceed a threshold value; and replacing the selected content category score with a leaf node score that is based on an average of the keyword scores in the selected content category assigned to any of the unique keywords contained in the selected node. - View Dependent Claims (2, 3, 4)
-
-
5. An information retrieval system comprising a processor for executing at least one program module for scoring nodes in a linked database based on a plurality of content categories, said nodes including linking nodes and linked nodes, wherein a plurality of the linking nodes are seed nodes, each seed node having an assigned seed node score for each content category, wherein the at least one program module is stored in a system memory and comprises computer-executable instructions for:
-
identifying a plurality of linking nodes in the linked database and assigning to each linking node a linking node score for each content category, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; identifying a plurality of linked nodes in the linked database, the linked nodes being linked to by at least one of the linking nodes; assigning to each linked node a content category score for each content category, wherein the content category score for each content category is based on the linking node score for said content category of each linking node that links to the linked node; identifying at least one unique keyword in each seed node; for each identified unique keyword in each seed node, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of unique keywords in the seed node; determining an aggregate keyword score for each content category by aggregating the keyword scores for each content category of all of the seed nodes; determining that a selected content category score for a selected linked node in a selected content category does not exceed a threshold value; and replacing the selected content category score with a leaf node score that is based on an average of the aggregate keyword scores in the selected content category assigned to any of the unique keywords contained in the node.
-
-
6. A computer implemented method for scoring a node in a linked database, the node being linked to by a plurality of linking nodes, the method comprising the steps of:
-
for each linking node, determining a linking node score for each of a plurality of content categories, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; assigning to the node a content category score for each of the plurality of content categories, wherein the content category score for each content category is based on each linking node score for said content category; identifying a plurality of unique keyword in a seed node having an assigned seed node score for each content category; for each identified unique keyword, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of unique keywords in the seed node; and if the content category score in any content category does not exceed a threshold value, replacing said content category score with a leaf node score that is based on an average of the keyword scores in said content category assigned to any of the unique keywords contained in the node. - View Dependent Claims (7, 8, 9)
-
-
10. A computer implemented method for scoring a node in a linked database, the node being linked to by a plurality of linking nodes, wherein a plurality of the linking nodes are seed nodes, each seed node having an assigned seed node score for each content category and wherein the method comprises the steps of:
-
for each linking node, determining a linking node score for each of a plurality of content categories, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; assigning to the node a content category score for each of the plurality of content categories, wherein the content category score for each content category is based on each linking node score for said content category identifying a plurality of keywords in each seed node having an assigned seed node score; for each identified keyword in each seed node, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of keywords in the seed node; determining an aggregate keyword score for each content category by aggregating the keyword scores for each content category of all of the seed nodes; and if the content category score in any content category of the node does not exceed a threshold value, replacing said content category score with the leaf node score that is based on an average of the aggregate keyword scores in said content category assigned to any of the keywords contained in the node.
-
-
11. A computer program product having stored thereon at least one program module for scoring a node in a linked database, the node being linked to by a plurality of linking nodes, the at least one program module comprising computer executable instructions for:
-
for each linking node, determining a linking node score for each of a plurality of content categories, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; assigning to the node a content category score for each of the plurality of content categories, wherein the content category score for each content category is based on each linking node score for said content category; identifying a plurality of unique keyword in a seed node having an assigned seed node score; for each identified unique keyword, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of unique keywords in the seed node; and if the content category score in any content category does not exceed a threshold value, replacing said content category score with a leaf node score that is based on an average of the keyword scores in said content category assigned to any of the unique keywords contained in the node. - View Dependent Claims (12, 13, 14)
-
-
15. A computer program product having stored thereon at least one program module for scoring a node in a linked database, the node being linked to by a plurality of linking nodes, wherein a plurality of the linking nodes are seed nodes, each seed node having an assigned seed node score for each content category and wherein the at least one program module comprises computer executable instructions for:
-
for each linking node, determining a linking node score for each of a plurality of content categories, wherein the linking node score for each content category represents a degree of relevancy of the linking node to the content category; assigning to the node a content category score for each of the plurality of content categories, wherein the content category score for each content category is based on each linking node score for said content category; identifying a plurality of keywords in each seed node having an assigned seed node score; for each identified keyword in each seed node, determining a keyword score for each content category, wherein the keyword score for each content category is based on the seed node score for said content category divided by the number of keywords in the seed node; determining an aggregate keyword score for each content category by aggregating the keyword scores for each content category of all of the seed nodes; and if the content category score in any content category of the node does not exceed a threshold value, replacing said content category score with the leaf node score that is based on an average of the aggregate keyword scores in said content category assigned to any of the keywords contained in the node.
-
Specification