Categorizing network resources and extracting user interests from network activity
First Claim
1. A method for network resource classification, comprising:
obtaining a hierarchy of categories for classifying a plurality of network resources, where each category is assigned a text item describing the category;
obtaining a plurality of resource description data collections corresponding to the plurality of network resources, wherein the plurality of resource description data collections comprise a first resource description data collection corresponding to a first network resource of the plurality of network resources;
generating, by a computer processor and using a semantic correlation algorithm, a first category score vector of the first network resource by comparing the first resource description data collection to the text item assigned to each category in the hierarchy of categories, wherein the first category score vector comprises a category score for each category in the hierarchy of categories, wherein the category score is determined based on at least a semantic correlation measure between the first resource description data collection and the text item assigned to a corresponding category, wherein the plurality of network resources are classified based at least on the category score;
analyzing a network trace associated with a user to identify the plurality of network resources accessed by the user;
generating, based on a pre-determined criterion, a relationship graph comprising:
  a plurality of nodes representing the plurality of network resources, and
  a plurality of edges representing a measure of cross-references between the plurality of resource description data collections; and
adjusting, based on the relationship graph, the first category score vector to generate a first adjusted category score vector using at least another category score vector of another network resource of the plurality of network resources.
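The last two steps of the claim (building the relationship graph and adjusting a score vector from a neighbor's vector) can be illustrated with a small Python sketch. It assumes, purely for illustration, that the pre-determined criterion is a minimum cross-reference count and that the adjustment blends a resource's vector with the edge-weighted mean of its neighbors' vectors. The claim fixes neither choice, and every name and value below is hypothetical.

```python
def build_graph(cross_refs, min_refs=1):
    """Keep an undirected edge only when its cross-reference count meets the threshold.

    The threshold stands in for the claim's unspecified "pre-determined criterion".
    """
    graph = {}
    for (a, b), count in cross_refs.items():
        if count >= min_refs:
            graph.setdefault(a, {})[b] = count
            graph.setdefault(b, {})[a] = count
    return graph


def adjust(resource, vectors, graph, alpha=0.5):
    """Blend a resource's score vector with the weighted mean of its neighbors' vectors."""
    own = vectors[resource]
    neighbors = graph.get(resource, {})
    total = sum(neighbors.values())
    if total == 0:
        # No neighbors in the graph: return the vector unchanged.
        return dict(own)
    adjusted = {}
    for cat, score in own.items():
        neighbor_mean = sum(w * vectors[n][cat] for n, w in neighbors.items()) / total
        adjusted[cat] = alpha * score + (1 - alpha) * neighbor_mean
    return adjusted


# Hypothetical category score vectors and cross-reference counts.
vectors = {
    "a.example": {"sports": 0.2, "finance": 0.6},
    "b.example": {"sports": 0.8, "finance": 0.0},
    "c.example": {"sports": 0.4, "finance": 0.4},
}
cross_refs = {("a.example", "b.example"): 3, ("a.example", "c.example"): 1}

graph = build_graph(cross_refs, min_refs=2)
adjusted = adjust("a.example", vectors, graph)
```

With `min_refs=2` only the edge between `a.example` and `b.example` survives, so the adjusted vector for `a.example` moves toward that of `b.example`.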
Abstract
A method for network resource classification and for identifying user interests based on the classification. The method uses a provided hierarchy of categories for classifying network resources, wherein each category is assigned a text item describing the category. The method includes obtaining resource description data collections corresponding to the network resources and generating, using a semantic correlation algorithm, a category score vector of a network resource by comparing its resource description data collection to the text item assigned to each category in the hierarchy of categories. The category score vector comprises a category score for each category in the hierarchy of categories, and each category score is determined based on at least a semantic correlation measure between the resource description data collection and the text item assigned to the corresponding category. The network resources are classified based at least on the category score.
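As an illustration of the category score vector described in the abstract, the following minimal Python sketch scores one resource description against the text item of each category. It assumes cosine similarity over bag-of-words term counts as a stand-in for the semantic correlation measure, which the abstract does not specify. The category names, text items, and function names are hypothetical.

```python
import math
import re
from collections import Counter


def term_counts(text):
    """Lowercase the text and count its alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def category_score_vector(description, category_texts):
    """Return one score per category, keyed by category name."""
    desc = term_counts(description)
    return {name: cosine(desc, term_counts(text)) for name, text in category_texts.items()}


# Hypothetical hierarchy flattened to "parent/child" names, each with a text item.
CATEGORIES = {
    "sports": "sports games teams athletes scores",
    "sports/soccer": "soccer football league goal match",
    "finance": "finance stocks banking investment markets",
}

scores = category_score_vector("Live soccer match scores and league tables", CATEGORIES)
best = max(scores, key=scores.get)
```

Here `scores` holds one entry per category, and `best` is the category whose text item overlaps most with the description under this simplified measure.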
30 Citations
27 Claims
1. A method for network resource classification, comprising:
obtaining a hierarchy of categories for classifying a plurality of network resources, where each category is assigned a text item describing the category;
obtaining a plurality of resource description data collections corresponding to the plurality of network resources, wherein the plurality of resource description data collections comprise a first resource description data collection corresponding to a first network resource of the plurality of network resources;
generating, by a computer processor and using a semantic correlation algorithm, a first category score vector of the first network resource by comparing the first resource description data collection to the text item assigned to each category in the hierarchy of categories, wherein the first category score vector comprises a category score for each category in the hierarchy of categories, wherein the category score is determined based on at least a semantic correlation measure between the first resource description data collection and the text item assigned to a corresponding category, wherein the plurality of network resources are classified based at least on the category score;
analyzing a network trace associated with a user to identify the plurality of network resources accessed by the user;
generating, based on a pre-determined criterion, a relationship graph comprising:
  a plurality of nodes representing the plurality of network resources, and
  a plurality of edges representing a measure of cross-references between the plurality of resource description data collections; and
adjusting, based on the relationship graph, the first category score vector to generate a first adjusted category score vector using at least another category score vector of another network resource of the plurality of network resources.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for network resource classification, comprising:
a computer processor;
an inference input module executing on the computer processor and configured to:
  obtain a hierarchy of categories for classifying a plurality of network resources, where each category is assigned a text item describing the category;
  obtain a plurality of resource description data collections corresponding to the plurality of network resources, wherein the plurality of resource description data collections comprise a first resource description data collection corresponding to a first network resource of the plurality of network resources;
  analyze a network trace associated with a user to identify the plurality of network resources accessed by the user;
  generate, based on a pre-determined criterion, a relationship graph comprising:
    a plurality of nodes representing the plurality of network resources,
    a plurality of edges representing a measure of cross-references between the plurality of resource description data collections;
a category score generator executing on the computer processor and configured to:
  generate, using a semantic correlation algorithm, a first category score vector of the first network resource by comparing the first resource description data collection to the text item assigned to each category in the hierarchy of categories, wherein the first category score vector comprises a category score for each category in the hierarchy of categories, wherein the category score is determined based on at least a semantic correlation measure between the first resource description data collection and the text item assigned to a corresponding category;
  adjust, based on the relationship graph, the first category score vector to generate the first adjusted category score vector using at least another category score vector of another network resource of the plurality of network resources; and
a repository configured to store the plurality of resource description data collections and the relationship graph,
wherein the plurality of network resources are classified based at least on the category score.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A non-transitory computer readable medium embodying instructions for network resource classification, the instructions when executed by a processor comprising functionality for:
obtaining a hierarchy of categories for classifying a plurality of network resources, where each category is assigned a text item describing the category;
obtaining a plurality of resource description data collections corresponding to the plurality of network resources, wherein the plurality of resource description data collections comprise a first resource description data collection corresponding to a first network resource of the plurality of network resources;
generating, using a semantic correlation algorithm, a first category score vector of the first network resource by comparing the first resource description data collection to the text item assigned to each category in the hierarchy of categories, wherein the first category score vector comprises a category score for each category in the hierarchy of categories, wherein the category score is determined based on at least a semantic correlation measure between the first resource description data collection and the text item assigned to a corresponding category, wherein the plurality of network resources are classified based at least on the category score;
analyzing a network trace associated with a user to identify the plurality of network resources accessed by the user;
generating, based on a pre-determined criterion, a relationship graph comprising:
  a plurality of nodes representing the plurality of network resources, and
  a plurality of edges representing a measure of cross-references between the plurality of resource description data collections; and
adjusting, based on the relationship graph, the first category score vector to generate a first adjusted category score vector using at least another category score vector of another network resource of the plurality of network resources.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27
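The trace-analysis step recited in the independent claims can be pictured with the short Python sketch below. The trace format (one record per request, holding a user identifier and a URL) and the choice to treat each hostname as a network resource are assumptions made for illustration; the claims specify neither. All names and records are hypothetical.

```python
from urllib.parse import urlsplit


def resources_accessed(trace, user):
    """Return the distinct hostnames requested by one user, in first-seen order."""
    seen = []
    for record_user, url in trace:
        host = urlsplit(url).hostname
        if record_user == user and host and host not in seen:
            seen.append(host)
    return seen


# Hypothetical trace records: (user identifier, requested URL).
TRACE = [
    ("u1", "http://news.example/sports/today"),
    ("u2", "http://shop.example/cart"),
    ("u1", "http://scores.example/live"),
    ("u1", "http://news.example/finance"),
]

hosts = resources_accessed(TRACE, "u1")
```

The resulting list of hosts would then be the set of network resources whose description data collections are scored against the category hierarchy.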
Specification