Categorizing objects, such as documents and/or clusters, with respect to a taxonomy and data structures derived from such categorization
Abstract
A Website may be automatically categorized by (a) accepting Website information, (b) determining a set of scored clusters (e.g., semantic, term co-occurrence, etc.) for the Website using the Website information, and (c) determining at least one category (e.g., a vertical category) of a predefined taxonomy using at least some of the set of clusters.
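As a rough illustration of steps (a) through (c), the sketch below assumes the scored clusters for a Website are already available as (cluster id, score) pairs and that a lookup table maps each cluster to one or more taxonomy categories. The table `CLUSTER_TO_CATEGORIES`, the function `categorize`, the ranking by summed score, and all example data are hypothetical and are not taken from the patent.

```python
from collections import defaultdict

# Hypothetical lookup table (an assumption): cluster id -> taxonomy category paths.
CLUSTER_TO_CATEGORIES = {
    "c_hiking_boots": ["/Shopping/Apparel/Footwear"],
    "c_tents": ["/Sports/Outdoors/Camping"],
    "c_trail_maps": ["/Sports/Outdoors/Hiking", "/Travel"],
}

def categorize(scored_clusters, top_n=1):
    """Return the top_n (category, score) pairs ranked by summed cluster score."""
    totals = defaultdict(float)
    for cluster_id, score in scored_clusters:
        # Clusters with no known category are simply skipped.
        for category in CLUSTER_TO_CATEGORIES.get(cluster_id, []):
            totals[category] += score
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]

# Example scored clusters for one Website (made-up values).
site_clusters = [("c_hiking_boots", 0.5), ("c_tents", 0.25), ("c_trail_maps", 0.75)]
top = categorize(site_clusters, top_n=2)
```

Here `top` holds the two best-scoring categories for the example site. How clusters are scored and how the lookup table is built are outside the scope of this sketch.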
17 Claims
1. A computer-implemented method comprising:
a) accepting, by a computer system including at least one computer, Website information associated with a Website;
b) determining, by the computer system and the accepted Website information, a set of scored clusters, wherein the score of each of the scored clusters is indicative of how conceptually significant the cluster is to the Website;
c) determining, by the computer system, at least one category of a predefined taxonomy of categories using at least some of the set of clusters, wherein categories of the predefined taxonomy are hierarchical and correspond to at least one of (A) related products that are likely to be found in Website content, (B) related services that are likely to be found in Website content, or (C) related industries that are likely to be found in Website content;
d) associating, by the computer system, the Website with the determined at least one category to create an association;
e) storing, by the computer system, the association of the Website with the determined at least one category; and
f) determining, by the computer system, an advertisement relevant to the Website at least using the stored association of the Website with the determined at least one category,
wherein the act of determining at least one category of a predefined taxonomy using at least some of the clusters includes
(A) using information of the at least some of the clusters to look up one or more categories, and
(B) for at least some of the one or more categories, determining a score based on a sum of values including (1) an intra-category cluster score of the category, and (2) intra-category cluster scores of categories that are descendants of the category in the hierarchical taxonomy.
Dependent Claims (2, 3, 4, 7, 8, 9, 10, 11, 12)
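The category score recited in element (B) of claim 1 can be sketched as a roll-up over the hierarchy. This minimal example assumes each category already has an intra-category cluster score (taken here to be a single number per category, which this section does not define in detail) and that the taxonomy is stored as a child-to-parent map. The names `PARENT` and `rolled_up_scores` and the sample values are hypothetical.

```python
from collections import defaultdict

# Hypothetical taxonomy (an assumption): category -> parent category, None for a root.
PARENT = {
    "Sports": None,
    "Sports/Outdoors": "Sports",
    "Sports/Outdoors/Camping": "Sports/Outdoors",
    "Sports/Outdoors/Hiking": "Sports/Outdoors",
    "Travel": None,
}

def rolled_up_scores(intra_scores):
    """Sum each category's own score with the scores of all its descendants."""
    totals = defaultdict(float)
    for category, score in intra_scores.items():
        node = category
        # Walk from the category up to its root, crediting the score to the
        # category itself and to every ancestor along the way.
        while node is not None:
            totals[node] += score
            node = PARENT.get(node)
    return dict(totals)

# Made-up intra-category cluster scores for one Website.
intra = {
    "Sports/Outdoors/Camping": 2.0,
    "Sports/Outdoors/Hiking": 3.0,
    "Sports/Outdoors": 1.0,
    "Travel": 0.5,
}
scores = rolled_up_scores(intra)
```

Pushing each score up to its ancestors is equivalent to summing over every category's subtree, and it avoids a separate traversal per category. In this example "Sports/Outdoors" receives its own score plus those of "Camping" and "Hiking".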
5. Apparatus comprising:
a) at least one processor; and
b) at least one storage device storing processor-executable instructions which, when executed by the at least one processor, perform a method including
1) accepting Website information,
2) determining a set of scored clusters for the Website using the Website information associated with a Website, wherein the score of each of the scored clusters is indicative of how conceptually significant the cluster is to the Website,
3) determining at least one category of a predefined taxonomy of categories using at least some of the set of clusters, wherein categories of the predefined taxonomy are hierarchical and correspond to at least one of (A) related products that are likely to be found in Website content, (B) related services that are likely to be found in Website content, or (C) related industries that are likely to be found in Website content,
4) associating the Website with the determined at least one category to create an association,
5) storing the association of the Website with the determined at least one category, and
6) determining an advertisement relevant to the Website at least using the stored association of the Website with the determined at least one category,
wherein the act of determining at least one category of a predefined taxonomy using at least some of the clusters includes
(A) using information of the at least some of the clusters to look up one or more categories, and
(B) for at least some of the one or more categories, determining a score based on a sum of values including (1) an intra-category cluster score of the category, and (2) intra-category cluster scores of categories that are descendants of the category in the hierarchical taxonomy.
6. A computer-implemented method comprising:
a) accepting, using a computer system including at least one computer, Website information associated with a Website;
b) determining, using the computer system, a set of scored clusters for the Website using the Website information;
c) determining, using the computer system, at least one category of a predefined taxonomy of categories using at least some of the set of clusters, wherein categories of the predefined taxonomy are hierarchical and correspond to related content formats that are likely to be found in Website content;
d) associating, using the computer system, the Website with the determined at least one category to create an association;
e) storing, using the computer system, the association of the Website with the determined at least one category; and
f) determining, using the computer system, an advertisement relevant to the Website at least using the stored association of the Website with the determined at least one category,
wherein the act of determining at least one category of a predefined taxonomy using at least some of the set of clusters includes
(A) using information of the at least some of the set of clusters to look up one or more categories, and
(B) for at least some of the one or more categories, determining a score based on a sum of values including (1) an intra-category cluster score of the category, and (2) intra-category cluster scores of categories that are descendants of the category in the hierarchical taxonomy.
Dependent Claims (13, 14, 15, 16, 17)
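Steps (d) through (f) of the method claims, namely associating, storing, and then using the stored association to find a relevant advertisement, might look like the following sketch. The in-memory dictionaries, the function names, the ad identifiers, and in particular the fallback from a category to its ancestors are assumptions made for illustration; the claims do not specify a storage mechanism or an ad-matching rule.

```python
# Hypothetical in-memory stores (assumptions); a real system would likely use a database.
site_categories = {}  # Website -> list of associated category paths
ADS_BY_CATEGORY = {
    "Sports/Outdoors": ["ad_tent_sale"],
    "Sports": ["ad_sports_channel"],
}

def store_association(website, categories):
    """Steps (d) and (e): associate the Website with its categories and store that."""
    site_categories[website] = list(categories)

def relevant_ads(website):
    """Step (f): pick ads using the stored association.

    Assumed rule: try the most specific category first, then each ancestor.
    """
    ads = []
    for category in site_categories.get(website, []):
        parts = category.split("/")
        for depth in range(len(parts), 0, -1):
            for ad in ADS_BY_CATEGORY.get("/".join(parts[:depth]), []):
                if ad not in ads:  # keep first occurrence, preserve order
                    ads.append(ad)
    return ads

store_association("example.com", ["Sports/Outdoors/Camping"])
ads = relevant_ads("example.com")
```

A Website with no stored association yields an empty list, so callers can fall back to untargeted ads.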
Specification