Categorizing objects, such as documents and/or clusters, with respect to a taxonomy and data structures derived from such categorization
First Claim
1. A method for categorizing a property into one or more categories of a predefined taxonomy, the method comprising:
a) receiving, by a computer system, information about a property;
b) identifying, by the computer system using the received information about the property, multiple semantic clusters of re-occurring terms within the information;
c) identifying, by the computer system, a set of one or more categories for the property from among the multiple semantic clusters based on a frequency of occurrence of the re-occurring terms in the information, including:
for each level of multiple different levels of a hierarchical taxonomy of categories, determining whether a cluster score for a category at that level of the hierarchical taxonomy meets a pre-specified cluster score threshold;
identifying, based on the determination, a deepest level from a top level of the hierarchical taxonomy that includes a given category having the cluster score that was determined to meet the pre-specified threshold, wherein the cluster score of a given category at a given level of the hierarchical taxonomy is a combination of the cluster score for the given category at that given level and cluster scores of one or more lower level categories that are subsumed by the given category at that level; and
assigning the given category of the most specific deepest level from the top level of the hierarchical taxonomy having the cluster score that meets the pre-specified threshold value as an assigned category for the property;
d) generating, using the identified set of categories, a mapping of the property to at least some of the one or more categories, including generating a mapping of the property to the assigned category;
e) receiving, by the computer system, a term submitted by a user;
f) identifying, by the computer system and using a mapping of terms to categories, the assigned category as a category that is mapped to the term; and
g) providing, to the user, information identifying the property based on the property being assigned to the assigned category that is mapped to the term.
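The level-by-level scoring in step c) can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Category` class, the `combined_score` and `assign_category` helpers, and the sample scores are hypothetical. The claim only says a category's score is "a combination" of its own score and the scores of the categories it subsumes; a plain sum is assumed here, and ties at the same depth are broken by the higher score, which is also an assumption.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Category:
    """One node of a hierarchical taxonomy (hypothetical structure)."""
    name: str
    own_score: float = 0.0  # score of clusters mapped directly to this category
    children: List["Category"] = field(default_factory=list)


def combined_score(cat: Category) -> float:
    # ASSUMPTION: the "combination" is a plain sum of the category's own
    # score and the combined scores of every category it subsumes.
    return cat.own_score + sum(combined_score(c) for c in cat.children)


def assign_category(root: Category, threshold: float) -> Optional[Category]:
    """Return the deepest category whose combined score meets the threshold."""
    best = None  # (depth, score, category)
    stack = [(root, 0)]
    while stack:
        cat, depth = stack.pop()
        score = combined_score(cat)
        # Prefer deeper levels; at equal depth prefer the higher score (assumption).
        if score >= threshold and (best is None or (depth, score) > best[:2]):
            best = (depth, score, cat)
        stack.extend((child, depth + 1) for child in cat.children)
    return best[2] if best else None


# Illustrative taxonomy with made-up scores.
taxonomy = Category("Autos", 1.0, [
    Category("Trucks", 2.0, [Category("Pickups", 5.0), Category("Vans", 1.0)]),
    Category("Sedans", 0.5),
])

print(assign_category(taxonomy, threshold=5.0).name)  # Pickups
print(assign_category(taxonomy, threshold=7.0).name)  # Trucks
```

With a threshold of 5.0 the most specific qualifying category is "Pickups". Raising the threshold to 7.0 leaves only "Autos" (9.5) and "Trucks" (8.0) above it, so the assignment falls back to the broader "Trucks" category.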
Abstract
A Website may be automatically categorized by accepting Website information, determining a set of scored clusters for the Website using the Website information, and determining at least one category of a predefined taxonomy using at least some of the set of clusters.
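As a rough illustration of "determining a set of scored clusters" from Website information, the following Python sketch counts re-occurring terms and sums them per cluster. Everything specific in it is an assumption: the `CLUSTERS` table, the `score_clusters` helper, and the rule that a term must appear at least twice to count are hypothetical, and the source does not say how clusters are built or scored.

```python
import re
from collections import Counter

# Hypothetical clusters: each maps a cluster name to the terms it groups.
CLUSTERS = {
    "pickup trucks": {"truck", "pickup", "towing"},
    "car loans": {"loan", "financing", "apr"},
}


def score_clusters(text, clusters=CLUSTERS, min_count=2):
    """Score each cluster by the total frequency of its re-occurring terms."""
    counts = Counter(re.findall(r"[a-z]+", text.lower()))
    scores = {}
    for name, terms in clusters.items():
        # ASSUMPTION: a term is "re-occurring" if it appears min_count+ times.
        total = sum(counts[t] for t in terms if counts[t] >= min_count)
        if total:
            scores[name] = total
    return scores


page = ("Truck reviews: pickup truck towing, truck bed sizes, "
        "pickup towing. One loan offer.")
print(score_clusters(page))  # {'pickup trucks': 7}
```

Here "truck" occurs three times and "pickup" and "towing" twice each, giving the "pickup trucks" cluster a score of 7, while the single mention of "loan" does not re-occur and contributes nothing.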
42 Citations
20 Claims
- Dependent claims of claim 1: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for associating a property with one or more categories of a predefined taxonomy, the system comprising:
a computer system comprising a processor and a memory storing an advertising targeting database, the processor configured to perform operations including:
receiving information about a property;
identifying, by the computer system using the received information about the property, multiple semantic clusters of re-occurring terms within the information;
identifying, by the computer system, a set of one or more categories using the multiple semantic clusters, including:
for each level of multiple different levels of a hierarchical taxonomy of categories, determining whether a cluster score for a category at that level of the hierarchical taxonomy meets a pre-specified cluster score threshold;
identifying, based on the determination, a deepest level from a top level of the hierarchical taxonomy that includes a given category having the cluster score that was determined to meet the pre-specified threshold, wherein the cluster score of a given category at a given level of the hierarchical taxonomy is a combination of the cluster score for the given category at that given level and cluster scores of one or more lower level categories that are subsumed by the given category at that level; and
assigning the given category of the deepest level from the top level of the hierarchical taxonomy having the cluster score that meets the pre-specified threshold value as an assigned category for the property;
generating a mapping of the property to at least some of the one or more categories, including generating a mapping of the property to the assigned category;
receiving, by the computer system, a term submitted by a user;
identifying, by the computer system and using a mapping of terms to categories, the assigned category as a category that is mapped to the term; and
providing, to the user, information identifying the property based on the property being assigned to the assigned category that is mapped to the term.
- Dependent claims of claim 10: 11, 12, 13, 14, 15, 16, 17, 18
19. A computer-readable storage medium storing instructions that when executed by one or more data processors, cause the one or more data processors to perform operations comprising:
receiving information about a property;
identifying, using the received information about the property, multiple semantic clusters of re-occurring terms within the information;
identifying a set of one or more categories using the multiple semantic clusters, including:
for each level of multiple different levels of a hierarchical taxonomy of categories, determining whether a cluster score for a category at that level of the hierarchical taxonomy meets a pre-specified cluster score threshold;
identifying, based on the determination, a deepest level from a top level of the hierarchical taxonomy that includes a given category having the cluster score that was determined to meet the pre-specified threshold, wherein the cluster score of a given category at a given level of the hierarchical taxonomy is a combination of the cluster score for the given category at that given level and cluster scores of one or more lower level categories that are subsumed by the given category at that level; and
assigning the given category of the deepest level from the top level of the hierarchical taxonomy having the cluster score that meets the pre-specified threshold value as an assigned category for the property;
generating a mapping of the property to at least some of the one or more categories, including generating a mapping of the property to the assigned category;
receiving a term submitted by a user;
identifying, using a mapping of terms to categories, the assigned category as a category that is mapped to the term; and
providing, to the user, information identifying the property based on the property being assigned to the assigned category that is mapped to the term.
- Dependent claim of claim 19: 20
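The final three operations recited in the independent claims (receive a term, find the category mapped to it, return properties assigned to that category) amount to two chained lookups. The Python sketch below shows one way this could look; the `TargetingIndex` class, its method names, and the sample property and terms are hypothetical and are not taken from the patent.

```python
from collections import defaultdict


class TargetingIndex:
    """Two hypothetical mappings: term -> categories, category -> properties."""

    def __init__(self):
        self.term_to_categories = defaultdict(set)
        self.category_to_properties = defaultdict(set)

    def map_property(self, prop, category):
        # Record that a property was assigned to a category.
        self.category_to_properties[category].add(prop)

    def map_term(self, term, category):
        # Record that a term is mapped to a category (case-insensitive).
        self.term_to_categories[term.lower()].add(category)

    def properties_for_term(self, term):
        # Follow term -> categories -> properties and return a sorted list.
        props = set()
        for category in self.term_to_categories.get(term.lower(), set()):
            props |= self.category_to_properties.get(category, set())
        return sorted(props)


index = TargetingIndex()
index.map_property("trucks.example.com", "Pickups")
index.map_term("towing", "Pickups")

print(index.properties_for_term("Towing"))   # ['trucks.example.com']
print(index.properties_for_term("sailing"))  # []
```

Keeping the term-to-category and category-to-property mappings separate means a newly categorized property becomes reachable from every term already mapped to its category, without touching the term mapping.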
Specification