DERIVING ONTOLOGY BASED ON LINGUISTICS AND COMMUNITY TAG CLOUDS
First Claim
1. A method comprising:
- receiving a tag cloud including tags that hyperlink to web content;
separating the tags into linguistic categories;
assigning a weight to each tag, wherein the weight is based on how many times the tag appears in the web content;
grouping the tags into clusters, wherein each tag in each cluster is associated with a context;
determining one or more domains for the tag clusters, wherein each domain defines one or more of the tags in each linguistic category;
determining a hierarchy for the tags based on the weights of the tags, wherein the hierarchy is based on results from a visual thesaurus;
identifying linguistic relationships between one or more of the tags;
determining properties associated with one or more of the tags and one or more of the domains, wherein the tag'"'"'s properties are determined using linguistic analysis.
1 Assignment
0 Petitions
Accused Products
Abstract
In some embodiments, a method comprises receiving a tag cloud including tags that hyperlink to web content. The method can also comprise separating the tags into different linguistic categories, assigning a weight to each tag, and grouping the tags into clusters, wherein tags in a cluster are associated with a context. The method can also include determining one or more domains for the tag clusters, wherein a domain is a broadest class that defines one or more of the tags in a linguistic category, determining a hierarchy for the tags based on the weights of the tags, and identifying linguistic relationships between the tags. The method can also comprise determining properties associated with one or more of the tags and one or more of the domains, wherein the tag'"'"'s properties are determined using linguistic analysis and storing the tags, the hierarchies, the linguistic relationships, and the properties.
-
Citations
25 Claims
-
1. A method comprising:
-
receiving a tag cloud including tags that hyperlink to web content; separating the tags into linguistic categories; assigning a weight to each tag, wherein the weight is based on how many times the tag appears in the web content; grouping the tags into clusters, wherein each tag in each cluster is associated with a context; determining one or more domains for the tag clusters, wherein each domain defines one or more of the tags in each linguistic category; determining a hierarchy for the tags based on the weights of the tags, wherein the hierarchy is based on results from a visual thesaurus; identifying linguistic relationships between one or more of the tags; determining properties associated with one or more of the tags and one or more of the domains, wherein the tag'"'"'s properties are determined using linguistic analysis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a tag cloud linguistic analyzer configured to receive a tag cloud including tags that hyperlink to web content, to separate the tags into linguistic categories, and to assign a weight to each tag, wherein the weight is based on how many times the tag appears in the web content; a semantic domain analyzer configured to group the tags into clusters, wherein tags in each cluster are associated with a context, and to determine one or more domains for the tag clusters, wherein each domain defines one or more of the tags in one of the linguistic categories; a taxonomy builder configured to determine a hierarchy for the tags based on the weights of the tags, wherein the hierarchy is based on results from a visual thesaurus; a relationship analyzer configured to identify linguistic relationships between one or more of the tags; an attribute analyzer configured to determine properties associated with one or more of the tags and one or more of the domains, wherein the tag'"'"'s properties are determined using linguistic analysis; and an ontology repository to store the tags, the hierarchies, the linguistic relationships, and the properties. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. One or more machine-readable media having stored therein a program product, which when executed a set of one or more processor units causes the set of one or more processor units to perform operations comprising:
-
receiving a tag cloud including tags that hyperlink to web content; separating the tags into linguistic categories; assigning a weight to each tag, wherein the weight is based on how many times the tag appears in the web content; grouping the tags into clusters, wherein each tag in each cluster is associated with a context; determining one or more domains for the tag clusters, wherein each domain defines one or more of the tags in each linguistic category; determining a hierarchy for the tags based on the weights of the tags, wherein the hierarchy is also based on results from a visual thesaurus; identifying linguistic relationships between one or more of the tags; determining properties associated with one or more of the tags and one or more of the domains, wherein the tag'"'"'s properties are determined using linguistic analysis. - View Dependent Claims (22, 23, 24, 25)
-
Specification