Relating similar terms for information retrieval
First Claim
1. A non-transitory tangible computer-readable medium having instructions stored thereon, the instructions, when executed by a processing device, enabling the processing device to perform operations of:
- selecting a resource from a grouping of resources;
wherein the grouping of resources includes one or more of;
documents, web sites, bookmarks, audio files, and images;
after selecting the resource, assigning a controlled tag to the selected resource;
wherein the controlled tag is a first textual term selected from a controlled vocabulary including a finite and predetermined set of textual terms;
after selecting the resource, assigning an uncontrolled tag to the selected resource;
wherein the uncontrolled tag is a second textual term selected from an uncontrolled vocabulary including an open set of textual terms;
identifying a first set of resources in the grouping of resources, each resource in the first set of resources having also been previously assigned the same uncontrolled tag;
the same uncontrolled tag assigned to the first set of resources being the second textual term;
identifying a second set of resources in the grouping of resources, each resource in the second set of resources having also been previously assigned the same controlled tag;
the same controlled tag assigned to the second set of resources being the first textual term;
producing a comparison result indicative of a similarity between the first set of resources and the second set of resources; and
when the similarity exceeds a threshold value, generating a rule that for any resources that are assigned the second textual term as the uncontrolled tag and are not assigned the first textual term as the controlled tag, these resources will be assigned the first textual term as the controlled tag.
2 Assignments
0 Petitions
Accused Products
Abstract
A resource analyzer selects a resource (e.g., document) from a grouping of resources. The grouping of resources can be any type of social tagging system used for information retrieval. The selected resource has an assigned uncontrolled tag and an assigned controlled tag. The controlled tag is a term derived from a controlled vocabulary of terms. Having selected the resource for analyzing, the resource analyzer identifies a first set of resources in the grouping of resources having also been assigned a same value as the uncontrolled tag as the selected resource. Similarly, the resource analyzer identifies a second set of resources in the grouping of resources having also been assigned a same value as the controlled tag. With this information, the resource analyzer then produces a comparison result indicative of a similarity between the first set of resources and the second set of resources.
-
Citations
16 Claims
-
1. A non-transitory tangible computer-readable medium having instructions stored thereon, the instructions, when executed by a processing device, enabling the processing device to perform operations of:
-
selecting a resource from a grouping of resources; wherein the grouping of resources includes one or more of;
documents, web sites, bookmarks, audio files, and images;after selecting the resource, assigning a controlled tag to the selected resource; wherein the controlled tag is a first textual term selected from a controlled vocabulary including a finite and predetermined set of textual terms; after selecting the resource, assigning an uncontrolled tag to the selected resource; wherein the uncontrolled tag is a second textual term selected from an uncontrolled vocabulary including an open set of textual terms; identifying a first set of resources in the grouping of resources, each resource in the first set of resources having also been previously assigned the same uncontrolled tag; the same uncontrolled tag assigned to the first set of resources being the second textual term; identifying a second set of resources in the grouping of resources, each resource in the second set of resources having also been previously assigned the same controlled tag; the same controlled tag assigned to the second set of resources being the first textual term; producing a comparison result indicative of a similarity between the first set of resources and the second set of resources; and when the similarity exceeds a threshold value, generating a rule that for any resources that are assigned the second textual term as the uncontrolled tag and are not assigned the first textual term as the controlled tag, these resources will be assigned the first textual term as the controlled tag. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer system comprising:
-
a processor; a memory unit that stores instructions associated with an application executed by the processor; and an interconnect coupling the processor and the memory unit, enabling the computer system to execute the application and perform operations of; selecting a resource from a grouping of resources; wherein the grouping of resources includes one or more of;
documents, web sites, bookmarks, audio files, and images;after selecting the resource, assigning a controlled tag to the selected resource; wherein the controlled tag is a first textual term selected from a controlled vocabulary including a finite and predetermined set of textual terms; after selecting the resource, assigning an uncontrolled tag to the selected resource; wherein the uncontrolled tag is a second textual term selected from an uncontrolled vocabulary including an open set of textual terms; identifying a first set of resources in the grouping of resources having also been previously assigned the same uncontrolled tag; the same uncontrolled tag assigned to the first set of resources being the second textual term; identifying a second set of resources in the grouping of resources having also been previously assigned the same controlled tag; the same controlled tag assigned to the second set of resources being the first textual term; producing a comparison result indicative of a similarity between the first set of resources and the second set of resources; and when the similarity exceeds a threshold value, generating a rule that for any resources that are assigned the second textual term as the uncontrolled tag and are not assigned the first textual term as the controlled tag, these resources will be assigned the first textual term as the controlled tag. - View Dependent Claims (16)
-
Specification