Refinement and calibration mechanism for improving classification of information assets
First Claim
Patent Images
1. A computer-implemented method for refining asset classifications, the method comprising:
- receiving a plurality of assets, each asset having a classification of a term, wherein each term is selected from a business glossary which provides a hierarchy of controlled vocabulary of terms used within an organization and wherein each asset is characterized using a set of attributes selected from a domain ontology; and
upon determining a first term assigned to a first one of the assets satisfies a set of refinement criteria, refining the classification of the first asset by assigning the first asset a second term from the business glossary, wherein the second term is more precise in the business glossary than the first term and wherein the refinement criteria includes;
determining that the term of a second one of the assets comprises a descendent of the classification of the first asset, anddetermining that each attribute of the first asset is at a lower level in the domain ontology than a corresponding attribute in the second asset.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques are described for refining the manual classification of assets classified or categorized using the terms of a business glossary. A semantic refinement mechanism is used to refine the manual classification of such assets, as well as subsequently evaluate the refined asset classifications. Further, the refined asset classifications may be used as a training set for a machine learning classifier. That is, should the classification of an asset contributing to a refinement change, the refinement based on that classification may be undone, at least in some cases.
9 Citations
6 Claims
-
1. A computer-implemented method for refining asset classifications, the method comprising:
-
receiving a plurality of assets, each asset having a classification of a term, wherein each term is selected from a business glossary which provides a hierarchy of controlled vocabulary of terms used within an organization and wherein each asset is characterized using a set of attributes selected from a domain ontology; and upon determining a first term assigned to a first one of the assets satisfies a set of refinement criteria, refining the classification of the first asset by assigning the first asset a second term from the business glossary, wherein the second term is more precise in the business glossary than the first term and wherein the refinement criteria includes; determining that the term of a second one of the assets comprises a descendent of the classification of the first asset, and determining that each attribute of the first asset is at a lower level in the domain ontology than a corresponding attribute in the second asset. - View Dependent Claims (2, 3, 4, 5, 6)
-
Specification