System and method of determining and recommending a document control policy for a document
First Claim
1. A computer-implemented method comprising:
- determining a genre classification for a document, the genre classification comprising multiple terms and corresponding scores, each score for a term indicating a confidence level for the term with respect to the document;
accessing a stored document control policy ontology, the document control policy ontology comprising a hierarchy of nodes that represent document genres and have corresponding document control policies;
identifying an entry point node in the hierarchy of nodes of the document control policy ontology by successively comparing the multiple terms in the genre classification with the nodes in the document control policy ontology in order of increasing hierarchical position of the terms until either a matching node in the ontology is found or the term in the highest hierarchical position of the multiple terms is reached and no matching node is found in which case a document control policy corresponding to a root node is used the identifying comprising, when the document control policy ontology has an underlying classification structure different from a classification structure used for determining the genre classification, identifying a correlation between the different classification structures;
assessing a confidence level for applicability of the entry point node based at least in part upon at least one of the scores;
inferencing within the document control policy ontology to find a document control policy more conservative than the policy corresponding to the entry point node, the inferencing comprising selecting a parent node of the entry point node in the document control policy ontology to stand in for the entry point node if the assessed confidence level for applicability of the entry point node falls below a threshold, the parent node inheriting at least one document control policy derived from at least the entry point node; and
outputting, to a hardware device, a recommendation that identifies at least one document control policy to govern access to the document based on the identified entry point node or the selected parent node in the document control policy ontology.
2 Assignments
0 Petitions
Accused Products
Abstract
In general, a genre classification can be determined for a document, the genre classification including multiple terms and corresponding scores indicating confidence levels for the terms with respect to the document. A relevant node in a document control policy ontology can be identified in accordance with the genre classification, and a confidence level for applicability of the relevant node can be assessed based at least in part upon at least one of the scores. A parent node of the relevant node in the document control policy ontology can be selected to stand in for the relevant node if the assessed confidence level for applicability of the relevant node falls below a threshold. At least one document control policy can be recommended to govern access to the document based on the identified or selected relevant node in the document control policy ontology.
-
Citations
30 Claims
-
1. A computer-implemented method comprising:
-
determining a genre classification for a document, the genre classification comprising multiple terms and corresponding scores, each score for a term indicating a confidence level for the term with respect to the document; accessing a stored document control policy ontology, the document control policy ontology comprising a hierarchy of nodes that represent document genres and have corresponding document control policies; identifying an entry point node in the hierarchy of nodes of the document control policy ontology by successively comparing the multiple terms in the genre classification with the nodes in the document control policy ontology in order of increasing hierarchical position of the terms until either a matching node in the ontology is found or the term in the highest hierarchical position of the multiple terms is reached and no matching node is found in which case a document control policy corresponding to a root node is used the identifying comprising, when the document control policy ontology has an underlying classification structure different from a classification structure used for determining the genre classification, identifying a correlation between the different classification structures; assessing a confidence level for applicability of the entry point node based at least in part upon at least one of the scores; inferencing within the document control policy ontology to find a document control policy more conservative than the policy corresponding to the entry point node, the inferencing comprising selecting a parent node of the entry point node in the document control policy ontology to stand in for the entry point node if the assessed confidence level for applicability of the entry point node falls below a threshold, the parent node inheriting at least one document control policy derived from at least the entry point node; and outputting, to a hardware device, a recommendation that identifies at least one document control policy to govern access to the document based on the identified entry point node or the selected parent node in the document control policy ontology. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a user interface device; a document control component comprising a hierarchical knowledge structure including document control policies, including at least one document control policy inherited by a parent node in accordance with an algebraic maximum of rules associated with child nodes of the parent node; and one or more computers operable to interact with the user interface device and the document control component to determine a genre classification for a document, compare the genre classification with the hierarchical knowledge structure to identify a relevant node among multiple nodes, including the parent node, and recommend at least one document control policy to govern access to the document based on the identified relevant node in the hierarchical knowledge structure, the relevant node identified by successively comparing multiple terms in the genre classification with the hierarchical knowledge structure based on hierarchical positions of the terms until either a matching node in the hierarchical knowledge structure is found or the term in the highest hierarchical position of the multiple terms is reached and no matching node is found in which case a document control policy corresponding to a root node is used, and when the hierarchical knowledge structure of the document control component differs from a hierarchical knowledge structure used for determining the genre classification, the one or more computers being operable to identify a correlation between the different knowledge structures. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A computer program product, encoded on a machine-readable storage device, configured to cause one or more data processing apparatus to perform operations comprising:
-
receiving a genre classification for a document, the genre classification comprising multiple terms and corresponding scores, each score for a term indicating a confidence level for the term with respect to the document; accessing a stored document control policy ontology, the document control policy ontology comprising a hierarchy of nodes that represent document genres and have corresponding document control policies; identifying an entry point node in the hierarchy of nodes of the document control policy ontology by successively comparing the multiple terms in the genre classification with the nodes in the document control policy ontology in order of increasing hierarchical position of the terms until either a matching node in the ontology is found or the term in the highest hierarchical position of the multiple terms is reached and no matching node is found in which case a document control policy corresponding to a root node is used the identifying comprising, when the document control policy ontology has an underlying classification structure different from a classification structure used for determining the genre classification, identifying a correlation between the different classification structures; assessing a confidence level for applicability of the entry point node based at least in part upon at least one of the scores; inferencing within the document control policy ontology to find a document control policy more conservative than the policy corresponding to the entry point node, the inferencing comprising selecting a parent node of the entry point node in the document control policy ontology to stand in for the entry point node if the assessed confidence level for applicability of the entry point node falls below a threshold, the parent node inheriting at least one document control policy derived from at least the entry point node; and recommending at least one document control policy to govern access to the document based on the identified entry point node or the selected parent node in the document control policy ontology. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
-
Specification