Method and system to identify records that relate to a pre-defined context in a data set
Abstract
The present invention provides a method and a system for identifying relevant information in a data set. The method involves the identification of nodes of interest in a tree structure. A node of interest is a node that contains information relevant to a pre-defined context. The method further involves iteratively extracting sub-trees from the tree structure and identifying records in the extracted sub-trees. Each sub-tree is a hierarchical structure that shows the relationship of each node of interest with its ancestor nodes in the tree structure. Each record is a group of sub-tree nodes that contains at least one node of interest.
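The three steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Node` class, the function names, and the sample catalog tree are all hypothetical, and the record-grouping rule in particular (descend to the first branching point of the extracted sub-tree and treat each branch as one record) is an assumption chosen only to make the example concrete. The abstract does not state how records are grouped.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterator, Optional


@dataclass(eq=False)  # eq=False keeps identity-based hashing, so nodes can live in sets
class Node:
    label: str
    text: str = ""
    children: list["Node"] = field(default_factory=list)
    parent: Optional["Node"] = field(default=None, repr=False)

    def add(self, child: "Node") -> "Node":
        child.parent = self
        self.children.append(child)
        return child


def walk(node: Node) -> Iterator[Node]:
    """Pre-order traversal of the tree rooted at `node`."""
    yield node
    for child in node.children:
        yield from walk(child)


def find_nodes_of_interest(root: Node, is_relevant: Callable[[Node], bool]) -> list[Node]:
    """Step a: nodes whose content matches the pre-defined context."""
    return [n for n in walk(root) if is_relevant(n)]


def extract_subtree(nodes_of_interest: list[Node]) -> set[Node]:
    """Step b: for each node of interest, walk up and keep it and its ancestors."""
    kept: set[Node] = set()
    for node in nodes_of_interest:
        cur: Optional[Node] = node
        while cur is not None and cur not in kept:
            kept.add(cur)
            cur = cur.parent
    return kept


def identify_records(root: Node, kept: set[Node]) -> list[list[Node]]:
    """Step c (assumed heuristic): one record per branch below the first branching point."""
    if root not in kept:
        return []
    node = root
    while True:
        kept_children = [c for c in node.children if c in kept]
        if len(kept_children) != 1:
            break
        node = kept_children[0]
    if not kept_children:  # a single chain ending in one node of interest
        return [[node]]
    return [[n for n in walk(c) if n in kept] for c in kept_children]


# Hypothetical data set: a small catalog page where prices are the context.
root = Node("html")
body = root.add(Node("body"))
body.add(Node("h1", "Catalog"))
items = body.add(Node("ul"))
for name, price in [("Pen", "$2"), ("Ink", "$5")]:
    li = items.add(Node("li"))
    li.add(Node("span", name))
    li.add(Node("span", price))
body.add(Node("footer", "Contact us"))

interest = find_nodes_of_interest(root, lambda n: n.text.startswith("$"))
kept = extract_subtree(interest)
records = identify_records(root, kept)
```

In this sketch the heading, footer, and product-name nodes fall outside the extracted sub-tree because they are neither nodes of interest nor ancestors of one, and each resulting record holds one `li` node together with its price node.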
11 Claims
1. A method for identifying relevant information, which relates to a pre-defined context, from a data set represented in the form of a tree structure, the tree structure being a representation of the data set based on a logical relationship between different data objects in the data set, the method comprising the steps of:
a. identifying nodes of interest in the tree structure, each node of interest being a node that contains information, which is relevant to the pre-defined context;
b. iteratively extracting sub-trees from the tree structure, each sub-tree being a hierarchical structure that shows the relationship of the node of interest with its ancestor nodes in the tree structure; and
c. identifying records in the extracted sub-trees, each record being a group of sub-tree nodes, containing at least one node of interest.
5. A system for identifying relevant information, which relates to a pre-defined context, from a data set represented in the form of a tree structure, the tree structure being a representation of the data set based on a logical relationship between different data objects in the data set, the system comprising:
a. a node identifier, to identify nodes of interest in the tree structure, each node of interest being a node that contains information, which is relevant to the pre-defined context;
b. a sub-tree extractor, to iteratively extract sub-trees from the tree structure, each sub-tree being a hierarchical structure showing the relationship of the node of interest with its ancestor nodes in the tree structure; and
c. a record recognizer, to identify records, each record being a group of sub-tree nodes, containing at least one node of interest.
8. A computer program product for use with a computer, the computer program product comprising a computer usable medium having a computer program code embodied therein to identify relevant information, which relates to a pre-defined context, from a data set represented in the form of a tree structure, the tree structure being a representation of the data set based on a logical relationship between different data objects in the data set, the computer program code performing the steps of:
a. identifying nodes of interest in the tree structure, each node of interest being a node that contains information, which is relevant to the pre-defined context;
b. iteratively extracting sub-trees from the tree structure, each sub-tree being a hierarchical structure that shows the relationship of the node of interest with its ancestor nodes in the tree structure; and
c. identifying records, each record being a group of sub-tree nodes, containing at least one node of interest.
Specification