Method, apparatus, and computer program product for locating data in large datasets
First Claim
1. , A data analysis system for analyzing data sets, comprising:
- a database for storing data;
analysis processing means for analyzing a relation among data elements based on a number of occurrences, in stored data, of a first data element, and a number of simultaneous occurrences, in the stored data, of the first data element and a second data element; and
output means for outputting results provided by the analysis processing means.
1 Assignment
0 Petitions
Accused Products
Abstract
To analyze a data set having a one-to-many relation, the number of simultaneous occurrences of data in which two data elements are coexistent is obtained for all combinations of two data elements. A dependence ratio of one data element upon the other data element is calculated from the numbers of simultaneous occurrences. The data elements are grouped based upon the numbers of occurrences of individual data elements and the dependence ratios compared with the predetermined thresholds. Based on the number of occurrences of individual data elements and the dependence ratios, subordinate relations of data elements within the same group are specified and displayed to a user in the form of a tree or balloon figure.
-
Citations
14 Claims
-
1. , A data analysis system for analyzing data sets, comprising:
-
a database for storing data;
analysis processing means for analyzing a relation among data elements based on a number of occurrences, in stored data, of a first data element, and a number of simultaneous occurrences, in the stored data, of the first data element and a second data element; and
output means for outputting results provided by the analysis processing means. - View Dependent Claims (2, 3, 4)
-
-
5. , Apparatus for data analysis, comprising:
-
analysis processing means for specifying pairs of keywords based on frequencies that keywords occur in a set of keywords of data stored in a database, and grouping the set of keywords into groups of keywords based on the specified pairs of keywords; and
output means for outputting results provided by the analysis processing means. - View Dependent Claims (6, 7, 8)
-
-
9. , A display terminal comprising:
-
an interface for requesting analysis of data;
accepting means for accepting analysis results giving a relation based on a number of occurrences of two data elements of a plurality of data elements; and
output means for displaying the relation as a figure, based on the analysis results. - View Dependent Claims (10, 11)
-
-
12. , A data analysis method, comprising the steps of:
-
calculating dependence ratios of a plurality of data elements to be analyzed;
grouping the data elements according to the dependence ratios; and
outputting the grouped data elements. - View Dependent Claims (13, 14)
-
Specification