Data classifier system, data classifier method and data classifier program stored on storage medium
Abstract
A data classifier system of the present invention selects a plurality of classifications correlated to data groups so as to output classification axes based on hierarchical classifications and data groups. The data classifier system includes a basic category accumulation means, a classification axis candidate reduction means and a priority calculation means. The basic category accumulation means accumulates classifications serving as basic categories used for desired classifications in advance. The classification axis candidate reduction means selects a plurality of classifications from among classifications descendant from each basic category so as to create classification axis candidates, thus reducing classification axis candidates subjected to calculations based on data quantity of classifications and hierarchical distances of classifications. The priority calculation means calculates priority in displaying classification axis candidates, the number of which is reduced by the classification axis candidate reduction means.
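The reduction step summarized in the abstract can be illustrated with a minimal sketch. It assumes a toy hierarchy stored as a child-to-parent map and a per-classification data count; the names `PARENT`, `DATA_COUNT`, `descendants`, and `reduce_candidates` are hypothetical, and the simple minimum-count threshold is an assumption standing in for the reduction rule, which the abstract does not spell out.

```python
# Hypothetical classified hierarchy: child -> parent (the root has parent None).
PARENT = {
    "society": None,
    "politics": "society",
    "economy": "society",
    "elections": "politics",
    "diplomacy": "politics",
    "markets": "economy",
}

# Hypothetical data quantity: number of data items filed under each classification.
DATA_COUNT = {"elections": 40, "diplomacy": 3, "markets": 25}


def descendants(basic_category):
    """Return every classification strictly below basic_category."""
    found = []
    for node in PARENT:
        current = PARENT[node]
        while current is not None:
            if current == basic_category:
                found.append(node)
                break
            current = PARENT[current]
    return found


def reduce_candidates(basic_category, min_count):
    """Assumed reduction rule: keep descendants holding at least min_count items."""
    return [
        c for c in descendants(basic_category)
        if DATA_COUNT.get(c, 0) >= min_count
    ]
```

With this toy data, `reduce_candidates("society", 10)` keeps only the classifications that hold enough data to be worth scoring, so the later priority calculation runs over fewer candidates.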
60 Citations
20 Claims
1. A data classifier system which refers to a classified hierarchy establishing a parent-child relationship between a plurality of classifications to produce a plurality of classification axes based on classifications and data groups input to the data classifier system, said data classifier system comprising:
hardware, including a processor and memory, wherein:
the hardware is configured to implement a basic category accumulation part which accumulates classifications serving as basic categories on which basis desired classifications are selected;
the hardware is configured to implement a classification axis candidate reduction part which selects a plurality of classifications from among classifications descendant from each basic category in consideration of semantic relevancy therebetween, to produce a plurality of classification axis candidates based on data quantity of classifications with reference to a common ancestor among classifications in the classified hierarchy; and
the hardware is configured to implement a priority calculation part which calculates priority in displaying the plurality of classification axis candidates with reference to hierarchical distances, each representing a length of a path reaching the common ancestor among classifications associated with classification axis candidates, wherein one of:
(a) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = \frac{1}{\mathrm{Max}(X)} \times \frac{1}{2 \times N} \times \sum \mathrm{ComDist}(c_i, c_j),$$
where Max(X) denotes a depth of a deepest classification among classifications descending from the basic category X; N denotes a number of the plurality of classifications; and ComDist($c_i$, $c_j$) denotes a distance between classifications $c_i$ and $c_j$ selected from among the plurality of classifications; and
(b) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = W_1 \times \mathrm{Independence}(X; C) + W_2 \times \mathrm{Specifics}(X; C) + W_3 \times \mathrm{Exhaustivity}(X; C) + W_4 \times \mathrm{Uniqueness}(X; C),$$
where $W_1$, $W_2$, $W_3$, and $W_4$ denote weight coefficients to indexes which are determined in advance.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
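Formula (a) of claim 1 can be worked through on a small example. The sketch below is illustrative only and rests on three assumptions the claim does not fix: ComDist is taken as the number of edges from each classification up to their nearest common ancestor, added together; Max(X) is counted in edges below X; and the sum runs over unordered pairs of classifications. The hierarchy and all function names are hypothetical.

```python
from itertools import combinations

# Hypothetical classified hierarchy: child -> parent (the root has parent None).
PARENT = {
    "society": None,
    "politics": "society",
    "economy": "society",
    "elections": "politics",
    "diplomacy": "politics",
    "markets": "economy",
}


def ancestors(node):
    """Return [node, parent, grandparent, ...] up to the root."""
    chain = []
    while node is not None:
        chain.append(node)
        node = PARENT[node]
    return chain


def com_dist(ci, cj):
    """Assumed ComDist: edges from ci plus edges from cj to their nearest common ancestor."""
    up_i = ancestors(ci)
    up_j = ancestors(cj)
    for steps_i, node in enumerate(up_i):
        if node in up_j:
            return steps_i + up_j.index(node)
    raise ValueError("classifications share no common ancestor")


def max_depth(x):
    """Assumed Max(X): edge count from X down to its deepest descendant."""
    return max(ancestors(n).index(x) for n in PARENT if x in ancestors(n))


def priority_a(x, classifications):
    """Priority(X; C) = 1/Max(X) * 1/(2*N) * sum of ComDist over unordered pairs.

    Raises ZeroDivisionError if X has no descendants or C is empty.
    """
    n = len(classifications)
    total = sum(com_dist(ci, cj) for ci, cj in combinations(classifications, 2))
    return (1 / max_depth(x)) * (1 / (2 * n)) * total
```

For X = "society" and C = {"elections", "diplomacy", "markets"}, the pairwise distances under these assumptions are 2, 4, and 4, Max(X) is 2, and N is 3, so the priority is (1/2) × (1/6) × 10 = 5/6. A candidate whose classifications all sit under the same parent scores lower than one whose classifications are spread across the hierarchy.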
13. A data classifier method which refers to a classified hierarchy establishing a parent-child relationship between a plurality of classifications to produce a plurality of classification axes based on classifications and data groups input to the method, said data classifier method comprising:
a classification axis candidate reduction process which accumulates classifications serving as basic categories on which basis desired classifications in a database are selected, which selects a plurality of classifications from among classifications descendant from each basic category in consideration of semantic relevancy therebetween, to produce a plurality of classification axis candidates based on data quantity of classifications with reference to a common ancestor among classifications in the classified hierarchy; and
a priority calculation process which calculates priority in displaying the plurality of classification axis candidates with reference to hierarchical distances, each representing a length of a path reaching the common ancestor among classifications associated with classification axis candidates, wherein one of:
(a) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = \frac{1}{\mathrm{Max}(X)} \times \frac{1}{2 \times N} \times \sum \mathrm{ComDist}(c_i, c_j),$$
where Max(X) denotes a depth of a deepest classification among classifications descending from the basic category X; N denotes a number of the plurality of classifications; and ComDist($c_i$, $c_j$) denotes a distance between classifications $c_i$ and $c_j$ selected from among the plurality of classifications; and
(b) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = W_1 \times \mathrm{Independence}(X; C) + W_2 \times \mathrm{Specifics}(X; C) + W_3 \times \mathrm{Exhaustivity}(X; C) + W_4 \times \mathrm{Uniqueness}(X; C),$$
where $W_1$, $W_2$, $W_3$, and $W_4$ denote weight coefficients to indexes which are determined in advance.
Dependent claims: 14, 15
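Option (b) is a weighted sum of four indexes, which can be sketched as follows. The claims shown here do not define how Independence, Specifics, Exhaustivity, or Uniqueness are computed, so this sketch takes them as already-computed numbers; the `IndexScores` type, the equal default weights, and the `rank_candidates` helper are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IndexScores:
    """Precomputed index values for one classification axis candidate (hypothetical type)."""
    independence: float
    specifics: float
    exhaustivity: float
    uniqueness: float


def priority_b(scores, weights=(0.25, 0.25, 0.25, 0.25)):
    """Priority(X; C) = W1*Independence + W2*Specifics + W3*Exhaustivity + W4*Uniqueness.

    The weights are fixed in advance; the equal defaults are an assumption.
    """
    w1, w2, w3, w4 = weights
    return (
        w1 * scores.independence
        + w2 * scores.specifics
        + w3 * scores.exhaustivity
        + w4 * scores.uniqueness
    )


def rank_candidates(candidates, weights=(0.25, 0.25, 0.25, 0.25)):
    """Return candidate names ordered from highest to lowest priority."""
    return sorted(
        candidates,
        key=lambda name: priority_b(candidates[name], weights),
        reverse=True,
    )
```

Because the weights are determined in advance, changing them reorders the displayed candidates without recomputing any index.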
16. A non-transitory computer-readable storage medium storing a data classifier program, executable by a computer including a processor and a storage unit, which refers to a classified hierarchy establishing a parent-child relationship between a plurality of classifications to produce a plurality of classification axes based on classifications and data groups input into the data classifier program, said data classifier program comprising:
a basic category accumulation process which accumulates classifications serving as basic categories on which basis desired classifications are selected;
a classification axis candidate reduction process which selects a plurality of classifications from among classifications descendant from each basic category in consideration of semantic relevancy therebetween, to produce a plurality of classification axis candidates based on data quantity of classifications with reference to a common ancestor among classifications in the classified hierarchy; and
a priority calculation process which calculates priority in displaying the plurality of classification axis candidates with reference to hierarchical distances, each representing a length of a path reaching the common ancestor among classifications associated with classification axis candidates, wherein one of:
(a) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = \frac{1}{\mathrm{Max}(X)} \times \frac{1}{2 \times N} \times \sum \mathrm{ComDist}(c_i, c_j),$$
where Max(X) denotes a depth of a deepest classification among classifications descending from the basic category X; N denotes a number of the plurality of classifications; and ComDist($c_i$, $c_j$) denotes a distance between classifications $c_i$ and $c_j$ selected from among the plurality of classifications; and
(b) the priority concerning a basic category X related to a plurality of classifications C in the classified hierarchy is calculated by
$$\mathrm{Priority}(X; C) = W_1 \times \mathrm{Independence}(X; C) + W_2 \times \mathrm{Specifics}(X; C) + W_3 \times \mathrm{Exhaustivity}(X; C) + W_4 \times \mathrm{Uniqueness}(X; C),$$
where $W_1$, $W_2$, $W_3$, and $W_4$ denote weight coefficients to indexes which are determined in advance.
Dependent claims: 17, 18, 19, 20
Specification