Information processing method, information processing apparatus, and storage medium
First Claim
1. An information processing method for generating a classification tree, which is a recognition dictionary used for character recognition, comprising:
- a division step of dividing a predetermined training stroke into a plurality of segments;
a vector quantization step of performing vector quantization of said strokes in said segments obtained at said division step;
a layered stroke vector generation step of synthesizing adjacent strokes of said segments, obtained at said division step, to obtain stroke sets to generate upper rank stroke vectors, and of producing a layered vector series; and
a classification tree generation step of selecting a stroke vector, for which a predetermined entropy function is the greatest, from upper rank stroke vectors in said layered stroke vector series that is obtained at said layered stroke vector generation step, and of developing said stroke vector to produce lower rank stroke vectors to generate a classification tree.
1 Assignment
0 Petitions
Accused Products
Abstract
This relates to a classification generation method whereby, in order to efficiently and accurately recognize a pattern having a large number of characteristics, a pattern classification tree is generated, with which a macro structural characteristic of a pattern is appropriately reflected and a competitive relationship between categories is adequately reflected, and to a method for recognizing an input pattern by using the generated classification tree. When an input pattern is formed using strokes, a training stroke is divided into a plurality of segments, and vector quantization is performed for the strokes in the segments. Among the quantized strokes in the segments, adjacent stroke sets are synthesized to repetitively generate upper rank stroke vectors. A stroke vector for which a predetermined entropy function is maximized is selected from the upper rank stroke vectors in a layered stroke vector series, and development is performed extending down into the lower rank stroke vector sets. As a result, a classification tree is prepared.
39 Citations
74 Claims
-
1. An information processing method for generating a classification tree, which is a recognition dictionary used for character recognition, comprising:
-
a division step of dividing a predetermined training stroke into a plurality of segments; a vector quantization step of performing vector quantization of said strokes in said segments obtained at said division step; a layered stroke vector generation step of synthesizing adjacent strokes of said segments, obtained at said division step, to obtain stroke sets to generate upper rank stroke vectors, and of producing a layered vector series; and a classification tree generation step of selecting a stroke vector, for which a predetermined entropy function is the greatest, from upper rank stroke vectors in said layered stroke vector series that is obtained at said layered stroke vector generation step, and of developing said stroke vector to produce lower rank stroke vectors to generate a classification tree. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 69)
-
-
9. An information processing apparatus for generating a classification tree, which is a recognition dictionary used for character recognition, comprising:
-
division means for dividing a predetermined training stroke into a plurality of segments; vector quantization means for performing vector quantization of said strokes in said segments obtained by said division means; layered stroke vector generation means for synthesizing adjacent strokes of said segments, obtained by said division means, to obtain stroke sets to generate upper rank stroke vectors, and for producing a layered vector series; and classification tree generation means for selecting a stroke vector, for which a predetermined entropy function is the greatest, from upper rank stroke vectors in said layered stroke vector series that is obtained by said layered stroke vector generation means, and for developing said stroke vector to produce lower rank stroke vectors to generate a classification tree. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 70)
-
-
17. A computer accessible memory in which are stored stroke vector classification tree data that are constituted by a stroke vector series extending from a low-level to a high-level layer, said stroke vector classification data comprising:
-
root node production data, leaf node attribute data, an attribute of which is that only a stroke vector series of a single category is present, that include said stroke vector series of said single category, and node attribute data, for plural categories, an attribute of which is that only a stroke vector series for said plurality of categories is present, that include said stroke vector series of said plurality of categories, wherein said stroke vector series, in said low-rank to high-rank layers, is assigned, for each of said layers, to respective nodes for said leaf node attribute data and said node attribute data for plural categories, while said root node attribute data are regarded as a start node of a lowest-level layer; and wherein, when a stroke vector series, composed of a plurality of categories, that is constituted by a plurality of stroke vector series is present in a training stroke vector series at said layers, a stroke vector with which a predetermined entropy function is maximized is selected from a stroke vector series that is included in said node for said node attribute data for plural categories, and a lower rank stroke vector series is developed for said stroke vector that is selected, so that a node at the next layer is acquired that is linked with said node for said node attribute data for plural categories, to which said stroke vector series for plural categories is assigned. - View Dependent Claims (59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 71)
-
-
18. An information processing method for generating a layered pattern comprising:
-
a first degenerated pattern generation step of calculating, for a pattern divided into a plurality of segments, a logical sum for data of adjacent segments among said plurality of segments, and of generating a first degenerated pattern; a second degenerated pattern generation step of calculating a logical sum for data of adjacent segments among a plurality of segments of said first degenerated pattern obtained at said first degenerated pattern generation step, and of generating a second degenerated pattern; and a repetition step of repeating a process at said second degenerated pattern generation step a predetermined n times, based on said second degenerated pattern obtained at said second degenerated pattern generation step, and of generating a layered pattern of n-1 layers. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 72)
-
-
28. An information processing apparatus for generating a layered pattern comprising:
-
first degenerated pattern generation means for calculating, for a pattern divided into a plurality of segments, a logical sum for data of adjacent segments among said plurality of segments, and for generating a first degenerated pattern; second degenerated pattern generation means for calculating a logical sum for data of adjacent segments among a plurality of segments of said first degenerated pattern obtained by said first degenerated pattern generation means, and for generating a second degenerated pattern; and repetition means for repeating a process performed by said second degenerated pattern generation means a predetermined n times, based on said second degenerated pattern obtained by said second degenerated pattern generation means, and for generating a layered pattern of n-1 layers. - View Dependent Claims (29, 30, 31, 32, 33, 34, 35, 36, 37, 73)
-
-
38. A computer program product having a storage medium in which is stored computer accessible program code for generating a layered character pattern, said program code including:
-
program code for calculating, for a character pattern divided into a plurality of segments, a logical sum for data of adjacent segments among said plurality of segments, and for generating a first degenerated character pattern; program code for calculating a logical sum for data of adjacent segments among a plurality of segments of said first degenerated character pattern, and for generating a second degenerated character pattern; and program code for repeating a process for generating said second degenerated character pattern a predetermined n times based on said second degenerated character pattern, and for generating a layered character pattern of n-1 layers. - View Dependent Claims (74)
-
Specification