Apparatus, method, and program for clustering phonemic models
First Claim
1. A clustering apparatus comprising:
- an input unit configured to input at least one phonemic model attached with determination information indicating a small amount of speech data for training, and at least one phonemic model not attached with the determination information;
a node initializing unit configured to generate a node including the inputted phonemic models as a root node of a tree structure;
a candidate generating unit configured to generate candidates of a pair of child sets for a node having no child node among nodes in the tree structure, by partitioning a set of phonemic models included in the node into two;
a candidate deleting unit configured to delete a candidate including the two child sets at least one of which includes only phonemic models attached with the determination information, from the generated candidates;
a similarity calculating unit configured to calculate a similarity among the phonemic models included in each of the two child sets included in each of the candidates other than the deleted candidates, and calculates a sum of the similarities calculated for the child sets;
a candidate selecting unit configured to select one of the candidates having a largest calculated sum;
a node generating unit configured to generate two nodes including two child sets included in the selected candidate, respectively, as child nodes of the node that is a generation source of the selected candidate; and
a clustering unit configured to cluster the phonemic models in units of phonemic model sets included in nodes of the tree structure.
1 Assignment
0 Petitions
Accused Products
Abstract
A node initializing unit generates a root node including inputted phonemic models. A candidate generating unit generates candidates of a pair of child sets by partitioning a set of phonemic models included in a node having no child node into two. A candidate deleting unit deletes candidates each including only phonemic models attached with determination information indicating that at least one of the child sets has a small amount of speech data for training. A similarity calculating unit calculates a sum of similarities among the phonemic models included in the child sets. A candidate selecting unit selects one of the candidates having a largest sum. A node generating unit generates two nodes including the two child sets included in the selected candidate, respectively. A clustering unit clusters the phonemic models in units of phonemic model sets each included in a node.
11 Citations
6 Claims
-
1. A clustering apparatus comprising:
-
an input unit configured to input at least one phonemic model attached with determination information indicating a small amount of speech data for training, and at least one phonemic model not attached with the determination information; a node initializing unit configured to generate a node including the inputted phonemic models as a root node of a tree structure; a candidate generating unit configured to generate candidates of a pair of child sets for a node having no child node among nodes in the tree structure, by partitioning a set of phonemic models included in the node into two; a candidate deleting unit configured to delete a candidate including the two child sets at least one of which includes only phonemic models attached with the determination information, from the generated candidates; a similarity calculating unit configured to calculate a similarity among the phonemic models included in each of the two child sets included in each of the candidates other than the deleted candidates, and calculates a sum of the similarities calculated for the child sets; a candidate selecting unit configured to select one of the candidates having a largest calculated sum; a node generating unit configured to generate two nodes including two child sets included in the selected candidate, respectively, as child nodes of the node that is a generation source of the selected candidate; and a clustering unit configured to cluster the phonemic models in units of phonemic model sets included in nodes of the tree structure. - View Dependent Claims (2, 3, 4)
-
-
5. A clustering method comprising:
-
inputting at least one phonemic model attached with determination information indicating a small amount of speech data for training, and at least one phonemic model not attached with the determination information; generating a node including the inputted phonemic models as a root node of a tree structure; generating candidates of a pair of child sets for a node having no child node among nodes in the tree structure, by partitioning a set of phonemic models included in the node into two; deleting a candidate including the two child sets at least one of which includes only phonemic models attached with the determination information, from the generated candidates; calculating a similarity among the phonemic models included in each of the two child sets included in each of the candidates other than the deleted candidates, and calculating a sum of the similarities calculated for the child sets; selecting one of the candidates having a largest calculated sum; generating two nodes including two child sets included in the selected candidate, respectively, as child nodes of the node that is a generation source of the selected candidate; and clustering the phonemic models in units of phonemic model sets included in nodes of the tree structure.
-
-
6. A program stored on a nontransitory computer readable medium including programmed instructions for clustering phonemic models, wherein the instructions, when executed by a computer, cause the computer to perform:
-
inputting at least one phonemic model attached with determination information indicating a small amount of speech data for training, and at least one phonemic model not attached with the determination information; generating a node including the inputted phonemic models as a root node of a tree structure; generating candidates of a pair of child sets for a node having no child node among nodes in the tree structure, by partitioning a set of phonemic models included in the node into two; deleting a candidate including the two child sets at least one of which includes only phonemic models attached with the determination information, from the generated candidates; calculating a similarity among the phonemic models included in each of the two child sets included in each of the candidates other than the deleted candidates, and calculating a sum of the similarities calculated for the child sets; selecting one of the candidates having a largest calculated sum; generating two nodes including two child sets included in the selected candidate, respectively, as child nodes of the node that is a generation source of the selected candidate; and clustering the phonemic models in units of phonemic model sets included in nodes of the tree structure.
-
Specification