Speech recognition method and apparatus using lexicon group tree
First Claim
Patent Images
1. A method of generating a lexicon group tree, comprising the steps of:
- (a) generating, using at least one processor, a centroid lexicon representing lexicons belonging to a predetermined lexicon group;
(b) selecting two lexicons, having a longest distance therebetween in the lexicon group, using the centroid lexicon from the lexicon group, and branching a node indicating the lexicon group, based on the two selected lexicons; and
(c) selecting a node having low group similarity from among current terminal nodes, including branch nodes, and repeatedly performing steps (a) and (b) on a lexicon group indicated by the selected node,wherein the steps (a) and (b) are repeatedly performed until a variance, indicating group similarity, becomes lower than a predetermined threshold value and/or until the number of lexicons, belonging to a node, decreases to a predetermined number or less, andwherein the step (b) comprises the steps of;
(b1) selecting a first lexicon having a longest distance to the centroid lexicon from the lexicon group;
(b2) selecting a second lexicon having a longest distance to the first lexicon from the lexicon group; and
(b3) bisecting remaining lexicons belonging to the lexicon group, based on the two selected lexicons.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and an apparatus for selecting a vocabulary closest to an input speech from among lexicons stored in memory, wherein a centroid lexicon representing lexicons belonging to a predetermined lexicon group is generated. Two lexicons, having a longest distance therebetween in the lexicon group, are selected using the centroid lexicon from the lexicon group, and a node indicating the lexicon group branches based on the two selected lexicons. A node having low group similarity is selected from among current terminal nodes, including branch nodes, and the above procedure is repeatedly performed on a lexicon group indicated by the selected node.
-
Citations
16 Claims
-
1. A method of generating a lexicon group tree, comprising the steps of:
-
(a) generating, using at least one processor, a centroid lexicon representing lexicons belonging to a predetermined lexicon group; (b) selecting two lexicons, having a longest distance therebetween in the lexicon group, using the centroid lexicon from the lexicon group, and branching a node indicating the lexicon group, based on the two selected lexicons; and (c) selecting a node having low group similarity from among current terminal nodes, including branch nodes, and repeatedly performing steps (a) and (b) on a lexicon group indicated by the selected node, wherein the steps (a) and (b) are repeatedly performed until a variance, indicating group similarity, becomes lower than a predetermined threshold value and/or until the number of lexicons, belonging to a node, decreases to a predetermined number or less, and wherein the step (b) comprises the steps of; (b1) selecting a first lexicon having a longest distance to the centroid lexicon from the lexicon group; (b2) selecting a second lexicon having a longest distance to the first lexicon from the lexicon group; and (b3) bisecting remaining lexicons belonging to the lexicon group, based on the two selected lexicons. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A device for generating a lexicon group tree, comprising:
- at least one processing unit comprising;
a centroid lexicon generation unit, using the at least one processing unit to generate a centroid lexicon representing lexicons belonging to a predetermined lexicon group; a node branching determination unit to select a node having low group similarity from among current terminal nodes; and a node branching unit to select two lexicons, having a longest distance therebetween in the lexicon group, using the centroid lexicon from the lexicon group, and branching a node indicating the lexicon group, based on the two selected lexicons, wherein the node branching is repeatedly performed until a variance, indicating group similarity, becomes lower than a predetermined threshold value and/or until the number of lexicons, belonging to a node, decreases to a predetermined number or less, and wherein the node branching unit selects a first lexicon having a longest distance to the centroid lexicon from the lexicon group, selects a second lexicon having a longest distance to the first lexicon from the lexicon group, and bisects remaining lexicons belonging to the lexicon group, based on the two selected lexicons. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
- at least one processing unit comprising;
Specification