Speech recognition method and apparatus using lexicon group tree
Abstract
A method and an apparatus for selecting a vocabulary closest to an input speech from among lexicons stored in memory, wherein a centroid lexicon representing lexicons belonging to a predetermined lexicon group is generated. Two lexicons, having a longest distance therebetween in the lexicon group, are selected using the centroid lexicon from the lexicon group, and a node indicating the lexicon group branches based on the two selected lexicons. A node having low group similarity is selected from among current terminal nodes, including branch nodes, and the above procedure is repeatedly performed on a lexicon group indicated by the selected node.
28 Claims
1. A method of generating a lexicon group tree, comprising the steps of:
(a) generating a centroid lexicon representing lexicons belonging to a predetermined lexicon group;
(b) selecting two lexicons, having a longest distance therebetween in the lexicon group, using the centroid lexicon from the lexicon group, and branching a node indicating the lexicon group, based on the two selected lexicons; and
(c) selecting a node having low group similarity from among current terminal nodes, including branch nodes, and repeatedly performing steps (a) and (b) on a lexicon group indicated by the selected node.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
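Steps (a) to (c) of claim 1 can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: each lexicon is assumed to be a fixed-length feature vector, the centroid lexicon is taken as the element-wise mean, distance is Euclidean, "low group similarity" is approximated by a large mean distance to the centroid, and the two far-apart lexicons are found by a two-pass heuristic (farthest from the centroid, then farthest from that lexicon). The names `Node`, `split`, `build_tree` and `max_leaves` are hypothetical.

```python
import math
from dataclasses import dataclass


def centroid(group):
    """Step (a): element-wise mean of the vectors in a lexicon group."""
    n = len(group)
    return tuple(sum(v[i] for v in group) / n for i in range(len(group[0])))


@dataclass
class Node:
    lexicons: list
    centroid: tuple = None
    left: "Node" = None
    right: "Node" = None

    def __post_init__(self):
        self.centroid = centroid(self.lexicons)

    def is_leaf(self):
        return self.left is None

    def spread(self):
        """Mean distance to the centroid; a large spread stands in for low group similarity."""
        return sum(math.dist(v, self.centroid) for v in self.lexicons) / len(self.lexicons)


def split(node):
    """Step (b): pick two far-apart lexicons via the centroid and branch the node on them."""
    a = max(node.lexicons, key=lambda v: math.dist(v, node.centroid))
    b = max(node.lexicons, key=lambda v: math.dist(v, a))
    left, right = [], []
    for v in node.lexicons:
        # Each lexicon joins the child whose seed lexicon is nearer.
        (left if math.dist(v, a) <= math.dist(v, b) else right).append(v)
    node.left, node.right = Node(left), Node(right)


def leaves(node):
    if node.is_leaf():
        return [node]
    return leaves(node.left) + leaves(node.right)


def build_tree(lexicons, max_leaves):
    """Step (c): repeatedly branch the terminal node with the lowest group similarity."""
    root = Node(list(lexicons))
    while True:
        current = leaves(root)
        candidates = [n for n in current if len(n.lexicons) > 1 and n.spread() > 0]
        if len(current) >= max_leaves or not candidates:
            return root
        split(max(candidates, key=Node.spread))


lexicons = [(0, 0), (0, 1), (10, 10), (10, 11)]
tree = build_tree(lexicons, max_leaves=2)
groups = sorted(sorted(leaf.lexicons) for leaf in leaves(tree))
```

With the four sample vectors, the single branching separates the two lexicons near the origin from the two near (10, 10). The leaf-count limit is only one possible stopping rule; a threshold on group similarity would fit the same loop.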
11. A method of recognizing speech, comprising the steps of:
(a) segmenting an input acoustic signal into frames;
(b) performing a feature transform on the segmented acoustic signal;
(c) determining similarities between centroid lexicons, representing two branch nodes, and the feature-transformed acoustic signal, and selecting a node having higher similarity;
(d) repeatedly performing step (c) until the selected node is a terminal node; and
(e) loading a lexicon group of the terminal node if the selected node is the terminal node, and selecting a lexicon having higher similarity between the lexicon and the feature-transformed acoustic signal from the loaded lexicon group.
Dependent claims: 12, 13, 14
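Steps (c) to (e) of claim 11 amount to a descent through the binary tree followed by a search within one lexicon group. The sketch below is a hedged illustration only: it assumes the input has already been feature-transformed into a single fixed-length vector, it uses negative Euclidean distance as the similarity measure, and the names `TreeNode`, `similarity` and `recognize` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class TreeNode:
    centroid: tuple
    entries: Optional[list] = None      # (word, vector) pairs; set only on terminal nodes
    left: Optional["TreeNode"] = None
    right: Optional["TreeNode"] = None


def similarity(a, b):
    """Assumed similarity measure: the closer two vectors are, the higher the score."""
    return -math.dist(a, b)


def recognize(root, features):
    node = root
    # Steps (c) and (d): compare the input with both child centroids and follow
    # the more similar branch until a terminal node is reached.
    while node.entries is None:
        node = max((node.left, node.right),
                   key=lambda child: similarity(child.centroid, features))
    # Step (e): only the lexicon group of the reached terminal node is searched.
    word, _ = max(node.entries, key=lambda entry: similarity(entry[1], features))
    return word


root = TreeNode(
    centroid=(5.0, 5.5),
    left=TreeNode(centroid=(0.0, 0.5), entries=[("yes", (0, 0)), ("yet", (0, 1))]),
    right=TreeNode(centroid=(10.0, 10.5), entries=[("no", (10, 10)), ("now", (10, 11))]),
)
result_far = recognize(root, (9.0, 10.2))
result_near = recognize(root, (0.2, 0.9))
```

The point of the structure is that only one terminal group needs to be loaded and scored per utterance, rather than every stored lexicon.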
15. A device for generating a lexicon group tree, comprising:
a centroid lexicon generation unit for generating a centroid lexicon representing lexicons belonging to a predetermined lexicon group;
a node branching determination unit for selecting a node having low group similarity from among current terminal nodes; and
a node branching unit for selecting two lexicons, having a longest distance therebetween in the lexicon group, using the centroid lexicon from the lexicon group, and branching a node indicating the lexicon group, based on the two selected lexicons.
Dependent claims: 16, 17, 18, 19, 20, 21, 22, 23, 24
25. A device for recognizing speech, comprising:
a frame segmentation unit for segmenting an input acoustic signal into frames;
a feature transform unit for performing a feature transform on the segmented acoustic signal;
a node branching determination unit for repeatedly performing a procedure of determining similarities between centroid lexicons, representing two branch nodes, and the feature-transformed acoustic signal and selecting a node having higher similarity until the selected node is a terminal node; and
a lexicon selection unit for loading a lexicon group of the terminal node if the selected node is the terminal node, and selecting a lexicon having higher similarity between the lexicon and the feature-transformed acoustic signal from the loaded lexicon group.
Dependent claims: 26, 27, 28
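The frame segmentation unit and feature transform unit recited in claim 25 can be pictured with a short Python sketch. The claims do not specify a frame length, a hop size, or a particular feature transform, so everything here is an assumption: `frame_len` and `hop` are hypothetical parameters counted in samples, and per-frame log energy is used purely as a stand-in feature.

```python
import math


def segment_frames(samples, frame_len, hop):
    """Cut the signal into overlapping frames; a trailing partial frame is dropped."""
    last_start = len(samples) - frame_len
    return [samples[s:s + frame_len] for s in range(0, last_start + 1, hop)]


def log_energy(frame):
    """Stand-in feature transform: log of the frame energy (small offset avoids log(0))."""
    return math.log(sum(x * x for x in frame) + 1e-10)


signal = [float(n) for n in range(10)]
frames = segment_frames(signal, frame_len=4, hop=2)
features = [log_energy(f) for f in frames]
```

A practical front end would typically produce a multi-dimensional feature vector per frame; the scalar feature here only keeps the example short.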
Specification