Speech recognition apparatus and speech recognition program
First Claim
1. A speech recognition apparatus comprising:
- a storage medium for storing tree structured dictionary data that contains a plurality of words as nodes in a tree structure;
a backward speech comparison unit for comparing a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reversely reproducing a word string sequence toward a root of the tree structure, wherein a comparison is performed in reverse order of the sequence;
a forward speech comparison unit for comparing the input speech with a forward acoustic model corresponding to a speech resulting from reproducing a leaf node word of the tree structure in chronologically forward order; and
an output unit for outputting a word or a word string highly likely matching the input speech based on a comparison result each from the backward speech comparison unit and the forward speech comparison unit.
1 Assignment
0 Petitions
Accused Products
Abstract
A speech recognition apparatus includes a storage medium for storing tree structured dictionary data containing words as nodes in a tree structure with a root and leaf nodes. An input speech is compared with a forward acoustic model corresponding to a speech resulting from chronologically reproducing words indicated by nodes for leaf nodes. A backward speech is further compared with a backward acoustic model. The backward speech is generated by reproducing the input speech in chronologically backward order. The backward acoustic model corresponds to a speech resulting from reproducing a word string toward the root in chronologically backward order. The comparison is performed in the backward order of a sequence starting from one of separator nodes. The speech recognition apparatus thereby outputs a word string that highly likely matches the input speech.
27 Citations
23 Claims
-
1. A speech recognition apparatus comprising:
-
a storage medium for storing tree structured dictionary data that contains a plurality of words as nodes in a tree structure; a backward speech comparison unit for comparing a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reversely reproducing a word string sequence toward a root of the tree structure, wherein a comparison is performed in reverse order of the sequence; a forward speech comparison unit for comparing the input speech with a forward acoustic model corresponding to a speech resulting from reproducing a leaf node word of the tree structure in chronologically forward order; and an output unit for outputting a word or a word string highly likely matching the input speech based on a comparison result each from the backward speech comparison unit and the forward speech comparison unit. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A speech recognition apparatus comprising:
-
a storage medium for storing tree structured dictionary data that contains a plurality of words as nodes in a tree structure; a backward speech comparison unit for comparing a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reproducing, in chronologically backward order, a word string sequence toward a root of the tree structure, wherein a comparison is performed in reverse order of the sequence; a forward speech comparison unit for comparing the input speech with a forward acoustic model corresponding to a speech resulting from chronologically reproducing an intermediate word string, which is a sequence of a word string starting from a leaf node word toward a root and ending with a node other than the root of the tree structure, wherein a comparison is performed in order of the sequence; and an output unit for outputting a word string highly likely matching the input speech based on a comparison result each from the backward speech comparison unit and the forward speech comparison unit. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A speech recognition apparatus comprising:
-
a storage medium for storing tree structured dictionary data and separator specification data, wherein the tree structured dictionary data contains a plurality of words as nodes in a tree structure and the separator specification data specifies a plurality of nodes equivalent to separators for words of the tree structured dictionary data other than a root in the tree structure; a backward speech comparison unit for comparing a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reproducing, in chronologically backward order, a sequence of words or word string that is arranged toward a root of the tree structure and ends with one of the plurality of nodes specified by the separator specification data, wherein a comparison is performed in reverse order of the sequence; and an output unit for outputting a word or a word string highly likely matching the input speech based on a comparison result from the backward speech comparison unit. - View Dependent Claims (18, 19, 20)
-
-
21. A computer program product in a computer-readable medium for use in recognizing a speech, the product comprising:
-
instructions for performing backward speech comparison to compare a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reversely reproducing a word string sequence toward a root of a tree structure for tree structured dictionary data containing a plurality of words as nodes in the tree structure, wherein the comparison is made in reverse order of the sequence; instructions for performing forward speech comparison to compare the input speech with a forward acoustic model corresponding to a speech resulting from reproducing a leaf node word of the tree structure in chronologically forward order; and instructions for outputting a word or a word string highly likely matching the input speech based on a comparison result each from the backward speech comparison and the forward speech comparison.
-
-
22. A computer program product in a computer-readable medium for use in recognizing a speech, the product comprising:
-
instructions for performing backward speech comparison to compare a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reproducing, in chronologically backward order, a word string sequence toward a root of a tree structure for tree structured dictionary data containing a plurality of words as nodes in the tree structure, wherein the comparison is performed in reverse order of the sequence; instructions for performing forward speech to compare the input speech with a forward acoustic model corresponding to a speech resulting from chronologically reproducing an intermediate word string, which is a sequence of a word string starting from a leaf node word toward a root and ending with a node other than the root of the tree structure, wherein the comparison is performed in order of the sequence; and instructions for outputting a word string highly likely matching the input speech based on a comparison result each from the backward speech comparison and the forward speech comparison.
-
-
23. A computer program product in a computer-readable medium for use in recognizing a speech, the product comprising:
-
instructions for reading tree structured dictionary data and separator specification data from a storage medium for storing the data, wherein the tree structured dictionary data contains a plurality of words as nodes in a tree structure and the separator specification data specifies a plurality of nodes equivalent to separators for words of the tree structured dictionary data other than a root in the tree structure; instructions for performing backward speech comparison to compare a backward speech resulting from reproducing an input speech in chronologically backward order with a backward acoustic model corresponding to a speech of reproducing, in chronologically backward order, a sequence of words or word string that is arranged toward a root of the tree structure and ends with one of the plurality of nodes specified by the separator specification data, wherein the comparison is performed in reverse order of the sequence; and instructions for outputting a word or a word string highly likely matching the input speech based on a comparison result from the backward speech comparison.
-
Specification