Method of and apparatus for deriving a plurality of sequences of words from a speech signal
First Claim
1. A method of deriving at least one sequence of words of a predetermined vocabulary from a speech signal, including the following steps:
- a) providing a store of words and reference signals, different words being associated with different sequences of said reference signals,b) deriving test signals from the speech signal,c) comparing the test signals with said different sequences of said reference signals,d) determining words with scores based on the comparisons of test signals and forming a word graph therefrom which contains nodes as word boundaries and words as edges,e) identifying a first path through the word graph with the smallest sum of scores of the sequence of words,characterized in that the method further comrpises the steps of;
f) storing, in order to output further sequences of words having a next-larger sum of scores, the sequence of words on the first path as a basis for a tree having tree nodes at the word boundaries and one end node,g) forming, for each tree node and for the path segment of the word graph which extends to the corresponding graph node, the overall sum of scores which are situated thereon as from the start of the word graph, the path segments with the sequences of words determined thus far being excluded therefrom,h) identifying the sequence of words for which the sum of the scores of the words on a path segment in the word graph as far as the graph node corresponding to a tree node, and from this tree node in the tree, is smallest, the already stored tree being extended by the sequence of words on this path segment, andi) repeating steps g) and h) at least once, andj) outputting a plurality of mutually different sequences of words having progressively higher scores.
1 Assignment
0 Petitions
Accused Products
Abstract
The determination of a plurality of sequences of words from a speech signal with a decreasing probability of correspondence utilizes the best word sequence as a basis and as further word sequences there are determined only those which enclose a part of the best word sequence, that is to say the remainder of these word sequences. To this end, the recognition involves first the formation of a word graph and the best word sequence is separately stored as a tree which initially has one branch only. The word boundaries of this word sequence form nodes in this tree. Because only nodes of this tree have to be taken into account for the next-best word sequences, the calculation is substantially simpler than if the complete word graph were first completely expanded in the form of a tree and completely searched again for each new word sequence.
26 Citations
5 Claims
-
1. A method of deriving at least one sequence of words of a predetermined vocabulary from a speech signal, including the following steps:
-
a) providing a store of words and reference signals, different words being associated with different sequences of said reference signals, b) deriving test signals from the speech signal, c) comparing the test signals with said different sequences of said reference signals, d) determining words with scores based on the comparisons of test signals and forming a word graph therefrom which contains nodes as word boundaries and words as edges, e) identifying a first path through the word graph with the smallest sum of scores of the sequence of words, characterized in that the method further comrpises the steps of; f) storing, in order to output further sequences of words having a next-larger sum of scores, the sequence of words on the first path as a basis for a tree having tree nodes at the word boundaries and one end node, g) forming, for each tree node and for the path segment of the word graph which extends to the corresponding graph node, the overall sum of scores which are situated thereon as from the start of the word graph, the path segments with the sequences of words determined thus far being excluded therefrom, h) identifying the sequence of words for which the sum of the scores of the words on a path segment in the word graph as far as the graph node corresponding to a tree node, and from this tree node in the tree, is smallest, the already stored tree being extended by the sequence of words on this path segment, and i) repeating steps g) and h) at least once, and j) outputting a plurality of mutually different sequences of words having progressively higher scores. - View Dependent Claims (2, 3, 4)
-
-
5. An apparatus for deriving a plurality of sequences of words of a predetermined vocabulary from a speech signal, comprising:
-
a sequence and acoustic recognition system for deriving a word graph based on the speech signal, means for assigning probability scores for each word in the word graph, means for storing and outputting at least one word sequence, based on said word graph, having a minimum sum of scores, a memory for storing the output word sequence in the form of a tree, a processor, including means for receiving the word graph, for deriving from the word graph, after the first word sequence output, only those word sequences having at least one word boundary in common with the already stored word sequences, and excluding sub-sequences of words which start from the beginning of the word graph and at least the last word of which corresponds to a sub-sequence stored in the memory, and storing in said memory from each newly output word sequence only the parts which differ from the word sequences already stored, and means for additionally outputting said word sequences having at least one word boundary in common with the already stored word sequences and excluding said sub-sequences of words.
-
Specification