Process for the recognition of a continuous flow of spoken words
First Claim
1. A process for the recognition of a speech signal derived from a continuous flow of spoken words, which speech signal comprises a temporal sequence of speech values, each of which values specifies a section of the speech signal, comprising:
comparing the speech values with predetermined stored reference values, a group of which reference values represents one word of a predetermined vocabulary for forming an initial evaluation value;
summing the comparison results over various sequences of combinations of reference values and speech values per sequence whose order is permissible in accordance with a predetermined stored first list containing, for predetermined syntactic categories, at least one assignment per category to a combination of further syntactic categories and/or words for forming a cumulative evaluation value;
generating a second list and a third list, the second list including references to the reference values of all those words which are compared with the respective next speech value as well as a sequence number per word, and the third list including, for each speech value which has been compared with the last reference value of at least one word, a plurality of entries, each entry including a current sequence number and:
(a) a reference to a syntactic category of the first list,
(b) a first specification for a sequence of compared words and/or syntactic categories which are assigned to sequences of already compared speech values,
(c) a second specification for a sequence of words and/or syntactic categories which can be assigned to subsequent speech values on the basis of the first list,
(d) a further sequence number assigned to the respective entry,
(e) a first cumulative evaluation value,
(f) a second initial evaluation value and
(g) a sequence of compared words;
determining a new sequence number at least after every comparison of a new speech value with the last reference value of at least one word, and after each such comparison, searching through the group of entries of the third list associated with the sequence number stored in the second list at this word for such entries in which the sequence contained in the second specification begins with the compared word, and deriving a new entry for each such entry present, for the new group of the third list associated with the new sequence number;
making a first further entry in the new group for each new entry in which the abbreviated sequence contained in the second specification begins with a syntactic category for which at least one assignment is present in the first list, and deriving a second further entry for the new group for each of the new and first further entries of the new group for which the second specification contains an empty sequence;
repeating the steps of deriving and making the first and second further entries alternately until, after at least one first further entry, no second further entry occurs;
entering a reference to the reference data of the first word of each entry of the new group in which the second sequence begins with a word to be recognized;
comparing the next speech value with the reference values of all words contained in the second list;
repeating the process steps until the last speech value of the speech signal to be recognized has been processed;
checking the last group of the third list for all entries containing:
a reference to the syntactic initial category, an empty sequence, and a sequence number; and
reading out the sequence of compared words from those entries having the smallest first evaluation value.
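The list handling recited in claim 1 can be read as a chart-style parse in which scores are carried along with each entry. The following is a minimal sketch under that reading, not the patented implementation. The toy grammar, the `Entry` field names, the function names, and the `(word, start_group, end_group, cost)` hypothesis format are all hypothetical. The sketch also assumes that field (g) holds only the words covered by the entry itself, and that the grammar has no empty or left-recursive assignments.

```python
from dataclasses import dataclass

# Hypothetical toy "first list": each syntactic category maps to the
# combinations of categories and/or words it may be assigned to.
GRAMMAR = {
    "S": [("NUM", "UNIT")],
    "NUM": [("one",), ("two",)],
    "UNIT": [("meter",), ("meters",)],
}
START = "S"  # the syntactic initial category


@dataclass(frozen=True)
class Entry:
    category: str      # (a) reference to a syntactic category
    done: tuple        # (b) words/categories already matched
    todo: tuple        # (c) words/categories still expected
    origin: int        # (d) sequence number of the group where the entry began
    cum_value: float   # (e) cumulative evaluation value
    init_value: float  # (f) evaluation value when the entry began
    words: tuple       # (g) words covered by this entry (assumption)


def close_group(groups, k):
    """Alternate the two kinds of further entries until group k stops growing."""
    agenda = list(groups[k])
    while agenda:
        e = agenda.pop()
        derived = []
        if e.todo and e.todo[0] in GRAMMAR:
            # "First further entry": expand the leading syntactic category.
            for rhs in GRAMMAR[e.todo[0]]:
                derived.append(Entry(e.todo[0], (), rhs, k,
                                     e.cum_value, e.cum_value, ()))
        elif not e.todo:
            # "Second further entry": the entry is finished, so advance the
            # entries of its origin group that were waiting for its category.
            for p in list(groups[e.origin]):
                if p.todo and p.todo[0] == e.category:
                    derived.append(Entry(
                        p.category, p.done + (e.category,), p.todo[1:],
                        p.origin,
                        p.cum_value + e.cum_value - e.init_value,
                        p.init_value, p.words + e.words))
        for d in derived:
            if d not in groups[k]:
                groups[k].add(d)
                agenda.append(d)


def recognise(word_hyps, n_groups):
    """Return (words, value) of the best complete parse, or None."""
    groups = [set() for _ in range(n_groups + 1)]
    for rhs in GRAMMAR[START]:
        groups[0].add(Entry(START, (), rhs, 0, 0.0, 0.0, ()))
    close_group(groups, 0)
    for k in range(1, n_groups + 1):
        for word, start, end, cost in word_hyps:
            if end != k:
                continue
            # Search the group at the word's start for entries expecting it.
            for e in list(groups[start]):
                if e.todo and e.todo[0] == word:
                    groups[k].add(Entry(
                        e.category, e.done + (word,), e.todo[1:], e.origin,
                        e.cum_value + cost, e.init_value, e.words + (word,)))
        close_group(groups, k)
    finals = [e for e in groups[n_groups]
              if e.category == START and not e.todo and e.origin == 0]
    if not finals:
        return None
    best = min(finals, key=lambda e: e.cum_value)
    return best.words, best.cum_value


# Hypothetical word hypotheses: (word, start_group, end_group, cost).
hyps = [("one", 0, 1, 2.0), ("two", 0, 1, 0.5),
        ("meter", 1, 2, 1.0), ("meters", 1, 2, 0.25)]
print(recognise(hyps, 2))  # (('two', 'meters'), 0.75)
```

In this sketch the final check mirrors the last two steps of the claim: only entries of the last group that refer to the initial category, expect nothing further, and began at the first group are considered, and the one with the smallest value supplies the word sequence.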
Abstract
Continuous speech recognition assigns predetermined words to syntactic categories and defines the syntactic categories which can follow and precede each predetermined word. The recognition process is achieved by comparing the input sequence of speech signals to reference values and summing those comparison results which are syntactically permissible until they form a valid word. Speech values that follow previously calculated valid words are compared to reference values listed in the syntactic categories which can follow the predetermined word. For each word, values are updated indicating the current word's sequence number, syntax category, cumulative comparison sum, and the current list of compared words. Values are also stored for each word which identify the previous word, the following word and their syntax categories. This process is repeated until all input values have been processed. The results are then checked to verify valid syntax and the words with the closest match are read out.
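The comparing and summing step mentioned in the abstract can be illustrated with a small sketch. The source does not state the distance measure or the alignment rule, so both are assumptions here: a city-block distance between feature vectors, and a dynamic-programming alignment in which each speech value either stays on the same reference value or moves to the next one. The function names are hypothetical.

```python
def frame_distance(speech_value, reference_value):
    """City-block distance between two feature vectors (an assumed metric)."""
    return sum(abs(s - r) for s, r in zip(speech_value, reference_value))


def word_score(speech_values, reference_values):
    """Smallest summed distance when the speech values are matched, in order,
    against all reference values of one word (assumed alignment rule)."""
    inf = float("inf")
    m = len(reference_values)
    # best[j]: smallest sum so far with the latest speech value on reference j
    best = [inf] * m
    for i, s in enumerate(speech_values):
        new = [inf] * m
        for j, r in enumerate(reference_values):
            if i == 0:
                # The first speech value must match the first reference value.
                prev = 0.0 if j == 0 else inf
            else:
                # Either stay on reference j or advance from reference j - 1.
                prev = min(best[j], best[j - 1] if j > 0 else inf)
            new[j] = prev + frame_distance(s, r)
        best = new
    # The word is complete only when the last reference value was reached.
    return best[-1]


# Three speech values matched against a two-value reference word.
print(word_score([[1.0], [1.0], [3.0]], [[1.0], [3.0]]))  # 0.0
```

A value of infinity means the speech values could not be matched against the whole word, for example when there are fewer speech values than reference values.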
13 Claims
Specification