Methods and apparatus for improving the reliability of recognizing words in a large database when the words are spelled or spoken
First Claim
1. A method for identifying any one of a plurality of words using a programmed digital data processing system, each word having an audible form represented by a sequence of spoken speech elements, with each speech element having a respective position in the sequence, the digital data processing system being connected to means for receiving spoken speech elements of a word and interpreting each received speech element,wherein there is a plurality of possible speech elements, each spoken speech element is a speech element α
- , each interpreted speech elements is a speech element β
, and each spoken speech element a may be interpreted as any one of a plurality of different speech elements β
, one of the speech elements β
being the same as speech element α
, said method comprising;
assigning to each of the possible speech elements a respective plurality of probabilities, P.sub.α
β
, that the speech element will be interpreted as a speech element β
when a speech element a has been spoken;
storing data representing each word of the plurality of words, the data for each word including identification of each speech element in the word and identification of the respective position of each speech element in the sequence of speech elements representing the word;
in the means for receiving and interpreting, receiving a sequence of speech elements spoken by a person and representing one of the stored words, and interpreting each speech element of the spoken word and the position of each speech element in the sequence of spoken speech elements; and
comparing the interpreted speech elements with stored data representing each word of the plurality of words and performing a computation, using the probability, P.sub.α
β
, associated with each interpreted speech element β
to identify the word of the plurality of words whose speech elements correspond most closely to interpreted speech elements.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for identifying any one of a plurality of words, each word having an audible form represented by a sequence of spoken speech elements, with each speech element having a respective position in the sequence, which involves: receiving spoken speech elements of a word and interpreting each received speech element, wherein each spoken speech element α may be interpreted as any one of a plurality of different speech elements β, one of the speech elements β being the same as speech element α; assigning to each of the possible speech elements a respective plurality of probabilities, P.sub.αβ, that the speech element will be interpreted as a speech element β when a speech element α has been spoken; storing data representing each word, the data for each word including identification of each speech element in the word and identification of the respective position of each speech element in the sequence of speech elements representing the word; receiving a sequence of speech elements spoken by a person and representing one of the stored words, and interpreting each speech element of the spoken word and the position of each speech element in the sequence of spoken speech elements; and comparing the interpreted speech elements with stored data representing each word of the plurality of words and performing a computation, using the probability, P.sub.αβ, associated with each interpreted speech element β to identify the word whose speech elements correspond most closely to interpreted speech elements.
-
Citations
12 Claims
-
1. A method for identifying any one of a plurality of words using a programmed digital data processing system, each word having an audible form represented by a sequence of spoken speech elements, with each speech element having a respective position in the sequence, the digital data processing system being connected to means for receiving spoken speech elements of a word and interpreting each received speech element,
wherein there is a plurality of possible speech elements, each spoken speech element is a speech element α - , each interpreted speech elements is a speech element β
, and each spoken speech element a may be interpreted as any one of a plurality of different speech elements β
, one of the speech elements β
being the same as speech element α
, said method comprising;assigning to each of the possible speech elements a respective plurality of probabilities, P.sub.α
β
, that the speech element will be interpreted as a speech element β
when a speech element a has been spoken;storing data representing each word of the plurality of words, the data for each word including identification of each speech element in the word and identification of the respective position of each speech element in the sequence of speech elements representing the word; in the means for receiving and interpreting, receiving a sequence of speech elements spoken by a person and representing one of the stored words, and interpreting each speech element of the spoken word and the position of each speech element in the sequence of spoken speech elements; and comparing the interpreted speech elements with stored data representing each word of the plurality of words and performing a computation, using the probability, P.sub.α
β
, associated with each interpreted speech element β
to identify the word of the plurality of words whose speech elements correspond most closely to interpreted speech elements. - View Dependent Claims (2, 3, 4, 5, 6)
- , each interpreted speech elements is a speech element β
-
7. A programmed digital data processing system for identifying any one of a plurality of words, each word having an audible form represented by a sequence of spoken speech elements, with each speech element having a respective position in the sequence, wherein there is a plurality of possible speech elements, each spoken speech element is a speech element α
- , each interpreted speech elements ia speech element β
, and each spoken speech element α
may be interpreted as any one of a plurality of different speech elements β
, one of the speech elements β
being the same as speech element α
, said apparatus comprising;first data storage means for storing, for each of the possible speech elements, a respective plurality of probabilities, P.sub.α
β
, that the speech element will be interpreted as a speech element β
when a speech element α
has been spoken;second data storage means for storing data representing each word of the plurality of words, the data for each word including identification of each speech element in the word and identification of the respective position of each speech element in the sequence of speech elements representing the word; means for receiving a sequence of speech elements spoken by a person and representing one of the stored words, and for interpreting each speech element of the spoken word and the position of each speech element in the sequence of spoken speech elements; and means connected for comparing the interpreted speech elements with stored data representing each word of the plurality of words and performing a computation, using the probability, P.sub.α
β
, associated with each interpreted speech element β
to identify the word of the plurality of words whose speech elements correspond most closely to interpreted speech elements. - View Dependent Claims (8, 9, 10)
- , each interpreted speech elements ia speech element β
-
11. A method for identifying any one of a plurality of words using a programmed digital computing system, each word having an audible form representable by a sequence of speech elements each having a respective position in the sequence, wherein each speech element has at least one identifiable acoustic characteristic and a plurality of the speech elements are substantially identical with respect to the at least one identifiable acoustic characteristic, said method comprising:
-
storing, in the digital computing system, a digital representation corresponding to each of the plurality of words; receiving a sequence of speech elements spoken by a person and representing the audible form of one of the plurality of words, and storing representations of the received speech elements and their respective positions in the spoken sequence; at each position in the spoken sequence, determining each speech element, other than the speech element for which a representation is stored, which is substantially identical to the speech element for which a representation is stored with respect to the at least one identifiable acoustic characteristic, comparing combinations of speech elements for which representations are stored and determined speech elements for a word with stored words; and identifying the stored word for which the comparison produces the best match with one of the combinations of speech elements. - View Dependent Claims (12)
-
Specification