Automatic speech recognition
First Claim
1. A method for recognizing speech from a received signal representing a spoken sequence of one or more words comprising the steps ofreceiving a sequence of frames of acoustic events separated by boundaries,assigning to received frames respective boundary probabilities representative of the degree to which the received frames of speech correspond to stored representations of boundaries between acoustic events,selecting boundary frames based on the boundary probabilities assigned to the frames,using selected boundary frames to generate sequences of one or more words between a first selected boundary frame and a subsequent selected boundary frame, wherein multiple words in any given sequence are separated by one or more selected boundary frames,assigning a score to each generated sequence, andproviding an output corresponding to recognized speech using the sequence of one or more words with the highest assigned score.
6 Assignments
0 Petitions
Accused Products
Abstract
A scheme for recognizing speech represented by a sequence of frames of acoustic events separated by boundaries, according to which the frames of speech are processed to assign to received frames respective boundary probabilities representative of the degree to which the frames of speech correspond to stored representations of boundaries between acoustic events. The assigned boundary probabilities are used in subsequent processing steps to enhance recognition of speech. The assignment of boundary probabilities and further adjustments of the assigned probabilities are preferably conducted by an artificial neural network (ANN).
-
Citations
34 Claims
-
1. A method for recognizing speech from a received signal representing a spoken sequence of one or more words comprising the steps of
receiving a sequence of frames of acoustic events separated by boundaries, assigning to received frames respective boundary probabilities representative of the degree to which the received frames of speech correspond to stored representations of boundaries between acoustic events, selecting boundary frames based on the boundary probabilities assigned to the frames, using selected boundary frames to generate sequences of one or more words between a first selected boundary frame and a subsequent selected boundary frame, wherein multiple words in any given sequence are separated by one or more selected boundary frames, assigning a score to each generated sequence, and providing an output corresponding to recognized speech using the sequence of one or more words with the highest assigned score.
-
23. A speech recognizer for recognizing speech from a received signal representing a spoken sequence of one or more words comprising
a boundary classifier having an input adapted to receive a sequence of frames of acoustic events separated by boundaries, said boundary classifier being adapted to assign to received frames respective boundary probabilities representative of the degree to which the frames of speech correspond to stored representations of boundaries between acoustic events and to select boundary frames based on the boundary probabilities assigned to the frames, a network generator using boundary frames selected by said boundary classifier to generate sequences of one or more words between a first selected boundary frame and a subsequent selected boundary frame, wherein multiple words in any given sequence are separated by one or more selected boundary frames, a sequence classifier assigning a score to each generated sequence, and a processor adapted to provide an output corresponding to recognized speech using the sequence of one or more words with the highest assigned score.
Specification