Speech recognition with language-dependent model vectors
First Claim
1. A method for speaker-dependent speech recognition, comprising:
- capturing a speech signal, including a speech command, of a speaker;
breaking down the speech signal into time frames;
characterizing the speech signal in each captured time frame by forming a corresponding feature vector;
forming a language-independent feature vector sequence from at least one feature vector;
storing the language-independent feature vector sequence;
assigning the language-independent feature vector sequence to a language-dependent sequence of model vectors in a first language resource which includes a multiplicity of language-dependent model vectors;
storing first assignment information which specifies assignment of the language-independent feature vector sequence to the language-dependent sequence of model vectors;
recognizing the speech command which is assigned to the language-dependent sequence of model vectors;
selecting a second language resource different from the first language resource;
assigning the language-independent feature vector sequence previously stored to a language-dependent model vector sequence in the second language resource; and
storing second assignment information regarding said assigning of the language-independent feature vector sequence to the language-dependent model vector sequence in the second language resource.
9 Assignments
0 Petitions
Accused Products
Abstract
Speaker-dependent speech recognition is performed upon detecting a speech signal encompassing a voice command. The speech signal is divided into time frames and characterized in each detected time frame by forming a corresponding property vector. A language-independent feature vector sequence is formed from one or several property vectors and then stored. The language-independent feature vector sequence is allocated to a language-dependent sequence of model vectors in a speech resource having a plurality of model vectors. A piece of allocation information indicating allocation of the language-independent feature vector sequence to a language-dependent sequence of model vectors is stored, then the voice command allocated to the model vector sequence is identified.
30 Citations
12 Claims
-
1. A method for speaker-dependent speech recognition, comprising:
-
capturing a speech signal, including a speech command, of a speaker; breaking down the speech signal into time frames; characterizing the speech signal in each captured time frame by forming a corresponding feature vector; forming a language-independent feature vector sequence from at least one feature vector; storing the language-independent feature vector sequence; assigning the language-independent feature vector sequence to a language-dependent sequence of model vectors in a first language resource which includes a multiplicity of language-dependent model vectors; storing first assignment information which specifies assignment of the language-independent feature vector sequence to the language-dependent sequence of model vectors; recognizing the speech command which is assigned to the language-dependent sequence of model vectors; selecting a second language resource different from the first language resource; assigning the language-independent feature vector sequence previously stored to a language-dependent model vector sequence in the second language resource; and storing second assignment information regarding said assigning of the language-independent feature vector sequence to the language-dependent model vector sequence in the second language resource. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A communication device, comprising:
-
a microphone recording a speech signal, including a speech command, of a speaker; a processor processing the speech signal by breaking down the speech signal into time frames, characterizing the speech signal in each captured time frame by forming a corresponding feature vector and forming a language-independent feature vector sequence from at least one feature vector; a storage unit storing the language-independent feature vector sequence obtained from the speech signal; and a speech recognition entity, coupled to the microphone, configured for at least speaker-dependent speech recognition by assigning the language-independent feature vector sequence to a language-dependent sequence of model vectors in a first language resource which includes a multiplicity of language-dependent model vectors, storing first assignment information which specifies assignment of the language-independent feature vector sequence to the language-dependent sequence of model vectors, recognizing the speech command which is assigned to the language-dependent sequence of model vectors, selecting a second language resource different from the first language resource, assigning the language-independent feature vector sequence previously stored to a language-dependent model vector sequence in the second language resource, and storing second assignment information corresponding thereto. - View Dependent Claims (12)
-
Specification