Apparatus, system and method for voice dialogue activation and/or conduct
Abstract
An apparatus, a system and a method for voice dialogue activation and/or conduct. The apparatus for voice dialogue activation and/or conduct has a voice recognition unit, a speaker recognition unit and a decision-maker unit. The decision-maker unit is designed to activate a result action on the basis of results from the voice and speaker recognition units.
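As a rough illustration of the decision-maker unit described above, the following sketch gates a result action on the outputs of the two recognition units. The names `RecognitionResult`, `PROTECTED_COMMANDS`, and `decide` are hypothetical and do not come from the patent, and the choice of which command words require a known speaker is an assumption made purely for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical set of command words that are only acted on when the
# speaker matches a stored speaker profile (an assumption for this sketch).
PROTECTED_COMMANDS = {"call", "navigate"}


@dataclass
class RecognitionResult:
    # Output of the voice recognition unit; None if no command word was found.
    command_word: Optional[str]
    # Output of the speaker recognition unit; None if the voice signal
    # was attributed to no speaker.
    speaker_id: Optional[str]


def decide(result: RecognitionResult) -> Optional[str]:
    """Return the command whose result action is activated, or None."""
    if result.command_word is None:
        return None
    # For protected command words, activation depends on the speaker
    # having been identified against a stored profile.
    if result.command_word in PROTECTED_COMMANDS and result.speaker_id is None:
        return None
    return result.command_word
```

In this sketch an unprotected command word is acted on regardless of the speaker, which reflects one possible reading of "at least in the case of at least one command word" in the claims.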
35 Claims
1. An apparatus for at least one of voice dialogue activation and voice dialogue conduct, for use in a vehicle, comprising:
at least one input for a voice signal;
a voice recognition unit configured to establish one or more command words contained in the voice signal;
a speaker recognition unit configured to determine a current speaker using the voice signal and at least one stored speaker profile;
a decision-maker unit comprising:
a voice recognition unit connection coupled to an output of the voice recognition unit configured to perform a result action based on the one or more command words, and
a speaker recognition unit connection coupled to the speaker recognition unit,
the decision-maker unit being configured such that the activation of the result action is dependent, at least in the case of at least one command word, on whether the at least one command word has been identified as coming from a speaker associated with a speaker profile; and
an echo cancellation unit that receives a multichannel voice signal and, on the basis of transit time differences among components of the multichannel signal with respect to the at least one input, removes all components from non-authorized speakers,
wherein:
the speaker recognition unit is configured to identify the current speaker by extracting speaker features from the voice signal and comparing the speaker features with stored speaker-dependent features, and comprises a further unit configured for speaker adaptation to continually ascertain refined speaker-dependent features and store the refined speaker-dependent features in the stored speaker profiles, and
the speaker recognition unit is configured to, in the case that a plurality of speakers are speaking simultaneously, attribute the voice signal to no speaker.
Dependent claims: 2-15.
16. A system for voice dialogue activation and/or voice dialogue conduct comprising:
at least one input for a voice signal;
a voice recognition unit configured to establish one or more command words contained in the voice signal;
a speaker recognition unit configured to determine a current speaker using the voice signal and at least one stored speaker profile;
a decision-maker unit comprising:
a voice recognition unit connection coupled to an output of the voice recognition unit configured to perform a result action based on the one or more command words, and
a speaker recognition unit connection coupled to the speaker recognition unit,
the decision-maker unit being configured such that the activation of the result action is dependent, at least in the case of at least one command word, on whether the at least one command word has been identified as coming from a speaker associated with a speaker profile;
at least one microphone coupled to the voice recognition unit; and
at least one loudspeaker coupled to the voice recognition unit; and
an echo cancellation unit that receives a multichannel voice signal and, on the basis of transit time differences among components of the multichannel signal with respect to the at least one input, removes all components from non-authorized speakers,
wherein:
the speaker recognition unit is configured to identify the current speaker by extracting speaker features from the voice signal and comparing the speaker features with stored speaker-dependent features, and comprises a further unit configured for speaker adaptation to continually ascertain refined speaker-dependent features and store the refined speaker-dependent features in the stored speaker profiles, and
the speaker recognition unit is configured to, in the case that a plurality of speakers are speaking simultaneously, attribute the voice signal to no speaker.
Dependent claims: 17, 18.
19. A method for voice dialogue activation and/or conduct comprising:
picking up a voice signal;
recognizing at least one of a command word and a command word structure from the voice signal;
recognizing a speaker using the voice signal and at least one stored speaker profile;
performing a result action based on a recognized command word and a recognized speaker, wherein the voice signal is a multichannel voice signal;
removing, on the basis of transit time differences among components of the multichannel signal with respect to at least one microphone, all components from non-authorized speakers,
wherein recognizing an authorized speaker involves speaker features being extracted from the voice signal and being aligned with individual speaker features stored in a speaker profile,
wherein speaker adaptation is performed which continuously refines and complements the individual speaker features stored in the speaker profile, and
wherein speaker recognition, in the case that a plurality of speakers are speaking simultaneously, attributes the voice signal to no speaker.
Dependent claims: 20-35.
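The claims refer to "transit time differences" among components of a multichannel signal without stating how they are measured. One common way to estimate such a difference between two microphone channels is to find the lag that maximizes their cross-correlation. The sketch below illustrates that idea only. Cross-correlation is a technique swapped in here for illustration, not the method of the patent, and `estimate_delay`, `from_authorized_seat`, `expected_lag`, and `tolerance` are hypothetical names. The sketch classifies a sound source by its delay and does not itself remove any signal components.

```python
from typing import Sequence


def estimate_delay(ref: Sequence[float], other: Sequence[float], max_lag: int) -> int:
    """Return the lag in samples by which `other` trails `ref`.

    A positive result means the sound reached the `ref` microphone first.
    The lag is the one that maximizes the cross-correlation of the channels.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, x in enumerate(ref):
            j = i + lag
            if 0 <= j < len(other):
                score += x * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def from_authorized_seat(lag: int, expected_lag: int, tolerance: int = 1) -> bool:
    """Check whether a measured lag matches the lag expected for the
    authorized speaker's position (a hypothetical calibration value)."""
    return abs(lag - expected_lag) <= tolerance


# Usage: the same short pulse arrives 3 samples later on the second channel.
pulse = [0, 0, 1, 2, 1, 0, 0, 0, 0, 0]
delayed = [0, 0, 0, 0, 0, 1, 2, 1, 0, 0]
lag = estimate_delay(pulse, delayed, max_lag=5)
```

A practical system would likely work on short frames, with many more samples and some normalization of the correlation; the plain double loop is kept here only for readability.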
Specification