Method and device for activating a voice-controlled function in a multi-station network through using both speaker-dependent and speaker-independent speech recognition
First Claim
1. A method for activating a voice-controlled function in a multi-station network by using both speaker-dependent and speaker-independent speech recognition facilities, and conditionally to recognizing one or more items of an applicable vocabulary, driving one or more network parts to activate said function, wherein said method comprises the following steps:
- receiving a station-initiated call containing one or more initial speech items from the vocabulary, executing speaker-independent recognition on said initial speech items through one or more general templates, whilst in a speech recognition improvement procedure, in case of successful ascertainment of what had been actually spoken, storing a particular speaker-specific template derived from the initial speech item so recognized and assigned to an origin of the call in question, said speaker-specific template being cyclically retained for subsequent speaker-dependent recognition of additional speech items having the same origin;
- following said speech recognition improvement procedure, applying speaker-dependent recognition as an initial type of speech recognition if feasible for additional speech items received from the same origin, through one or more particular templates associated to that origin and only subsequently applying speaker-independent recognition as a fallback procedure if the recognition of the additional speech items cannot be ascertained by speaker-dependent recognition, wherein speaker-independent recognition is a first response for new or unidentified users of the voice-controlled function, and speaker-dependent recognition based on said speech recognition improvement procedure is a first response for repeat users of the voice-controlled function, with a reversion to speaker-independent recognition if the additional speech items are not recognized.
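The claim says a speaker-specific template is "cyclically retained" per call origin but does not spell out the mechanism. One plausible reading, offered here only as an assumption, is a fixed-size ring buffer per origin in which the oldest template is overwritten once capacity is reached. The sketch below illustrates that reading; `TemplateStore`, `MAX_TEMPLATES_PER_ORIGIN`, and the string stand-ins for templates are hypothetical names that do not come from the patent.

```python
from collections import defaultdict, deque

# Hypothetical capacity; the claim does not state how many templates are kept.
MAX_TEMPLATES_PER_ORIGIN = 3


class TemplateStore:
    """Per-origin store of (word, template) pairs with ring-buffer eviction."""

    def __init__(self, capacity=MAX_TEMPLATES_PER_ORIGIN):
        # deque(maxlen=...) silently drops the oldest entry when full,
        # which models one interpretation of "cyclically retained".
        self._by_origin = defaultdict(lambda: deque(maxlen=capacity))

    def add(self, origin, word, template):
        """Store a template derived from a successfully recognized item."""
        self._by_origin[origin].append((word, template))

    def templates_for(self, origin):
        """Return the templates for an origin, oldest first."""
        if origin not in self._by_origin:
            return []  # unknown origin: no entry is created as a side effect
        return list(self._by_origin[origin])


store = TemplateStore(capacity=2)
store.add("station-7", "call", "tmpl-a")
store.add("station-7", "stop", "tmpl-b")
store.add("station-7", "call", "tmpl-c")  # evicts the oldest entry
```

With a capacity of two, the third `add` pushes out the first template, so the origin always keeps only its most recent entries.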
1 Assignment
0 Petitions
Abstract
A voice-controlled multi-station network has both speaker-dependent and speaker-independent speech recognition. On condition of recognizing items of an applicable vocabulary, the network executes a particular function. The method receives a call from a particular origin and executes speaker-independent speech recognition on the call. In an improvement procedure, upon successful determination of what has been said, a template associated with the recognized speech item is stored and assigned to the origin. Next, speaker-dependent recognition is applied, if feasible, to speech received from the same origin, using one or more templates associated with that station. Further, a fallback procedure to speaker-independent recognition is maintained for any particular station to cater for failure of the speaker-dependent recognition, while allowing a return to the improvement procedure.
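The recognition order described in the abstract can be sketched as a small dispatch routine: try the origin's own templates first, fall back to the general templates, and store a new speaker-specific template when the fallback succeeds. This is a toy illustration under stated assumptions, not the patented implementation. Templates are modelled as short feature vectors compared by Euclidean distance, and a match against a general template is treated as "successful determination" (the actual system may confirm the result some other way). All names (`GENERAL_TEMPLATES`, `THRESHOLD`, `best_match`, `recognize`) are hypothetical.

```python
import math

# Hypothetical speaker-independent templates: word -> feature vector.
GENERAL_TEMPLATES = {"call": (1.0, 0.0), "stop": (0.0, 1.0)}
THRESHOLD = 0.5  # assumed maximum distance for an acceptable match


def best_match(features, templates):
    """Return the closest word within THRESHOLD, or None if nothing matches."""
    best_word, best_dist = None, THRESHOLD
    for word, vector in templates:
        dist = math.dist(features, vector)
        if dist < best_dist:
            best_word, best_dist = word, dist
    return best_word


def recognize(origin, features, speaker_templates):
    """Speaker-dependent first (if templates exist), then general fallback."""
    own = speaker_templates.get(origin, [])
    if own:  # "if feasible": this origin already has stored templates
        word = best_match(features, own)
        if word is not None:
            return word, "speaker-dependent"
    # Fallback, and also the first response for new or unidentified origins.
    word = best_match(features, GENERAL_TEMPLATES.items())
    if word is not None:
        # Improvement procedure: keep a template assigned to this origin.
        speaker_templates.setdefault(origin, []).append((word, features))
        return word, "speaker-independent"
    return None, "unrecognized"


templates = {}
first = recognize("station-7", (0.9, 0.1), templates)     # new origin
second = recognize("station-7", (0.85, 0.15), templates)  # repeat origin
third = recognize("station-7", (0.1, 0.9), templates)     # new word, fallback
```

In this run the first utterance is handled by the general templates, the second by the template just stored for `station-7`, and the third falls back to the general templates again because the origin has no template for that word yet.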
135 Citations
10 Claims
Specification