Multiple-user voice-based control of devices in an endoscopic imaging system
First Claim
1. A multi-user voice control system for use in an endoscopic imaging system, the multi-user voice control system comprising:
- a first input channel to receive speech of a first user of the endoscopic imaging system;
a second input channel to receive speech of a second user of the endoscopic imaging system, wherein the first and second input channels are formed in a single device;
a selection unit to select the first input channel or the second input channel by applying a selection priority to the first and second input channels, wherein the selection unit comprises a voice activity detector (VAD) module to determine a first signal received on the first input channel when the first user starts speaking exceeds a first threshold and to determine whether a second signal received on the second input channel exceeds a second threshold, wherein the first threshold is less than the second threshold;
an automatic speech recognizer (ASR) to recognize speech received on a channel selected by the channel selector; and
a control unit to enable the multi-user voice control system to control a device in the endoscopic imaging system in response to speech recognized by the ASR.
1 Assignment
0 Petitions
Accused Products
Abstract
A multi-user voice control system for use in endoscopic imaging system includes a first input channel, a second input channel, an automatic speech recognizer (ASR), a control unit, and a selector. The first input channel receives speech of a first user, and the second input channel receives speech of a second user. The ASR recognizes speech received on the first channel and recognizes speech received on the second channel. The control unit enables the voice control system to control a device in the endoscopic imaging system in response to recognized speech. The selector selectively determines whether recognized speech associated with the first channel or recognized speech associated with the second channel is used to control the device, by applying a selection priority to the first and second channels.
36 Citations
34 Claims
-
1. A multi-user voice control system for use in an endoscopic imaging system, the multi-user voice control system comprising:
-
a first input channel to receive speech of a first user of the endoscopic imaging system; a second input channel to receive speech of a second user of the endoscopic imaging system, wherein the first and second input channels are formed in a single device; a selection unit to select the first input channel or the second input channel by applying a selection priority to the first and second input channels, wherein the selection unit comprises a voice activity detector (VAD) module to determine a first signal received on the first input channel when the first user starts speaking exceeds a first threshold and to determine whether a second signal received on the second input channel exceeds a second threshold, wherein the first threshold is less than the second threshold; an automatic speech recognizer (ASR) to recognize speech received on a channel selected by the channel selector; and a control unit to enable the multi-user voice control system to control a device in the endoscopic imaging system in response to speech recognized by the ASR. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A multi-user apparatus for use in an endoscopic imaging system, the apparatus comprising:
-
a first input channel to receive a first signal representing speech of a first user of the endoscopic imaging system; a second input channel to receive a second signal representing speech of a second user of the endoscopic imaging system, wherein the first and second input channels are formed in a single device; means for selecting the first signal and ignoring the second signal when the first signal exceeds a first threshold, and for selecting the second signal when the second signal exceeds a second threshold and the first signal is below the first threshold, wherein the means for selecting comprises a voice activity detection means to determine whether a first signal received on the first input channel when the first user starts speaking exceeds a first threshold and to determine whether a second signal received on the second input channel exceeds a second threshold, wherein the first threshold is less than the second threshold; an automatic speech recognizer (ASR) to recognize speech of the first or second user from a signal selected by the selecting means; and means for controlling a device in the endoscopic imaging system external to the apparatus in response to speech of the first user or the second user recognized by the ASR. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A method of controlling a device in an endoscopic imaging system based on speech, the method comprising:
-
receiving speech of a first user on a first channel and speech of a second user on a second channel, wherein the first and second users are users of the endoscopic imaging system, and the first and second channels are formed in a single device; determining whether speech associated with the first channel or speech associated with the second channel will be used to control the device in the endoscopic imaging system, by applying a prioritization to the first and second channels wherein said determining comprises determining whether a first signal received on the first channel when the first user starts speaking exceeds a first threshold, and determining whether a second signal received on the second channel exceeds a second threshold, wherein the first threshold is less than the second threshold; automatically recognizing speech of the first or second user according to a result of said determining; and using the automatically recognized speech to control the device. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25)
-
-
26. A method comprising:
-
receiving a first signal representing speech of a first user on a first channel and a second signal representing speech of a second user on a second channel, wherein the first and second users are users of an endoscopic imaging system, and the first and second channels are formed in a single device; if the first signal exceeds a first threshold when the first user starts speaking, then enabling automatic speech recognition with respect to the first signal while preventing automatic speech recognition with respect to the second signal; if the second signal exceeds a second threshold while the first signal is below the first threshold, then enabling automatic speech recognition with respect to the second signal, wherein the first threshold is less than the second threshold; and controlling a device in the endoscopic imaging system in response to the recognized speech. - View Dependent Claims (27, 28, 29, 30, 31)
-
-
32. A method of operating a voice control system (VCS) for controlling a voice-controllable device in an endoscopic imaging system, the method comprising:
-
receiving at the VCS a first signal for conveying speech of a first user on a first channel, wherein the first user is a user of the endoscopic imaging system; receiving at the VCS a second signal for conveying speech of a second user on a second channel, wherein the second user is a user of the endoscopic imaging system, and the first and second channels are formed in a single device; buffering a sliding segment of the first signal and a sliding segment of the second signal; detecting when the first signal exceeds a first threshold and detecting when the second signal exceeds a second threshold; in response to the first signal exceeding the first threshold when the first user starts speaking, enabling automatic speech recognition to be performed with respect to the first signal, including a leading segment and a trailing segment of the first signal which are below the first threshold, in the VCS, while preventing automatic recognition from being performed with respect to the second signal; in response to the second signal exceeding the second threshold while the first signal is below the first threshold, enabling automatic speech recognition to be performed with respect to the second signal, including a leading segment and a trailing segment of the second signal which are below the second threshold, in the VCS, wherein the first threshold is less than the second threshold; and using recognized speech associated with the first or second signal to control the voice-controllable device. - View Dependent Claims (33, 34)
-
Specification