System and method for temporal and power based zone detection in speaker dependent microphone environments
First Claim
Patent Images
1. A computer-implemented method comprising:
- receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones;
determining temporal cue based confidence for at least a portion of the plurality of microphone zones;
determining power cue based confidence for the at least the portion of the plurality of microphone zones, wherein the at least the portion includes at least two zones of the plurality of zones;
identifying a microphone zone of the plurality of microphone zones from which the speech signal originates from the speaker, based upon, at least in part, a combination of the temporal cue based confidence determined for the at least the portion of the plurality of microphone zones and the power cue based confidence determined for the at least the portion of the plurality of microphone zones; and
using the speech signal from the identified microphone zone as an output signal in a speech system;
wherein the identifying includes comparing the temporal cue based confidence and the power cue based confidence, and wherein the identifying further includes selecting the temporal cue based confidence to identify the microphone zone when the temporal cue based confidence is higher than the power cue based confidence.
7 Assignments
0 Petitions
Accused Products
Abstract
A method, computer program product, and computer system for receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones. A temporal cue based confidence may be determined for at least a portion of the plurality of microphone zones. A power cue based confidence may be determined for at least a portion of the plurality of microphone zones. A microphone zone of the plurality of microphone zones from which to use an output signal of the speaker may be identified based upon, at least in part, a combination of the temporal cue based confidence and the power cue based confidence.
-
Citations
15 Claims
-
1. A computer-implemented method comprising:
-
receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones; determining temporal cue based confidence for at least a portion of the plurality of microphone zones; determining power cue based confidence for the at least the portion of the plurality of microphone zones, wherein the at least the portion includes at least two zones of the plurality of zones; identifying a microphone zone of the plurality of microphone zones from which the speech signal originates from the speaker, based upon, at least in part, a combination of the temporal cue based confidence determined for the at least the portion of the plurality of microphone zones and the power cue based confidence determined for the at least the portion of the plurality of microphone zones; and using the speech signal from the identified microphone zone as an output signal in a speech system; wherein the identifying includes comparing the temporal cue based confidence and the power cue based confidence, and wherein the identifying further includes selecting the temporal cue based confidence to identify the microphone zone when the temporal cue based confidence is higher than the power cue based confidence. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A computer program product residing on a non-transitory computer readable storage medium having a plurality of instructions stored thereon which, when executed across one or more processors, causes at least a portion of the one or more processors to perform operations comprising:
-
receiving a speech signal from a speaker via a plurality of microphone zones; determining temporal cue based confidence for at least a portion of the plurality of microphone zones; determining power cue based confidence for the at least the portion of the plurality of microphone zones, wherein the at least the portion includes at least two zones of the plurality of zones; identifying a microphone zone of the plurality of microphone zones from which the speech signal originates from the speaker, based upon, at least in part, a combination of the temporal cue based confidence determined for the at least the portion of the plurality of microphone zones and the power cue based confidence determined for the at least the portion of the plurality of microphone zones; and using the speech signal from the identified microphone zone as an output signal in a speech system; wherein the identifying includes comparing the temporal cue based confidence and the power cue based confidence, and wherein the identifying further includes selecting the temporal cue based confidence to identify the microphone zone when the temporal cue based confidence is higher than the power cue based confidence. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A computing system including one or more processors and one or more memories configured to perform operations comprising:
-
receiving a speech signal from a speaker via a plurality of microphone zones; determining temporal cue based confidence for at least a portion of the plurality of microphone zones; determining power cue based confidence for the at least the portion of the plurality of microphone zones, wherein the at least the portion includes at least two zones of the plurality of zones; identifying a microphone zone of the plurality of microphone zones from which the speech signal originates from the speaker, based upon, at least in part, a combination of the temporal cue based confidence determined for the at least the portion of the plurality of microphone zones and the power cue based confidence determined for the at least the portion of the plurality of microphone zones; and using the speech signal from the identified microphone zone as an output signal in a speech system; wherein the identifying includes comparing the temporal cue based confidence and the power cue based confidence, and wherein the identifying further includes selecting the temporal cue based confidence to identify the microphone zone when the temporal cue based confidence is higher than the power cue based confidence. - View Dependent Claims (12, 13, 14, 15)
-
Specification