Filtering audio-based interference from voice commands using natural language processing
First Claim
1. A computer-implemented method, comprising:
- receiving a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user, wherein the at least one interfering audio signal is background noise;
converting the intended audio signal and the at least one interfering audio signal into text which represents a plurality of words included in the complex audio signal;
identifying at least some of the text as representing words which correspond to the at least one interfering audio signal;
discarding the identified text;
evaluating a remaining portion of the text to determine whether the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in a predetermined range; and
outputting the remaining portion of the text in response to determining that the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in the predetermined range.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method, according to one embodiment, includes: receiving a complex audio signal which includes an intended audio signal and at least one interfering audio signal. The complex audio signal is converted into text which represents a plurality of words included in the complex audio signal, and at least some of the text is identified as representing words which correspond to the at least one interfering audio signal. The identified text is discarded, and a remaining portion of the text is evaluated to determine whether the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in a predetermined range. Furthermore, the remaining portion of the text is output in response to determining that the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in the predetermined range.
30 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
receiving a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user, wherein the at least one interfering audio signal is background noise; converting the intended audio signal and the at least one interfering audio signal into text which represents a plurality of words included in the complex audio signal; identifying at least some of the text as representing words which correspond to the at least one interfering audio signal; discarding the identified text; evaluating a remaining portion of the text to determine whether the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in a predetermined range; and outputting the remaining portion of the text in response to determining that the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in the predetermined range. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions readable and/or executable by a processor to cause the processor to perform a method comprising:
-
receiving, by the processor, a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user, wherein the at least one interfering audio signal is background noise; converting, by the processor, the intended audio signal and the at least one interfering audio signal into text which represents a plurality of words included in the complex audio signal; identifying, by the processor, at least some of the text as representing words which correspond to the at least one interfering audio signal; discarding, by the processor, the identified text; evaluating, by the processor, a remaining portion of the text to determine whether the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in a predetermined range; and outputting, by the processor, the remaining portion of the text in response to determining that the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in the predetermined range. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A system, comprising:
-
a processor; and logic integrated with the processor, executable by the processor, or integrated with and executable by the processor, the logic being configured to; receive, by the processor, a complex audio signal, wherein the complex audio signal includes an intended audio signal and at least one interfering audio signal, wherein the intended audio signal is a voice-based command originating from a user, wherein the at least one interfering audio signal is background noise; convert, by the processor, the intended audio signal and the at least one interfering audio signal into text which represents a plurality of words included in the complex audio signal; identify, by the processor, at least some of the text as representing words which correspond to the at least one interfering audio signal; discard, by the processor, the identified text; evaluate, by the processor, a remaining portion of the text to determine whether the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in a predetermined range; and output, by the processor, the remaining portion of the text in response to determining that the remaining portion of the text represents words which convey the voice-based command at an accuracy that is in the predetermined range, wherein outputting the remaining portion of the text includes; selecting a known command which matches the remaining portion of the text most closely, and outputting the known command. - View Dependent Claims (18, 19, 20)
-
Specification