Audio firewall
First Claim
1. An audio firewall system comprising:
- a microphone configured to generate audio data;
a speech-to-text engine configured to convert the audio data to text data prior to detecting a service wake word;
a service engine configured to detect the service wake word in the text data after parsing the text data for the service wake word and corresponding content data, the service wake word identifying one of a local security system and a remote assistant server;
a text-to-speech engine configured to convert the text data comprising the service wake word and the corresponding content data to converted audio data;
an audio anonymizer coupled between the microphone and the speech-to-text engine, the audio anonymizer configured to adjust at least one of a pitch and a speed of the audio data, and to provide the adjusted audio data to the speech-to-text engine;
a remote service interface configured to provide the converted audio data to the remote assistant server; and
a local security system interface configured to provide the content data to the local security system.
2 Assignments
0 Petitions
Accused Products
Abstract
An audio firewall system has a microphone that generates audio data. A speech-to-text engine converts the audio data to text data. The text data is parsed for a service wake word and corresponding content data. The service wake word identifies one of a local security system and a remote assistant server. A text-to-speech engine converts the service wake word and the corresponding content data to converted audio data. The converted audio data is provided to the remote assistant server. The content data is provided to the local security system. The audio firewall system receives a response from the remote assistant server or the local security system and outputs an audio signal corresponding to the response.
23 Citations
18 Claims
-
1. An audio firewall system comprising:
-
a microphone configured to generate audio data; a speech-to-text engine configured to convert the audio data to text data prior to detecting a service wake word; a service engine configured to detect the service wake word in the text data after parsing the text data for the service wake word and corresponding content data, the service wake word identifying one of a local security system and a remote assistant server; a text-to-speech engine configured to convert the text data comprising the service wake word and the corresponding content data to converted audio data; an audio anonymizer coupled between the microphone and the speech-to-text engine, the audio anonymizer configured to adjust at least one of a pitch and a speed of the audio data, and to provide the adjusted audio data to the speech-to-text engine; a remote service interface configured to provide the converted audio data to the remote assistant server; and a local security system interface configured to provide the content data to the local security system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method comprising:
-
generating audio data with a microphone of an audio firewall system; converting the audio data to text data with a speech-to-text engine prior to detecting a service wake word; detecting the service wake word in the text data after parsing the text data for the service wake word and corresponding content data, the service wake word identifying one of a local security system and a remote assistant server; converting the text data comprising the service wake word and the corresponding content data to converted audio data using a text-to-speech engine; adjusting at least one of a pitch and a speed of the audio data; providing the adjusted audio data to the speech-to-text engine; providing the converted audio data to the remote assistant server; and providing the content data to the local security system. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A machine-storage medium storing instructions that, when executed by one or more processors of a machine, cause the one or more processors to perform operations comprising:
-
generating audio data with a microphone of an audio firewall system; converting the audio data to text data prior to detecting a service wake word; detecting the service wake word in the text data after parsing the text data for the service wake word and corresponding content data, the service wake word identifying one of a local security system and a remote assistant server; converting the text data comprising the service wake word and the corresponding content data to converted audio data using a text-to-speech engine; adjusting at least one of a pitch and a speed of the audio data; providing the adjusted audio data to the speech-to-text engine; providing the converted audio data to the remote assistant server; and providing the content data to the local security system.
-
Specification