Dynamic switching between local and remote speech rendering
First Claim
1. A computer-implemented method of running an application on a computing end device defining a host, the application being distributed from a server to the host, the method comprising:
- obtaining the application from the server;
- executing a multi-modal browser on the host to run the application, the application employing both text-to-speech (TTS) processing and automatic speech recognition (ASR) processing;
- analyzing, by the host, an instruction in the application that instructs the host to perform the TTS processing and/or ASR processing locally or remotely;
- determining, by the host, whether the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, wherein:
  - if the instruction instructs the host to perform the TTS processing and the ASR processing locally, determining, by the host, whether the host supports performing the TTS processing and the ASR processing locally, and
  - if the instruction instructs the host to perform the TTS processing locally and the ASR processing remotely, determining, by the host, whether the host supports performing the TTS processing locally and the ASR processing remotely;
- if the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, executing the application using the TTS processing and/or the ASR processing in accordance with the instruction; and
- generating an error indication if the host is not capable of performing the TTS processing and/or ASR processing in accordance with the instruction.
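The flow recited in this claim (analyze the instruction, check host capability, then either execute or generate an error indication) can be illustrated with a minimal Python sketch. Every name below (`Location`, `SpeechInstruction`, `HostCapabilities`, `UnsupportedConfigurationError`, `run_application`) is hypothetical and chosen only for illustration; the claim does not prescribe any data model, and raising an exception is just one assumed way to produce an error indication.

```python
from dataclasses import dataclass
from enum import Enum


class Location(Enum):
    LOCAL = "local"
    REMOTE = "remote"


@dataclass(frozen=True)
class SpeechInstruction:
    """Hypothetical parsed form of the instruction carried by the application."""
    tts: Location
    asr: Location


@dataclass(frozen=True)
class HostCapabilities:
    """Hypothetical description of which speech configurations the host supports."""
    local_tts: bool
    local_asr: bool
    remote_tts: bool
    remote_asr: bool

    def supports(self, instruction: SpeechInstruction) -> bool:
        # Check TTS and ASR separately against the location the instruction asks for.
        tts_ok = self.local_tts if instruction.tts is Location.LOCAL else self.remote_tts
        asr_ok = self.local_asr if instruction.asr is Location.LOCAL else self.remote_asr
        return tts_ok and asr_ok


class UnsupportedConfigurationError(RuntimeError):
    """Assumed stand-in for the claim's 'error indication'."""


def run_application(instruction: SpeechInstruction, host: HostCapabilities) -> str:
    # Determine whether the host can honor the instruction before executing anything.
    if not host.supports(instruction):
        raise UnsupportedConfigurationError(
            f"host cannot perform TTS {instruction.tts.value} / ASR {instruction.asr.value}"
        )
    # Placeholder for executing the application with the requested configuration.
    return f"running with TTS={instruction.tts.value}, ASR={instruction.asr.value}"
```

In this sketch, a host with only local engines accepts an instruction of local TTS and local ASR, while an instruction of local TTS and remote ASR raises `UnsupportedConfigurationError`.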
Abstract
A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.
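As a rough illustration of the two bases for the determination named in the abstract (host resources and a processing instruction in the application), the following Python sketch selects a speech processing configuration. The function name `choose_configuration`, the dictionary shape, and in particular the rule that an instruction takes precedence over host resources are assumptions made for this example; the abstract does not state how the two bases are combined.

```python
from typing import Dict, Optional


def choose_configuration(
    has_local_tts: bool,
    has_local_asr: bool,
    instruction: Optional[Dict[str, str]] = None,
) -> Dict[str, str]:
    """Pick where TTS and ASR run (hypothetical helper).

    Assumption: when the application supplies a processing instruction,
    it is used as given; otherwise the choice falls back to what the
    host's resources allow.
    """
    if instruction is not None:
        # Instruction-based determination: copy so the caller's dict is not shared.
        return dict(instruction)
    # Resource-based determination: prefer local engines when the host has them.
    return {
        "tts": "local" if has_local_tts else "remote",
        "asr": "local" if has_local_asr else "remote",
    }
```

For example, a host with a local TTS engine but no local ASR engine, given no instruction, would get local TTS and remote ASR under these assumptions.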
56 Citations
12 Claims
1. A computer-implemented method of running an application on a computing end device defining a host, the application being distributed from a server to the host, the method comprising:
- obtaining the application from the server;
- executing a multi-modal browser on the host to run the application, the application employing both text-to-speech (TTS) processing and automatic speech recognition (ASR) processing;
- analyzing, by the host, an instruction in the application that instructs the host to perform the TTS processing and/or ASR processing locally or remotely;
- determining, by the host, whether the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, wherein:
  - if the instruction instructs the host to perform the TTS processing and the ASR processing locally, determining, by the host, whether the host supports performing the TTS processing and the ASR processing locally, and
  - if the instruction instructs the host to perform the TTS processing locally and the ASR processing remotely, determining, by the host, whether the host supports performing the TTS processing locally and the ASR processing remotely;
- if the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, executing the application using the TTS processing and/or the ASR processing in accordance with the instruction; and
- generating an error indication if the host is not capable of performing the TTS processing and/or ASR processing in accordance with the instruction.
Dependent claims: 2, 3, 4
5. A non-transitory computer readable storage medium encoded with a plurality of instructions that, when executed on at least one processor, perform a method of running an application on an end device defining a host, the application being distributed from a server to the host, the method comprising:
- obtaining the application from the server;
- executing a multi-modal browser on the host to run the application, the application employing both text-to-speech (TTS) processing and automatic speech recognition (ASR) processing;
- analyzing, by the host, an instruction in the application that instructs the host to perform the TTS processing and/or ASR processing locally or remotely;
- determining, by the host, whether the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, wherein:
  - if the instruction instructs the host to perform the TTS processing and the ASR processing locally, determining, by the host, whether the host supports performing the TTS processing and the ASR processing locally, and
  - if the instruction instructs the host to perform the TTS processing locally and the ASR processing remotely, determining, by the host, whether the host supports performing the TTS processing locally and the ASR processing remotely;
- if the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, executing the application using the TTS processing and/or the ASR processing in accordance with the instruction; and
- generating an error indication if the host is not capable of performing the TTS processing and/or ASR processing in accordance with the instruction.
Dependent claims: 6, 7, 8
9. A host device for running an application that is distributed from a server to the host over at least one network, comprising:
- communication means for communicating over the at least one network, the communication means capable of receiving the application over the at least one network; and
- at least one computer coupled to the communication means, the at least one computer programmed to:
  - execute a multi-modal browser on the host to run the application, the application employing both text-to-speech (TTS) processing and automatic speech recognition (ASR) processing;
  - analyze an instruction in the application that instructs the host to perform the TTS processing and/or ASR processing locally or remotely;
  - determine whether the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, wherein:
    - if the instruction instructs the host to perform the TTS processing and the ASR processing locally, the host determines whether the host supports performing the TTS processing and the ASR processing locally, and
    - if the instruction instructs the host to perform the TTS processing locally and the ASR processing remotely, the host determines whether the host supports performing the TTS processing locally and the ASR processing remotely;
  - if the host is capable of performing the TTS processing and/or ASR processing in accordance with the instruction, execute the application using the TTS processing and/or the ASR processing in accordance with the instruction; and
  - generate an error indication if the host is not capable of performing the TTS processing and/or ASR processing in accordance with the instruction.
Dependent claims: 10, 11, 12
Specification