Method and system for gathering information by voice input
Abstract
The present invention allows users to navigate in a Web application or Web pages using a combination of point-and-click and voice-input. At each point of the dialog, the user can use the standard point-and-click interface to perform context-dependent actions, or alternatively, use speech input to navigate and operate in the global application context. The voice input uses a voice navigation component which builds an interface to the installed recognition and synthesis engines. The point-and-click and the voice navigation components can be loaded automatically with the initial Web page of a Web application. Grammars for recognizing vocabulary related to that Web application will be provided with the voice navigation component. The present invention combines the advantages of a context-dependent point-and-click user interface with those of a context-independent speech-input interface. Accordingly, a multi-modal interface can be provided to a Web browser.
29 Claims
1. A client system for gathering information via a network by voice input comprising:
a speech recognition engine installed on said client system;
a communication component installed on said client system configured to establish communications with a communication component on a server system which provides access to information stored on said server; and
a voice navigation component configured to provide information-dependent grammars from said server to said speech recognition engine via said communication component based on initial information loaded from said server to said client and configured to process results of said speech recognition system.
Dependent claims: 2, 3, 4, 5, 6
7. A client-server system comprising:
a client having a speech recognition engine and a speech synthesis engine, a client communication component configured to establish communications with a server, and a voice navigation component configured to provide information-dependent grammars from said server to said speech recognition engine via said client communication component based on initial information loaded from said server to said client and further configured to process results of said speech recognition engine; and
a server having a server communication component configured to establish communication with a client, a voice navigation component configured to provide information-dependent grammars from said server to said speech recognition engine based on said initial information and further configured to process said results of said speech recognition engine, wherein said voice navigation component is available for download to and execution on said client, and said information-dependent grammars are available for download to and execution on said client.
8. A method for gathering information via a network by voice input comprising:
loading an initial information from a server in a client using a communication component;
automatically loading an information-dependent grammar in said client by using access information contained in said initial information and automatically providing said information-dependent grammar to a speech recognition engine disposed in said client for recognizing spoken words defined by said information-dependent grammar;
sending results of said speech recognition engine to a voice navigation component; and
processing results of said speech recognition engine in said voice navigation component.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29
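As an illustration only, the four steps of the method claim can be sketched in Python. Nothing here comes from the patent's specification: the in-memory `SERVER` table, the `fetch` helper, the `grammar_url` field, and both class names are hypothetical stand-ins, and the "recognizer" matches typed text against the loaded phrases rather than decoding audio.

```python
from typing import Dict, Iterable, Optional

# Hypothetical in-memory "server": URL -> document. A real system would use HTTP.
SERVER: Dict[str, dict] = {
    "/index": {"content": "Welcome", "grammar_url": "/grammars/site"},
    "/news": {"content": "News page", "grammar_url": "/grammars/site"},
    "/weather": {"content": "Weather page", "grammar_url": "/grammars/site"},
    "/grammars/site": {
        "phrases": {"go to news": "/news", "go to weather": "/weather"}
    },
}


def fetch(url: str) -> dict:
    """Stand-in for the communication component (assumed, not from the patent)."""
    return SERVER[url]


class SpeechRecognitionEngine:
    """Toy client-side engine: accepts only phrases in the loaded grammar."""

    def __init__(self) -> None:
        self.phrases: set = set()

    def load_grammar(self, phrases: Iterable[str]) -> None:
        self.phrases = set(phrases)

    def recognize(self, utterance: str) -> Optional[str]:
        # Text matching stands in for acoustic decoding.
        text = utterance.strip().lower()
        return text if text in self.phrases else None


class VoiceNavigationComponent:
    """Loads grammars named by the initial information and processes results."""

    def __init__(self, engine: SpeechRecognitionEngine) -> None:
        self.engine = engine
        self.actions: Dict[str, str] = {}
        self.current: Optional[str] = None

    def load_page(self, url: str) -> dict:
        page = fetch(url)                      # step 1: load initial information
        grammar = fetch(page["grammar_url"])   # step 2: load grammar via access info
        self.actions = grammar["phrases"]
        self.engine.load_grammar(self.actions.keys())
        self.current = url
        return page

    def handle(self, utterance: str) -> Optional[dict]:
        result = self.engine.recognize(utterance)  # step 3: result is sent here
        if result is None:
            return None                            # phrase not in the grammar
        return self.load_page(self.actions[result])  # step 4: process the result


nav = VoiceNavigationComponent(SpeechRecognitionEngine())
nav.load_page("/index")
page = nav.handle("Go to news")
```

In this sketch the grammar is fetched using a reference carried by the page itself, which mirrors the claim's "access information contained in said initial information"; how the actual system encodes that reference is not stated in the claims.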
19. A machine-readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
loading an initial information from a server in a client using a communication component;
automatically loading an information-dependent grammar in said client by using access information contained in said initial information and automatically providing said information-dependent grammar to a speech recognition engine disposed in said client for recognizing spoken words defined by said information-dependent grammar;
sending results of said speech recognition engine to a voice navigation component; and
processing results of said speech recognition engine in said voice navigation component.
Specification