Distributed Internet Based Speech Recognition System With Natural Language Support
Abstract
A speech-enabled internet based computing system includes a configurable speech recognition engine used for interacting with content on a web accessible page. The speech recognition engine is distributed across a client and server architecture, and is adaptive so that speech processing operations can be allocated as needed between the two. This allows for support for client devices having differing computing capabilities. Natural language operations can also be supported as desired. A user can thus interact with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.
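The allocation of speech processing between client and server described in the abstract can be pictured with a minimal sketch. The stage names, relative costs, and the `allocate` function below are hypothetical illustrations and are not taken from the patent; the sketch simply assumes the pipeline is an ordered list of stages and that a client runs as many leading stages as its compute budget allows, leaving the rest to the server.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str
    cost: int  # hypothetical relative compute cost


# Hypothetical ordered pipeline; names and costs are illustrative only.
PIPELINE = [
    Stage("capture_audio", 1),
    Stage("extract_features", 2),
    Stage("acoustic_decoding", 5),
    Stage("language_decoding", 8),
]


def allocate(client_budget, pipeline=PIPELINE):
    """Give the client the longest prefix of stages it can afford.

    Returns (client_stage_names, server_stage_names).
    """
    client = []
    spent = 0
    for stage in pipeline:
        if spent + stage.cost > client_budget:
            break  # the client cannot afford this stage; the server takes over
        client.append(stage.name)
        spent += stage.cost
    server = [s.name for s in pipeline[len(client):]]
    return client, server


# A low-powered device handles only the front end; a capable one does everything.
thin_client, thin_server = allocate(3)
full_client, full_server = allocate(16)
```

Under these assumed costs, a device with a budget of 3 would run the first two stages locally, while a device with a budget of 16 would need no server-side recognition at all.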
32 Claims
1. A speech-enabled internet based computing system comprising:
a speech recognition engine configured to generate a recognized speech query from an utterance;
said speech recognition engine being further configured to distribute speech processing operations between a portable client device and a server device on a device-by-device basis, such that a plurality of portable client devices having differing computing capabilities can be supported;
wherein individual ones of said plurality of portable client devices can be configured to perform at least part of said speech processing operations to generate said recognized speech query;
a web page having a list of items, at least some of said list of items being selectable through a browser on said portable client device based on said recognized speech query.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
13. A system for enabling a browser program to interact with a website using speech utterances, the system comprising:
a speech recognition engine configured to generate a recognized speech query from an utterance;
said speech recognition engine being further configurable such that speech processing operations can be distributed between a client device and a server device as required to achieve real-time recognition of a speech query; and
a natural language engine configured to determine a meaning of said recognized speech query and provide a first response thereto;
a web page routine for presenting one or more web pages to the browser program, wherein data content for said one or more web pages is controlled by said recognized speech query and/or said first response of said natural language engine;
wherein said recognized speech queries can be presented to both said natural language engine as well as to a text based query database for identifying a meaning of said recognized speech query, such that a second response can be provided by said database for at least some recognized speech queries prior to said first response of said natural language engine.
Dependent claims: 14, 15, 16, 17, 18, 19, 20, 21, 22
23. A World-Wide Web (WWW) accessible natural language computing system comprising:
a speech recognition engine configured to distribute speech processing operations between a client device and a server device for processing an utterance for a speech based query;
said server device being configured to generate a recognized speech query using a grammar accessible to said server device;
a natural language routine executing on the server device and configured to process said recognized speech query to generate a natural language result in real-time;
a WWW page coupled to the server device and having a list of items, at least some of said list of items being selectable by a user based on said natural language result;
a database coupled to the server device for storing predefined answers which correspond to content for said list of items on said WWW accessible page;
wherein a grammar used to recognize said speech based query can be varied between utterances and loaded dynamically as needed to recognize utterances associated with said content.
24. A method of interacting with a web-connected server using a browser program, the method comprising the steps of:
providing a distributed speech recognition engine configured to generate recognized speech queries from an utterance;
wherein said distributed speech recognition engine can be further configured to permit partial or full recognition of said utterance at a client device and/or a server device;
presenting one or more web pages to the browser program, such that data content for said one or more web pages transmitted to the browser program is controlled by said recognized speech query.
Dependent claims: 25, 26, 27, 28, 29, 30, 31, 32
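Claim 13 above recites presenting a recognized query to both a natural language engine and a text based query database, so that the database can answer some queries before the natural language engine does. One way to picture this is the simplified sketch below. It is an assumption-laden illustration, not the claimed implementation: the claim presents the query to both components, whereas this sketch checks the database first and falls back to the engine, and the names `normalize`, `respond`, `answers`, and `nl_engine` are all hypothetical.

```python
def normalize(query):
    """Lower-case the query and collapse whitespace so lookups are consistent."""
    return " ".join(query.lower().split())


def respond(query, answers, nl_engine):
    """Return (response, source) for a recognized speech query.

    answers   -- hypothetical dict standing in for the text based query database
    nl_engine -- hypothetical callable standing in for the natural language engine
    """
    key = normalize(query)
    if key in answers:
        # Fast path: the database already holds a response for this query text.
        return answers[key], "database"
    # Slow path: no stored response, so the natural language engine answers.
    return nl_engine(query), "natural_language_engine"
```

A caller would pass the recognized text along with whatever lookup table and engine it has available; the second element of the result indicates which component produced the response.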
Specification