Method for processing speech using dynamic grammars
First Claim
Patent Images
1. A distributed speech query recognition system adapted for responding to speech-based queries comprising:
- a client device including;
i) a speech capture software module for capturing a speech utterance from a user of said client device and partially processing said speech utterance to capture acoustic features;
ii) a communications module for transferring said acoustic features and context information associated with said client device through a network;
wherein said context information is related to items presented within a browser to said user of said client device when said user provides said speech utterance;
a server device including a speech recognition engine software module for generating recognized speech data from said acoustic features;
wherein said speech recognition engine uses a dynamic speech recognition grammar which is loaded based on said context information.
3 Assignments
0 Petitions
Accused Products
Abstract
Speech data is processed with one or more dynamic grammars, to reduce latency and improve accuracy. Different speech grammars are used by a speech recognition process depending on a context experienced by a speaker, and sentence grammars are similarly varied during a natural language process. The methods are useful for distributed speech recognition systems such as a client-server system, typically implemented on an intranet or over the Internet based on user queries at his/her computer, a PDA, or a workstation using a speech input interface.
-
Citations
30 Claims
-
1. A distributed speech query recognition system adapted for responding to speech-based queries comprising:
-
a client device including;
i) a speech capture software module for capturing a speech utterance from a user of said client device and partially processing said speech utterance to capture acoustic features;
ii) a communications module for transferring said acoustic features and context information associated with said client device through a network;
wherein said context information is related to items presented within a browser to said user of said client device when said user provides said speech utterance;
a server device including a speech recognition engine software module for generating recognized speech data from said acoustic features;
wherein said speech recognition engine uses a dynamic speech recognition grammar which is loaded based on said context information. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A speech query recognition system adapted for responding to speech-based queries system comprising:
-
a speech recognition engine for generating recognized speech data from a speech signal resulting from a speech-based query provided by a speaker;
wherein said speech recognition engine uses a limited speech recognition grammar of words which is loaded for a context experienced by said speaker when said speech-based query is made, said context being related to an environment presented within a browser to said speaker at a time when said speaker provides said speech-based query;
a natural language engine which generates recognized speech sentence data corresponding to said speech-based query based on said recognized speech data;
one or more query/response databases for storing question/answer pairs corresponding to said speech-based query;
wherein a set of question/answer pairs is selected and used to determine an answer to said speech-based query based on said context experienced by said speaker;
a query formulation engine adapted for retrieving one or more question/answer pairs from said set of question/answer pairs based on said recognized speech sentence data provided by said natural language engine. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A method of responding to speech-based queries across a distributed speech recognition system including the steps of:
-
capturing a speech utterance from a user of a client device;
extracting acoustic features from said speech utterance to perform a partial recognition of said speech utterance;
transferring said acoustic features and context information associated with said client device through a network to a server device;
wherein said context information is related to items presented within a browser to said user of said client device when said user provides said speech utterance;
completing said recognition of said speech utterance at said server device to generate recognized speech data from said acoustic features;
wherein said speech recognition engine at said server device uses a dynamic speech recognition grammar which is loaded based on said context information. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
-
23. A method of responding to speech-based queries system comprising the steps of:
-
generating recognized speech data from a speech signal resulting from a speech-based query provided by a speaker;
wherein said speech recognition engine uses a limited speech recognition grammar of words which is loaded for a context experienced by said speaker when said speech-based query is made, said context being related to an environment presented within a browser to said speaker at a time when said speaker provides said speech-based query;
generates recognized speech sentence data corresponding to said speech-based query based on said recognized speech data;
storing question/answer pairs corresponding to said speech-based query in one or more query/response databases;
wherein a set of question/answer pairs is selected and used to determine an answer to said speech-based query based on said context experienced by said speaker;
retrieving one or more question/answer pairs from said set of question/answer pairs based on said recognized speech sentence data provided by said natural language engine. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
Specification