System and method for providing an automated call center inline architecture
First Claim
1. A system for providing an automated call center inline architecture, comprising:
- a script engine to maintain prompts and a plurality of grammar references;
a telephony interface to receive a call and to collect audio data using the prompts from the script engine, which are transmitted to the telephony interface via a message server;
a speech server to perform distributed speech recognition, comprising;
one or more secondary recognizers to receive the grammar references from the script engine via the message server and to determine speech results by applying the grammar references to the audio data; and
a main recognizer to form a new grammar from the speech results and to identify speech recognition results by applying the new grammar to the audio data;
a confidence level module to assign confidence levels to each of the speech recognition results as an indication of accuracy of the distributed speech recognition;
a placeholder module to assign a placeholder to at least a portion of the speech recognition results when the assigned confidence level for those results do not satisfy a predetermined threshold;
an action module to perform an action based on a presence of the placeholder, wherein the action comprises querying the message server for additional speech recognition results; and
an agent console to receive the speech recognition results and the placeholder as a display.
20 Assignments
0 Petitions
Accused Products
Abstract
A system and method for providing an automated call center inline architecture is provided. A plurality of grammar references and prompts are maintained on a script engine. A call is received through a telephony interface. Audio data is collected using the prompts from the script engine, which are transmitted to the telephony interface via a message server. Distributed speech recognition is performed on a speech server. The grammar references are received from the script engine via the message server. Speech results are determined by applying the grammar references to the audio data. A new grammar is formed from the speech results. Speech recognition results are identified by applying the new grammar to the audio data. The speech recognition results are received as a display on an agent console.
33 Citations
18 Claims
-
1. A system for providing an automated call center inline architecture, comprising:
-
a script engine to maintain prompts and a plurality of grammar references; a telephony interface to receive a call and to collect audio data using the prompts from the script engine, which are transmitted to the telephony interface via a message server; a speech server to perform distributed speech recognition, comprising; one or more secondary recognizers to receive the grammar references from the script engine via the message server and to determine speech results by applying the grammar references to the audio data; and a main recognizer to form a new grammar from the speech results and to identify speech recognition results by applying the new grammar to the audio data; a confidence level module to assign confidence levels to each of the speech recognition results as an indication of accuracy of the distributed speech recognition; a placeholder module to assign a placeholder to at least a portion of the speech recognition results when the assigned confidence level for those results do not satisfy a predetermined threshold; an action module to perform an action based on a presence of the placeholder, wherein the action comprises querying the message server for additional speech recognition results; and an agent console to receive the speech recognition results and the placeholder as a display. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method for providing an automated call center inline architecture, comprising:
-
maintaining prompts and a plurality of grammar references on a script engine; receiving a call through a telephony interface and collecting audio data using the prompts from the script engine, which are transmitted to the telephony interface via a message server; performing distributed speech recognition on a speech server, comprising; receiving the grammar references from the script engine via the message server and determining speech results by applying the grammar references to the audio data; forming a new grammar from the speech results; and identifying speech recognition results by applying the new grammar to the audio data; assigning a confidence level to each of the speech recognition results to provide an indication of accuracy of the distributed speech recognition; assigning a placeholder to at least a portion of the speech recognition results when the assigned confidence level for those results do not satisfy a predetermined threshold; performing an action based on a presence of the placeholder, wherein the action comprises querying the message server for additional speech recognition results; and receiving the speech recognition results and the placeholder as a display on an agent console. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification