Apparatus and methods for implementing voice enabling applications in a coverged voice and data network environment
First Claim
1. A method of implementing voice-enabled applications in a converged voice and data network environment, the method comprising of the steps:
- a. entering human voice data into the converged voice and data network;
b. converting the voice data into a digital format for speech processing at a later time;
c. providing a control mechanism to interface at least one network-based voice enabled application with at least one speech processing function using a speech application interface;
d. performing at least one speech processing function using the voice of the end user; and
e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
0 Assignments
0 Petitions
Accused Products
Abstract
Human speech is transported through a voice and data converged Internet network to recognize its content, verify the ifentity of the speaker, or to verify the content of a spoken phrase by utilizing the Internet protocol to transmit voice packets. The voice data (4) entered is processed and transmitted in the same way as Internet data packets over converged voice and data IP networks. A voice-enabled application isends a message (5), which is decoded by the speech API (2) and the appriopriate control and synchronization information is issued (7) to the data preparation module (9) and to the speech engine (3). Standard voice over IP includes a speech compression algorithm and the use of RTP (Real Time Protocol), enabling additional processing of the human voice anywhere in the network to perform speaker verification, with or without the knowledge of the speaker.
-
Citations
17 Claims
-
1. A method of implementing voice-enabled applications in a converged voice and data network environment, the method comprising of the steps:
-
a. entering human voice data into the converged voice and data network;
b. converting the voice data into a digital format for speech processing at a later time;
c. providing a control mechanism to interface at least one network-based voice enabled application with at least one speech processing function using a speech application interface;
d. performing at least one speech processing function using the voice of the end user; and
e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of implementing voice-enabled applications in a converged voice and data network environment, comprising:
-
a. entering human voice data from a speaker into the converged voice and data network for later processing and acoustic matching;
b. non-intrusively processing the voice data from an end user while the user is speaking into the voice and data network;
C. preparing the voice data for the speech processing function with a front-end processing module to separate the pauses in the speech from the voice data and utilizing a voice feature extraction module to ready the speech for a processing algorithm from a speech engine;
d. synchronizing generated control information with a data exchange between a data preparation module and the speech engine through a speech application interface;
e. processing the voice features by the speech engine to perform at least one combination of the speech processing and pattern recognition algorithms implemented by the speech engine. f. providing feedback to the end user to communicate the result from the speech processing; and
g. taking an action responsive to the result from the processing of the voice. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
Specification