Apparatus and methods for implementing voice enabling applications in a converged voice and data network environment
First Claim
1. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
- a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network;
b. converting the voice data into a digital format for VoIP transport and speech processing at a later time;
c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface;
d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user, wherein the voice-enabled application performs non-intrusive voice ID biometrics during which the user enters his voice into the network for the purpose of performing a VoIP telephone conversation and a sample from the voice of the telephone conversation is used to verify the user in a manner transparent to the user; and
e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
0 Assignments
0 Petitions
Accused Products
Abstract
Human speech is transported through a voice and data converged Internet network to recognize its content, verify the identity of the speaker, or to verify the content of a spoken phrase by utilizing the Internet protocol to transmit voice packets. The voice data (4) entered is processed and transmitted in the same way as Internet data packets over converged voice and data IP networks. A voice-enabled application sends a message (5), which is decoded by the speech API (2) and the appropriate control and synchronization information is issued (7) to the data preparation module (9) and to the speech engine (3). Standard voice over IP includes a speech compression algorithm and the use of RTP (Real Time Protocol), enabling additional processing of the human voice anywhere in the network to perform speaker verification, with or without the knowledge of the speaker.
29 Citations
18 Claims
-
1. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
-
a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network; b. converting the voice data into a digital format for VoIP transport and speech processing at a later time; c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface; d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user, wherein the voice-enabled application performs non-intrusive voice ID biometrics during which the user enters his voice into the network for the purpose of performing a VoIP telephone conversation and a sample from the voice of the telephone conversation is used to verify the user in a manner transparent to the user; and e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
-
-
2. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
-
a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network; b. converting the voice data into a digital format for VoIP transport and speech processing at a later time; c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface; d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user; wherein the voice-enabled application residing in a packet-switched IP based network for VoIP traffic performs spoken text verification without an intrusive response, where the computer screen prompts the user to speak an advertising phrase and the voice enters the network through the computer without passing through a PSTN network; and e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
-
-
3. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
-
a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network; b. converting the voice data into a digital format for VoIP transport and speech processing at a later time; c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface; d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user, wherein the voice-enabled application residing in a packet-switched IP based network for VoIP traffic performs spoken text verification where the computer screen prompts the user to speak an advertising phrase while simultaneously verifying the user'"'"'s identity through non-intrusive voice ID biometrics, and the voice enters the network through the computer without passing through a PSTN network; and e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
-
-
4. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
-
a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network; b. converting the voice data into a digital format for VoIP transport and speech processing at a later time; c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface; d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user, wherein the voice-enabled application residing in a packet-switched IP based network for VoIP traffic performs non-intrusive voice ID biometrics simultaneously to more than one speaker participating in a conference call and indicates who is currently speaking by highlighting a reference to him on a monitoring WEB site; and e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
-
-
5. A method of implementing voice-enabled applications in a single converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, the method comprising:
-
a. entering human voice data as data packets into a converged VoIP packet-switched IP based voice and data network; b. converting the voice data into a digital format for VoIP transport and speech processing at a later time; c. providing a control mechanism to interface at least one packet-switched network voice enabled application with at least one speech processing function using a VoIP speech application interface; d. performing at least one speech processing biometrics function inside the VoIP network using the voice of the end user, wherein the voice-enabled application residing in a packet-switched IP based network for VoIP traffic performs non-intrusive voice ID biometrics in order to differentiate the voices of the users of a flat rate long distance VoIP telephone service and allow usage-only by the legitimate subscriber; and e. taking an action with respect to the end user depending on the result from the speech processing of the voice data, said action being taken from the group consisting of allowing the end user to enter into a secured transaction, awarding the end user with a prize, penalizing the end user by disallowing entry, and providing feedback to the end user to communicate the result.
-
-
6. A method of implementing voice-enabled applications in a converged voice and data network IP environment that supports and implements VoIP connections, protocols and traffic, comprising:
-
a. entering human voice data as data packets from a speaker into a converged VoIP voice and data packet-switched IP based network for later processing and acoustic matching; b. non-intrusively processing the voice data from an end user biometrically while the end user is speaking into the converged VoIP voice and data packet-switched IP based network; c. preparing the voice data for VoIP speech processing function with a front-end processing module to separate the pauses in the speech from the voice data and utilizing a voice feature extraction module to ready the speech for a processing algorithm from a speech engine; d. synchronizing generated control information with a data exchange between a data preparation module and the speech engine through a speech application interface; e. processing the voice features by the speech engine to perform at least one combination of the speech processing and pattern recognition algorithms implemented by the speech engine; f. providing feedback to the end user to communicate the result from the speech processing; and g. taking an action responsive to the result from the processing of the voice. - View Dependent Claims (7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
Specification