Real time automatic caller speech profiling
First Claim
1. A method of recognizing a voice input received by an interactive voice recognition system, the method comprising:
- receiving an unintelligible voice input from a caller;
digitizing, by one or more processors, the unintelligible voice input to generate an unintelligible digitized voice input;
determining, by one or more processors, that the unintelligible digitized voice input is unintelligible to a computer due to background noise at a caller environment of the caller;
in response to determining that the unintelligible digitized voice input does not match any entry from a lexicon of known digitized voice inputs, storing the unintelligible digitized voice input in a no-match-based user profiles database;
prompting the caller to transmit a secondary input to clarify the unintelligible digitized voice input, wherein the secondary input is a non-verbal electronic input that identifies a user-requested activity that the caller was requesting when transmitting the voice input;
utilizing the non-verbal electronic input to match the unintelligible digitized voice input with a specific known digitized voice input from the lexicon of known digitized voice inputs;
comparing the unintelligible digitized voice input to the specific known digitized voice input to derive a voice input match percentage, wherein the voice input match percentage describes how closely the unintelligible digitized voice input and the specific known digitized voice input match one another;
in response to the voice input match percentage exceeding a predetermined value, updating the lexicon of known digitized voice inputs with the unintelligible digitized voice input, wherein the unintelligible digitized voice input is matched to the specific known digitized voice input from the lexicon of known digitized voice inputs;
matching the unintelligible digitized voice input to the user-requested activity in a database that is specific for the caller;
receiving the unintelligible voice input in a subsequent phone call from the user and converting the unintelligible voice input in the subsequent phone call into the unintelligible digitized voice input; and
matching the unintelligible digitized voice input from the subsequent phone call to the user-requested activity for the caller.
1 Assignment
0 Petitions
Accused Products
Abstract
A computer-implemented method, system, and/or computer program product recognizes a voice input received by an interactive voice recognition system. An unintelligible voice input, which is unintelligible due to background noise at the caller'"'"'s location, is received from a caller. Upon determining that the unintelligible voice input does not match any entry from a lexicon of known voice inputs, the unintelligible voice input is stored in a no-match-based user profiles database, and the caller is prompted to transmit a non-verbal electronic input to clarify the unintelligible voice input. This non-verbal electronic input is used to match the unintelligible voice input with a specific known voice input from the lexicon of known voice inputs, which is updated with the unintelligible voice input. The updated lexicon is subsequently used in future phone calls from the user to match the unintelligible voice input to the user-requested activity for the caller.
20 Citations
20 Claims
-
1. A method of recognizing a voice input received by an interactive voice recognition system, the method comprising:
-
receiving an unintelligible voice input from a caller; digitizing, by one or more processors, the unintelligible voice input to generate an unintelligible digitized voice input; determining, by one or more processors, that the unintelligible digitized voice input is unintelligible to a computer due to background noise at a caller environment of the caller; in response to determining that the unintelligible digitized voice input does not match any entry from a lexicon of known digitized voice inputs, storing the unintelligible digitized voice input in a no-match-based user profiles database; prompting the caller to transmit a secondary input to clarify the unintelligible digitized voice input, wherein the secondary input is a non-verbal electronic input that identifies a user-requested activity that the caller was requesting when transmitting the voice input; utilizing the non-verbal electronic input to match the unintelligible digitized voice input with a specific known digitized voice input from the lexicon of known digitized voice inputs; comparing the unintelligible digitized voice input to the specific known digitized voice input to derive a voice input match percentage, wherein the voice input match percentage describes how closely the unintelligible digitized voice input and the specific known digitized voice input match one another; in response to the voice input match percentage exceeding a predetermined value, updating the lexicon of known digitized voice inputs with the unintelligible digitized voice input, wherein the unintelligible digitized voice input is matched to the specific known digitized voice input from the lexicon of known digitized voice inputs; matching the unintelligible digitized voice input to the user-requested activity in a database that is specific for the caller; receiving the unintelligible voice input in a subsequent phone call from the user and converting the unintelligible voice input in the subsequent phone call into the unintelligible digitized voice input; and matching the unintelligible digitized voice input from the subsequent phone call to the user-requested activity for the caller. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product for recognizing a voice input received by an interactive voice recognition system, the computer program product comprising a computer readable storage medium having program code embodied therewith, the program code readable and executable by a processor to:
-
receive an unintelligible voice input from a caller; digitize the unintelligible voice input to generate an unintelligible digitized voice input; determine that the unintelligible digitized voice input is unintelligible to a computer due to background noise at a caller environment of the caller; in response to determining that the unintelligible digitized voice input does not match any entry from a lexicon of known digitized voice inputs, store the unintelligible digitized voice input in a no-match-based user profiles database; prompt the caller to transmit a secondary input to clarify the unintelligible digitized voice input, wherein the secondary input is a non-verbal electronic input that identifies a user-requested activity that the caller was requesting when transmitting the voice input; utilize the non-verbal electronic input to match the unintelligible digitized voice input with a specific known digitized voice input from the lexicon of known digitized voice inputs; compare the unintelligible digitized voice input to the specific known digitized voice input to derive a voice input match percentage, wherein the voice input match percentage describes how closely the unintelligible digitized voice input and the specific known digitized voice input match one another; in response to the voice input match percentage exceeding a predetermined value, update the lexicon of known digitized voice inputs with the unintelligible digitized voice input, wherein the unintelligible digitized voice input is matched to the specific known digitized voice input from the lexicon of known digitized voice inputs; match the unintelligible digitized voice input to the user-requested activity in a database that is specific for the caller; receive the unintelligible voice input in a subsequent phone call from the user and converting the unintelligible voice input in the subsequent phone call into the unintelligible digitized voice input; and match the unintelligible digitized voice input from the subsequent phone call to the user-requested activity for the caller. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A computer system comprising:
-
a processor, a computer readable memory, and a non-transitory computer readable storage media; first program instructions to receive an unintelligible voice input from a caller; second program instructions to digitize the unintelligible voice input to generate an unintelligible digitized voice input; third program instructions to determine that the unintelligible digitized voice input is unintelligible to a computer due to background noise at a caller environment of the caller; fourth program instructions to, in response to determining that the unintelligible digitized voice input does not match any entry from a lexicon of known digitized voice inputs, store the unintelligible digitized voice input in a no-match-based user profiles database; fifth program instructions to prompt the caller to transmit a secondary input to clarify the unintelligible digitized voice input, wherein the secondary input is a non-verbal electronic input that identifies a user-requested activity that the caller was requesting when transmitting the voice input; sixth program instructions to utilize the non-verbal electronic input to match the unintelligible digitized voice input with a specific known digitized voice input from the lexicon of known digitized voice inputs; seventh program instructions to compare the unintelligible digitized voice input to the specific known digitized voice input to derive a voice input match percentage, wherein the voice input match percentage describes how closely the unintelligible digitized voice input and the specific known digitized voice input match one another; eighth program instructions to, in response to the voice input match percentage exceeding a predetermined value, update the lexicon of known digitized voice inputs with the unintelligible digitized voice input, wherein the unintelligible digitized voice input is matched to the specific known digitized voice input from the lexicon of known digitized voice inputs; ninth program instructions to match the unintelligible digitized voice input to the user-requested activity in a database that is specific for the caller; tenth program instructions to receive the unintelligible voice input in a subsequent phone call from the user and converting the unintelligible voice input in the subsequent phone call into the unintelligible digitized voice input; and eleventh program instructions to match the unintelligible digitized voice input from the subsequent phone call to the user-requested activity for the caller; and
wherein the first, second, third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, and eleventh program instructions are stored on the non-transitory computer readable storage media for execution by the processor via the computer readable memory. - View Dependent Claims (18, 19, 20)
-
Specification