Apparatus and method for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

US 6,529,871 B1
Filed: 10/25/2000
Issued: 03/04/2003
Est. Priority Date: 06/11/1997
Status: Expired due to Term

First Claim

Patent Images

1. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:

(a) receiving an identity claim of the user;

(b) querying the user with a random question;

(c) receiving an answer of the user to said random question;

at least one of said identity claim and said answer being received as a spoken utterance of the user;

(d) evaluating correctness of said answer of the user;

(e) performing speaker recognition on said at least one of said identity claim and said answer which is received as said spoken utterance; and

(f) granting access to the user if steps (d) and (e) indicate such access to be warranted.

View all claims

0 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of controlling access of a speaker to one of a service and a facility, the method comprising the steps of: (a) receiving first spoken utterances of the speaker, the first spoken utterances containing indicia of the speaker;(b) decoding the first spoken utterances; (c) accessing a database corresponding to the decoded first spoken utterances, the database containing information attributable to a speaker candidate having indicia substantially similar to the speaker; (d) querying the speaker with at least one question based on the information contained in the accessed database; (e) receiving second spoken utterances of the speaker, the second spoken utterances being representative of at least one answer to the at least one question; (f) decoding the second spoken utterances; (g) verifying the accuracy of the decoded answer against the information contained in the accessed database serving as the basis for the question; (h) taking a voice sample from the utterances of the speaker and processing the voice sample against an acoustic model attributable to the speaker candidate; (i) generating a score corresponding to the accuracy of the decoded answer and the closeness of the match between the voice sample and the model; and (j) comparing the score to a predetermined threshold value and if the score is one of substantially equivalent to and above the threshold value, then permitting speaker access to one of the service and the facility.

Citations

45 Claims

1. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving an identity claim of the user;
  
  (b) querying the user with a random question;
  
  (c) receiving an answer of the user to said random question;
  
  at least one of said identity claim and said answer being received as a spoken utterance of the user;
  
  (d) evaluating correctness of said answer of the user;
  
  (e) performing speaker recognition on said at least one of said identity claim and said answer which is received as said spoken utterance; and
  
  (f) granting access to the user if steps (d) and (e) indicate such access to be warranted.

2. A method for evaluating a user of one of a service and a facility, said method comprising the steps of:
- (a) receiving an identity claim of the user;
  
  (b) querying the user with a random question;
  
  (c) receiving an answer of the user to said random question;
  
  at least one of said identity claim and said answer being received as a spoken utterance of the user;
  
  (d) evaluating correctness of said answer of the user;
  
  (e) performing speaker recognition on said at least one of said identity claim and said answer which is received as said spoken utterance; and
  
  (f) granting access to the user if steps (d) and (e) indicate such access to be warranted.
- View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
- - 3. The method of claim 2, wherein step (a) comprises receiving said identity claim as at least one of a static feature and a dynamic feature, said static feature in turn comprising at least one of external information and internal information.
  - 4. The method of claim 3, wherein step (a) comprises receiving said identity claim as a dynamic feature comprising at least one of:
    - trip information, facsimile information, e-mail information, meeting information, age information, time of attempting to access the one of the service and the facility, and location from which the user is calling.
  - 5. The method of claim 3, wherein step (a) comprises receiving said identity claim as an external static feature comprising at least one of:
    - phone number from which the user is calling and time of day when the user is calling.
  - 6. The method of claim 3, wherein step (a) comprises receiving said identity claim as an internal static feature comprising at least one of:
    - gender, speech rate, accent, preferred vocabulary, preferred request, name, address, date of birth, and family status.
  - 7. The method of claim 2, wherein step (a) comprises receiving said identity claim in a manner other than as a spoken utterance.
  - 8. The method of claim 7, wherein step (a) comprises receiving said identity claim via a card swipe.
  - 9. The method of claim 7, wherein step (a) comprises receiving said identity claim via keyboard input.
  - 10. The method of claim 2, wherein step (a) comprises receiving said identity claim as said spoken utterance.
  - 11. The method of claim 10, further comprising the additional step of decoding said spoken utterance, via speaker-independent speech recognition, to comprehend said identity claim.
  - 12. The method of claim 11, wherein said decoding and step (e) are performed substantially simultaneously.
  - 13. The method of claim 10, further comprising the additional step of evaluating voice characteristics of the user contained in said spoken utterance, via text-independent speaker recognition, to comprehend said identity claim.
  - 14. The method of claim 2, wherein said random question pertains to personal information of said user, and wherein step (b) comprises querying via a natural language dialog.
  - 15. The method of claim 2, wherein step (c) comprises receiving said answer as natural language dialog.
  - 16. The method of claim 15, wherein step (d) comprises evaluating correctness of said answer, at least in part, via natural language understanding.
  - 17. The method of claim 15, wherein step (c) comprises receiving said answer as said spoken utterance.
  - 18. The method of claim 2, wherein step (d) comprises evaluating correctness of said answer of the user based on said identity claim.
  - 19. The method of claim 2, wherein step (e) comprises performing text-independent speaker recognition.
  - 20. The method of claim 2, wherein:

21. A method for evaluating a user of one of a service and a facility, said method comprising the steps of:
- (a) receiving a spoken utterance of a user;
  
  (b) decoding said spoken utterance via automatic speech recognition to obtain information bearing indications of identity of the user;
  
  (c) performing text-independent speaker recognition on said spoken utterance to test whether said spoken utterance was likely uttered by a person corresponding to said indications of said identity of the user; and
  
  (d) granting access to the user if steps (b) and (c) indicate such access to be warranted.
- View Dependent Claims (22, 23, 24)
- - 22. The method of claim 21, wherein:
23. The method of claim 21, wherein step (a) comprises receiving said spoken utterance of the user as at least one of a static feature and a dynamic feature.
24. The method of claim 21, wherein:
- step (a) comprises receiving said spoken utterance of said user as indicative of a subset of users having more than one member; and
  
  step (b) comprises decoding said spoken utterance to obtain said information, said information bearing indications of membership of the user in said subset.

25. A method for evaluating a user of one of a service and a facility, said method comprising the steps of:
- (a) receiving a spoken utterance of a user;
  
  (b) decoding said spoken utterance via automatic speech recognition to obtain information bearing indications of identity of the user;
  
  (c) performing speaker identification, via text-independent speaker recognition, on said spoken utterance to develop an estimation of said identity of the user; and
  
  (d) granting access to the user if steps (b) and (c) indicate such access to be warranted.
- View Dependent Claims (26, 27, 28, 29)
- - 26. The method of claim 25, wherein:
27. The method of claim 25, wherein step (a) comprises receiving said spoken utterance of the user as at least one of a static feature and a dynamic feature.
28. The method of claim 25, wherein:
- step (a) comprises receiving said spoken utterance of said user as indicative of a subset of users having more than one member; and
  
  step (b) comprises decoding said spoken utterance to obtain said information, said information bearing indications of membership of the user in said subset.
29. The method of claim 25, wherein steps (b) and (c) are performed substantially simultaneously.

30. A method for evaluating a user of one of a service and a facility, said method comprising the steps of:
- (a) receiving a first natural language spoken utterance of the user;
  
  (b) decoding said first natural language spoken utterance, via natural language understanding (NLU) speech recognition, to obtain a first decoded utterance having factual content;
  
  (c) performing text-independent speaker recognition on said first natural language spoken utterance; and
  
  (d) granting access to the user if both said factual content of said first decoded utterance and said text-independent speaker recognition of said first natural language spoken utterance indicate such access to be warranted.
- View Dependent Claims (31)
- - 31. The method of claim 30, further comprising the additional steps of:

32. A method for evaluating a user of one of a service and a facility having a plurality of permitted users, said method comprising the steps of:
- (a) receiving a first piece of information pertaining to the user, said first piece of information containing information sufficient to identify the user as a member of a multi-user group including less than all of said permitted users of the one of the service and the facility;
  
  (b) accessing a database, based on said first piece of information, to identify said multi-user group; and
  
  (c) determining a most likely member of said multi-user group to whom the user corresponds, based on speaker identification performed on a spoken utterance of the user.
- View Dependent Claims (33, 34, 35, 36, 37)
- - 33. The method of claim 32, wherein step (a) comprises receiving, as said first piece of information, one of a name associated with the user, a password associated with the user, an object of a request associated with the user and a phone number associated with the user.
  - 34. The method of claim 32, wherein step (a) comprises receiving said first piece of information as one of a static feature and a dynamic feature.
  - 35. The method of claim 32, wherein step (a) comprises receiving said first piece of information as said spoken utterance of the user, further comprising the additional step of, prior to step (b), decoding said spoken utterance via automatic speech recognition, and wherein step (b) comprises accessing said database based on said decoding.
  - 36. The method of claim 35, wherein:
37. The method of claim 36, wherein step (a) comprises receiving said name associated with the user;
- andsaid multi-user group comprises users having names which are similar-sounding to said name associated with the user.

38. A method for evaluating a user of one of a service and a facility having a plurality of permitted users, said method comprising the steps of:
- (a) receiving a first piece of information pertaining to the user, said first piece of information containing information sufficient to identify the user as a member of a multi-user group including less than all of said permitted users of the one of the service and the facility;
  
  (b) accessing a database, based on said first piece of information, to identify said multi-user group;
  
  (c) forming a ranked list of possible users to whom the user may correspond, based on speaker identification performed on a spoken utterance of the user; and
  
  (d) searching for a match between said ranked list of possible users and members of said multi-user group.
- View Dependent Claims (39, 40)
- - 39. The method of claim 38, wherein step (a) comprises receiving said first piece of information as said spoken utterance of the user, further comprising the additional step of decoding said spoken utterance via automatic speech recognition, and wherein said decoding and said speaker identification are performed substantially simultaneously.
  - 40. The method of claim 38, wherein step (a) comprises receiving a name associated with the user;
    - and

41. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving a spoken utterance of a user;
  
  (b) decoding said spoken utterance via automatic speech recognition to obtain information bearing indications of identity of the user;
  
  (c) performing text-independent speaker recognition on said spoken utterance to test whether said spoken utterance was likely uttered by a person corresponding to said indications of said identity of the user; and
  
  (d) granting access to the user if steps (b) and (c) indicate such access to be warranted.

42. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving a spoken utterance of a user;
  
  (b) decoding said spoken utterance via automatic speech recognition to obtain information bearing indications of identity of the user;
  
  (c) performing speaker identification, via text-independent speaker recognition, on said spoken utterance to develop an estimation of said identity of the user; and
  
  (d) granting access to the user if steps (b) and (c) indicate such access to be warranted.

43. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving a first natural language spoken utterance of the user;
  
  (b) decoding said first natural language spoken utterance, via natural language understanding (NLU) speech recognition, to obtain a first decoded utterance having factual content;
  
  (c) performing text-independent speaker recognition on said first natural language spoken utterance; and
  
  (d) granting access to the user if both said factual content of said first decoded utterance and said text-independent speaker recognition of said first natural language spoken utterance indicate such access to be warranted.

44. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving a first piece of information pertaining to the user, said first piece of information containing information sufficient to identify the user as a member of a multi-user group including less than all of said permitted users of the one of the service and the facility;
  
  (b) accessing a database, based on said first piece of information, to identify said multi-user group; and
  
  (c) determining a most likely member of said multi-user group to whom the user corresponds, based on speaker identification performed on a spoken utterance of the user.

45. A program storage device readable by machine, tangibly embodying a program of instructions executable by the machine to perform method steps for evaluating a user of one of a service and a facility, said method steps comprising:
- (a) receiving a first piece of information pertaining to the user, said first piece of information containing information sufficient to identify the user as a member of a multi-user group including less than all of said permitted users of the one of the service and the facility;
  
  (b) accessing a database, based on said first piece of information, to identify said multi-user group;
  
  (c) forming a ranked list of possible users to whom the user may correspond, based on speaker identification performed on a spoken utterance of the user; and
  
  (d) searching for a match between said ranked list of possible users and members of said multi-user group.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Kanevsky, Dimitri, Maes, Stephane Herman
Primary Examiner(s)
Banks-Harold, Marsha D.
Assistant Examiner(s)
MCFADDEN, SUSAN IRIS

Application Number

US09/696,585
Time in Patent Office

860 Days
Field of Search

704/246, 704/247, 704/257, 704/270, 704/273, 704/275, 704/255, 704/260, 379/88.02
US Class Current

704/246
CPC Class Codes

G10L 17/22 Interactive procedures; Man...

G10L 17/24 the user being prompted to ...

Apparatus and method for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

First Claim

0 Assignments

0 Petitions

Accused Products

Abstract

Citations

45 Claims

Specification

Solutions

Use Cases

Quick Links

Apparatus and method for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases

First Claim

0 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

45 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links