System and method for speech personalization by need
First Claim
1. A method comprising:
- recognizing speech of each speaker of a plurality of speakers on a conference call, to yield recognized speech for each of the plurality of speakers, wherein the speech of each speaker from the plurality of speakers is received via a speech interface implemented on a computing device;
recording metrics associated with the recognized speech for each of the plurality of speakers, wherein the metrics comprise a request for repetition, a negative response to confirmation, and a task completion;
after recording the metrics, while recording further speech from the each speaker from the plurality of speakers, modifying, via a processor, an allocation of resources of the speech interface based on the metrics, to yield a modified speech interface; and
recognizing additional speech during the conference call from an identified speaker in the plurality of speakers using the modified speech interface.
4 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions. The method can further store a speaker personalization profile having information for the modified set of allocated resources and recognize speech associated with the speaker based on the speaker personalization profile.
-
Citations
20 Claims
-
1. A method comprising:
-
recognizing speech of each speaker of a plurality of speakers on a conference call, to yield recognized speech for each of the plurality of speakers, wherein the speech of each speaker from the plurality of speakers is received via a speech interface implemented on a computing device; recording metrics associated with the recognized speech for each of the plurality of speakers, wherein the metrics comprise a request for repetition, a negative response to confirmation, and a task completion; after recording the metrics, while recording further speech from the each speaker from the plurality of speakers, modifying, via a processor, an allocation of resources of the speech interface based on the metrics, to yield a modified speech interface; and recognizing additional speech during the conference call from an identified speaker in the plurality of speakers using the modified speech interface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a processor; and a computer-readable storage medium having instructions stored which, when executed by the processor, result in the processor performing operations comprising; recognizing speech of each speaker of a plurality of speakers on a conference call, to yield recognized speech for each of the plurality of speakers, wherein the speech of each speaker from the plurality of speakers is received via a speech interface; recording metrics associated with the recognized speech for each of the plurality of speakers, wherein the metrics comprise a request for repetition, a negative response to confirmation, and a task completion; after recording the metrics, while recording further speech from the each speaker from the plurality of speakers, modifying an allocation of resources of the speech interface based on the metrics, to yield a modified speech interface; and recognizing additional speech during the conference call from an identified speaker in the plurality of speakers using the modified speech interface. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A computer-readable storage device having instructions stored which, when executed by a computing device, result in the computing device performing operations comprising:
-
recognizing speech of each speaker of a plurality of speakers on a conference call, to yield recognized speech for each of the plurality of speakers, wherein the speech of each speaker from the plurality of speakers is received via a speech interface; recording metrics associated with the recognized speech for each of the plurality of speakers, wherein the metrics comprise a request for repetition, a negative response to confirmation, and a task completion; after recording the metrics, while recording further speech from the each speaker from the plurality of speakers, modifying an allocation of resources of the speech interface based on the metrics, to yield a modified speech interface; and recognizing additional speech during the conference call from an identified speaker in the plurality of speakers using the modified speech interface.
-
Specification