Speech recognition tuning tool
Abstract
Systems and methods for tuning a dictionary of a speech recognition system include accessing a voice mail record of a user, accessing a recorded audio file of a name of the user in the voice mail record spoken by the user, providing the audio file to a speech recognition system, processing the audio file in the speech recognition system and obtaining a text result, determining whether a confidence score of the text result is below a predetermined threshold, and adding, at least, the name of the user to a list of low confidence names. Alternate spellings for the low confidence names can then be added to the dictionary.
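The tuning pass summarized in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `VoiceMailRecord`, `find_low_confidence_names`, `stub_recognizer`, the 0.6 threshold, and the 0-to-1 confidence scale are all hypothetical and do not come from the patent, which does not prescribe an implementation.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class VoiceMailRecord:
    """Hypothetical stand-in for a user's voice mail record."""
    user_name: str     # directory name of the mailbox owner
    name_audio: bytes  # the owner's own recording of their name


# Assumed recognizer interface: audio in, (text result, confidence score) out.
Recognizer = Callable[[bytes], Tuple[str, float]]


def find_low_confidence_names(
    records: Iterable[VoiceMailRecord],
    recognize: Recognizer,
    threshold: float = 0.6,  # assumed value; the patent only says "predetermined"
) -> List[str]:
    """Return names whose recorded audio scores below the threshold."""
    low_confidence = []
    for record in records:
        _text, score = recognize(record.name_audio)
        if score < threshold:
            low_confidence.append(record.user_name)
    return low_confidence


def stub_recognizer(audio: bytes) -> Tuple[str, float]:
    """Canned results standing in for a real speech recognition system."""
    canned = {
        b"smith-audio": ("Smith", 0.93),
        b"nguyen-audio": ("Win", 0.41),
    }
    return canned[audio]


records = [
    VoiceMailRecord("Smith", b"smith-audio"),
    VoiceMailRecord("Nguyen", b"nguyen-audio"),
]
low_confidence = find_low_confidence_names(records, stub_recognizer)
print(low_confidence)  # ['Nguyen']
```

In this sketch the names returned are the ones an administrator would then review and supply alternate spellings for.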
17 Claims
1. A method, comprising:
accessing a voice mail record of a user within a voice mail system;
accessing a recorded audio file of a name of the user in the voice mail record spoken by the user;
providing the audio file to a speech recognition system that is operable with an automated attendant;
processing the audio file in the speech recognition system and obtaining a text result;
determining whether a confidence score of the text result is below a predetermined threshold;
adding, at least, the name of the user to a list of low confidence names when the confidence score is below the predetermined threshold;
when the name of the user is listed in the list of low confidence names, storing a plurality of actual alternate spellings for the name of the user, wherein the plurality of actual alternate spellings are accessible to the speech recognition system and are received via a user interface configured to be presented to an administrator of the automated attendant;
receiving a voice call at the automated attendant including receiving a voice command comprising a spoken name of the user; and
processing the spoken name of the user including comparing a spelled name result generated by the speech recognition system to the plurality of actual alternate spellings previously stored to identify the user.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. An apparatus, comprising:
a speech recognition tuning tool configured to be in communication with a voice mail system, a low confidence audio recording database, and a speech to text converter, the speech recognition tuning tool having a controller, wherein the controller is configured to:
access an audio file of a name of the user in the voice mail system;
provide the audio file to the speech to text converter operable with an automated attendant;
determine whether a confidence score of a text result from the speech to text converter is below a predetermined threshold;
add, at least, the name of the user to a list of low confidence names in the low confidence audio recording database when the confidence score is below the predetermined threshold;
receive, via a user interface configured to be presented, via a display, to an administrator of the automated attendant, and store a plurality of actual alternate spellings for the name of the user, wherein the plurality of actual alternate spellings are accessible to the controller;
receive a voice call at the automated attendant including receiving a voice command comprising a spoken name of the user; and
process the spoken name of the user including comparing a spelled name result generated by the speech to text converter to the plurality of actual alternate spellings previously stored to identify the user.
Dependent claims: 12, 13, 14
15. Logic encoded in one or more non-transitory media for execution and when executed operable to:
access an audio file of a name of the user in a voice mail system;
provide the audio file to a speech to text converter operable with an automated attendant;
determine whether a confidence score of a text result from the speech to text converter is below a predetermined threshold;
add, at least, the name of the user to a list of low confidence names when the confidence score is below the predetermined threshold;
receive, via a user interface configured to be presented to an administrator of the automated attendant, and store a plurality of actual alternate spellings for the name of the user as a result of the name having been added to the list of low confidence names;
receive a voice call at the automated attendant including receiving a voice command comprising a spoken name of the user; and
process the spoken name of the user including comparing a spelled name result generated by the speech to text converter to the plurality of actual alternate spellings previously stored to identify the user.
Dependent claims: 16, 17
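The final claim element, comparing a spelled name result to previously stored alternate spellings, might look like the following Python sketch. The function `identify_user`, the dictionary layout, the sample spellings, and the case-insensitive exact match are all assumptions made for illustration; the claims do not say how the comparison is performed. The sketch also checks the directory name itself, which is an added convenience rather than something the claims recite.

```python
from typing import Dict, List, Optional


def identify_user(
    spelled_result: str,
    alternates: Dict[str, List[str]],
) -> Optional[str]:
    """Map a recognizer's spelled name result to a directory user.

    `alternates` maps each directory name to the alternate spellings an
    administrator stored for it (hypothetical layout).
    """
    key = spelled_result.strip().casefold()
    for user, spellings in alternates.items():
        # Assumption: the directory name counts as a valid spelling too.
        candidates = [user, *spellings]
        if any(key == s.strip().casefold() for s in candidates):
            return user
    return None  # no stored spelling matched


# Hypothetical spellings an administrator might enter for a low confidence name.
alternates = {"Nguyen": ["Win", "Nwen", "New-yen"]}

matched = identify_user("win", alternates)
print(matched)  # Nguyen
```

A real attendant would more likely use a tolerant comparison (phonetic or edit-distance based); exact matching is used here only to keep the sketch short.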
Specification