Audio-augmented data keying
First Claim
1. A data-keying apparatus for keying data comprising groups of alphabetic characters corresponding to spoken speech portions, comprising:
- keypad-data receiving means for receiving a group of numeric keying signals, at least some of said numeric keying signals each corresponding to multiple alphabetic characters;
sound receiving means for detecting sounds corresponding to a spoken speech portion and for extracting features from said sounds;
template store means for storing a multiplicity of speech portion templates, each template indicative of features associated with sounds corresponding to a speech portion and indicative of the spelling thereof;
culling means responsive to said group of numeric keying signals for culling from the speech portion templates in said template store means a subset of speech portion templates such that each speech portion template in the subset has a spelling corresponding to said group of numeric keying signals;
and correlating means responsive to the extracted features for evaluating the correlation between the extracted features and the features of each speech portion template in the subset of speech portion templates, and for identifying the speech portion template in the subset of speech portion templates having the highest correlation with the extracted features.
0 Assignments
0 Petitions
Accused Products
Abstract
Present-day limitations of the conventional touch-tone keypad are overcome permitting alphabetic information to be entered into a distant computer. The caller speaks a speech portion into a telephone handset, and then types out the speech portion on the touch-tone keypad. The computer receiving the call converts the spoken voice information into a form suitable for additional digital processing, as by extracting speech-recognition features from the spoken information. The computer processes the typed numeric string into a list of all the possible combinations of characters it could represent. The extent of correlation between the features of the spoken speech portion and each of the combinations is determined, and the combination having the highest correlation is taken to be the speech portion entered by the user.
-
Citations
20 Claims
-
1. A data-keying apparatus for keying data comprising groups of alphabetic characters corresponding to spoken speech portions, comprising:
-
keypad-data receiving means for receiving a group of numeric keying signals, at least some of said numeric keying signals each corresponding to multiple alphabetic characters; sound receiving means for detecting sounds corresponding to a spoken speech portion and for extracting features from said sounds; template store means for storing a multiplicity of speech portion templates, each template indicative of features associated with sounds corresponding to a speech portion and indicative of the spelling thereof; culling means responsive to said group of numeric keying signals for culling from the speech portion templates in said template store means a subset of speech portion templates such that each speech portion template in the subset has a spelling corresponding to said group of numeric keying signals; and correlating means responsive to the extracted features for evaluating the correlation between the extracted features and the features of each speech portion template in the subset of speech portion templates, and for identifying the speech portion template in the subset of speech portion templates having the highest correlation with the extracted features. - View Dependent Claims (2, 3, 4, 5)
-
-
6. For use with a template store means for storing a multiplicity of speech portion templates, each template indicative of features associated with sounds corresponding to a speech portion and indicative of the spelling thereof, a method for keying data comprising groups of alphabetic characters corresponding to spoken speech portions, comprising the steps of:
-
receiving via a keypad a group of numeric keying signals, at least some of said numeric keying signals each corresponding to multiple alphabetic characters; detecting sounds corresponding to a spoken speech portion and extracting features from said sounds; culling from the speech portion templates in said template store means a subset of speech portion templates such that each speech portion template in the subset has a spelling corresponding to said group of numeric keying signals; and evaluating the correlation between the extracted features and the features of each speech portion template in the subset of speech portion templates, and identifying the speech portion template in the subset of speech portion templates having the highest correlation with the extracted features. - View Dependent Claims (7, 8, 9, 10)
-
-
11. A data-keying apparatus for keying data comprising groups of alphabetic characters corresponding to spoken speech portions, comprising:
-
keypad-data receiving means for receiving a group of numeric keying signals, at least some of said numeric keying signals each corresponding to multiple alphabetic characters; sound receiving means for detecting sounds corresponding to a spoken speech portion and for extracting features from said sounds; template generation means for generation of a multiplicity of speech portion templates, each template indicative of a spelling corresponding to said group of numeric keying signals; and correlating means responsive to the extracted features for evaluating the correlation between the extracted features and the features of each generated speech portion template, and for identifying the generated speech portion template in the multiplicity of speech portion templates having the highest correlation with the extracted features. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A method for keying data comprising groups of alphabetic characters corresponding to spoken speech portions, comprising the steps of:
-
receiving via a keypad a group of numeric keying signals, at least some of said numeric keying signals each corresponding to multiple alphabetic characters; detecting sounds corresponding to a spoken speech portion and extracting features from said sounds; generating from the multiple alphabetic characters a multiplicity of speech portion templates; and evaluating the correlation between the extracted features and the features of each speech portion template in the multiplicity of speech portion templates, and identifying the speech portion template in the multiplicity of speech portion templates having the highest correlation with the extracted features. - View Dependent Claims (17, 18, 19, 20)
-
Specification