Non-interactive enrollment in speech recognition
Abstract
A computer enrolls a user in a speech recognition system by obtaining data representing a user's speech, the speech including multiple user utterances and generally corresponding to an enrollment text, and analyzing acoustic content of data corresponding to a user utterance. The computer determines, based on the analysis, whether the user utterance matches a portion of the enrollment text. If so, the computer uses the acoustic content of the user utterance to update acoustic models corresponding to the portion of the enrollment text. The computer may determine that the user utterance matches a portion of the enrollment text even when the user has skipped or repeated words of the enrollment text.
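The matching step described in the abstract can be illustrated with a small sketch. It is not the patented method: the abstract describes matching on acoustic content, whereas this example assumes a hypothetical recognizer has already turned the utterance into a list of word hypotheses, and it substitutes a plain sequence alignment from Python's standard `difflib` module. The function name `match_utterance` and the `min_fraction` threshold are illustrative assumptions.

```python
from difflib import SequenceMatcher


def match_utterance(utterance_words, enrollment_words, min_fraction=0.6):
    """Return the enrollment-text word indices matched by the utterance.

    Assumption: `utterance_words` are word hypotheses from some recognizer.
    An empty list means the utterance is treated as not matching.
    """
    matcher = SequenceMatcher(a=enrollment_words, b=utterance_words, autojunk=False)
    matched = []
    # Each block is a run of identical words; gaps between blocks absorb
    # skipped words (missing from the utterance) and repeated words (extra).
    for block in matcher.get_matching_blocks():
        matched.extend(range(block.a, block.a + block.size))
    # Reject the utterance when too few of its words line up with the text.
    if not utterance_words or len(matched) / len(utterance_words) < min_fraction:
        return []
    return matched
```

With an enrollment text of "the quick brown fox jumps over the lazy dog", an utterance that skips "brown" and repeats "fox" still aligns to the words it shares with the text, while an unrelated utterance returns an empty list and would not be used for model updates.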
3 Claims
1. A computer-implemented method for enrolling a user in a speech recognition system, comprising:
providing an enrollment text;
recording a user's speech using a portable recording device, the speech generally corresponding to the enrollment text;
transferring the recorded speech to a computer;
using the computer to:
analyze acoustic content of the recorded speech;
identify, based on the analysis, portions of the speech that match portions of the enrollment text; and
update acoustic models corresponding to matched portions of the enrollment text using acoustic content of matching portions of the speech.
(Dependent claims: 2, 3)
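The final step of the claim, updating acoustic models only from the matching portions of the speech, can be sketched as follows. The claim does not specify the form of the acoustic models or the update rule, so this example makes loud assumptions: each model is a hypothetical per-unit mean feature vector, the update is a simple running average, and the input is a list of already-aligned segments in which unmatched speech carries the label `None`. The names `AcousticModel` and `adapt_models` are invented for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

Frame = List[float]  # one acoustic feature vector (assumed representation)


@dataclass
class AcousticModel:
    mean: Frame
    weight: int = 1  # number of frames the current mean already represents

    def update(self, frame: Frame) -> None:
        # Running-average update: a stand-in for a real adaptation rule.
        self.weight += 1
        self.mean = [m + (x - m) / self.weight for m, x in zip(self.mean, frame)]


def adapt_models(
    models: Dict[str, AcousticModel],
    segments: List[Tuple[Optional[str], List[Frame]]],
) -> int:
    """Update models from matched segments; return the number of frames used."""
    used = 0
    for unit, frames in segments:
        # Speech that matched no portion of the enrollment text is skipped.
        if unit is None or unit not in models:
            continue
        for frame in frames:
            models[unit].update(frame)
            used += 1
    return used
```

Only segments labeled with a known unit change a model, which mirrors the claim's restriction to matched portions; everything else in the recording is ignored rather than forcing an update.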
Specification