Language independent suprasegmental pronunciation tutoring system and methods
First Claim
1. A speech analysis system for voice and pronunciation training, the system stored as a series of programmed routines on a removable item of storage media, the system comprising:
- a data file that stores a pre-recorded utterance by a native speaker comprising a sequence of syllables and a first example metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables;
a speech capture routine that captures a sample utterance spoken by a user;
a segmentation routine that segments the sample utterance into a sequence of syllables;
a computation routine that computes a first sample metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables for the sample utterance; and
a display routine for graphically displaying the first example metric and the first sample metric to the user on a side-by-side basis.
5 Assignments
0 Petitions
Accused Products
Abstract
A pronunciation training system and methods are provided as a series of programmed routines stored on an item of removable storage media, and select information generated by a speech analysis engine to compute and display graphical representations of metrics useful to a student. The student selects from among a plurality of pre-recorded utterances spoken by a native speaker, and the student then records his or her pronunciation of the utterance. The software computes and displays graphical metrics for the native speaker'"'"'s utterance and the student'"'"'s utterance, in any of a variety of formats, on a side-by-side basis. The system also permits the student to repeat selected phrases and to monitor improvement by similarity between the graphical metrics.
72 Citations
40 Claims
-
1. A speech analysis system for voice and pronunciation training, the system stored as a series of programmed routines on a removable item of storage media, the system comprising:
-
a data file that stores a pre-recorded utterance by a native speaker comprising a sequence of syllables and a first example metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables;
a speech capture routine that captures a sample utterance spoken by a user;
a segmentation routine that segments the sample utterance into a sequence of syllables;
a computation routine that computes a first sample metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables for the sample utterance; and
a display routine for graphically displaying the first example metric and the first sample metric to the user on a side-by-side basis. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A speech analysis system for voice and pronunciation training, the system stored as a series of programmed routines on a removable item of storage media, the system comprising:
-
a data file that stores a pre-recorded utterance by a native speaker comprising a sequence of syllables and a first example metric corresponding to a duration, and an energy of a vocal part of, each one of the sequence of syllables;
a speech capture routine that captures a sample utterance spoken by a user;
a segmentation routine that segments the sample utterance into a sequence of syllables;
a computation routine that computes a first sample metric corresponding to a duration, and an energy of a vocal part of, each one of the sequence of syllables for the sample utterance; and
a display routine for graphically displaying the first example metric and the first sample metric to the user on a side-by-side basis. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A method of analyzing speech for voice and pronunciation training, the method comprising:
-
storing a pre-recorded utterance by a native speaker comprising a sequence of syllables and a first example metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables;
capturing a sample utterance spoken by a user;
segmenting the sample utterance into a sequence of syllables;
computing a first sample metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables for the sample utterance; and
graphically displaying the first example metric and the first sample metric to the user on a side-by-side basis. - View Dependent Claims (22, 23, 24, 25, 26, 27, 28, 29, 30)
storing a second example metric corresponding to a duration, and an energy of the vocal part of, each one of the sequence of syllables of the pre-recorded utterance;
computing a second sample metric corresponding to a duration and an energy of the vocal part of each one of the sequence of syllables of the sample utterance; and
selectably graphically displaying either the first example metric and the first sample metric to the user on a side-by-side basis or the second example metric and the second sample metric to the user on a side-by-side basis.
-
-
29. The method of claim 28 wherein graphically displaying the second sample metric comprises graphically representing the sequence of syllables as a series of steps, wherein for each syllable in the sequence, a length of each step represents the duration of the corresponding syllable and a height of each step represents the energy of the corresponding syllable.
-
30. The method of claim 21 further comprising selectably graphically displaying either the first example metric and the first sample metric to the user on a side-by-side basis or a waveform for the pre-recorded utterance and a waveform for the sample utterance to the user on a side-by-side basis.
-
31. A method of analyzing speech for voice and pronunciation training, the method comprising:
-
storing a pre-recorded utterance by a native speaker comprising a sequence of syllables and a first example metric corresponding to a duration, and an energy of a vocal part of, each one of the sequence of syllables;
capturing a sample utterance spoken by a user;
segmenting the sample utterance into a sequence of syllables;
computing a first sample metric corresponding to a duration, and an energy of a vocal part of, each one of the sequence of syllables for the sample utterance; and
graphically displaying the first example metric and the first sample metric to the user on a side-by-side basis. - View Dependent Claims (32, 33, 34, 35, 36, 37, 38, 39, 40)
storing a second example metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables of the pre-recorded utterance;
computing a second sample metric corresponding to a pitch value of a vocal part of each one of the sequence of syllables of the sample utterance; and
selectably graphically displaying either the first example metric and the first sample metric to the user on a side-by-side basis or the second example metric and the second sample metric to the user on a side-by-side basis.
-
-
40. The method of claim 31 further comprising selectably graphically displaying either the first example metric and the first sample metric to the user on a side-by-side basis or a waveform for the pre-recorded utterance and a waveform for the sample utterance to the user on a side-by-side basis.
Specification