Creating speech models
First Claim
1. A method for selecting human speech samples for a speech model of human speech, the speech model including audio data specific to a particular sound in human speech, comprising the steps of:
- presenting a graphic representing a human speech sample in a first area of a user interface on a computer display;
responsive to user input, marking a segment of the graphic, the marked segment of the graphic representing a portion of the human speech sample;
responsive to user input, playing the portion of the human speech sample represented by the marked segment; and
selecting the portion of the human speech sample for inclusion in the speech model,wherein the human speech sample is used for evaluating the accuracy of a later produced human speech sample as the particular sound.
2 Assignments
0 Petitions
Accused Products
Abstract
Selecting human speech samples for a speech model of human speech is preformed. The system presents a graphic representing a human speech sample on a computer display, e.g., an amplitude vs. time graph of the speech sample. Through user input, the system marks a segment of the graphic. The marked segment of the graphic represents a portion of the human speech sample. The system plays the portion of the human speech sample represented by the marked segment back to the user to allow the user to determine its acceptability for inclusion in the speech model. If so indicated by the user, the portion of the human speech sample represented by the marked segment is selected for inclusion in the speech model. The system also analyzes the portion of the human speech sample represented by the marked segment for acoustic properties. These properties are presented to the user in a graphic of the analyzed portion representative of the acoustic properties, e.g., a spectral analysis of the sample graphed as a set of spectral lines. Thus, the user can select the analyzed portion for inclusion in the speech model due to the presence of desired acoustic properties in the analyzed portion.
40 Citations
21 Claims
-
1. A method for selecting human speech samples for a speech model of human speech, the speech model including audio data specific to a particular sound in human speech, comprising the steps of:
-
presenting a graphic representing a human speech sample in a first area of a user interface on a computer display; responsive to user input, marking a segment of the graphic, the marked segment of the graphic representing a portion of the human speech sample; responsive to user input, playing the portion of the human speech sample represented by the marked segment; and selecting the portion of the human speech sample for inclusion in the speech model, wherein the human speech sample is used for evaluating the accuracy of a later produced human speech sample as the particular sound. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A system including processor, memory, display and input devices for selecting human speech samples for a speech model of human speech, the speech model including audio data specific to a particular sound in human speech comprising:
-
means for presenting a graphic representing acoustic values of a speech sample in a first area of a user interface on the display; means responsive to user input for marking a segment of the graphic, the marked segment of the graphic representing a portion of the speech sample; means for analyzing the portion of the speech sample represented by the marked segment for acoustic properties different from the acoustic values; means for presenting a graphic of the analyzed portion representative of the acoustic properties in a second area of the user interface; and means for selecting the analyzed portion for inclusion in the speech model. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A computer program product in a computer readable medium for selecting human speech samples for a speech model of human speech, the speech model including audio data specific to a particular sound in human speech, comprising:
-
means for presenting a graphic representing acoustic values of a speech sample in a first area of a user interface on the display; means for analyzing the speech sample for desired acoustic properties; means for presenting a graphic of an analyzed portion representative of the desired acoustic properties in a second area of the user interface, wherein the desired acoustic properties are different from acoustic values presented in the first area; and means for including the speech sample in the speech model. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification