Data process unit and data process unit control program
Abstract
To provide a data process unit and a data process unit control program that are suitable for generating acoustic models for unspecified speakers, taking into consideration the distribution of diversifying feature parameters under specific conditions such as the type of speaker, speech lexicons, speech styles, and speech environment, and that are suitable for providing acoustic models intended for unspecified speakers and adapted to the speech of a specific person.
A data process unit 1 comprises a data classification section 1a, data storing section 1b, pattern model generating section 1c, data control section 1d, mathematical distance calculating section 1e, pattern model converting section 1f, pattern model display section 1g, region dividing section 1h, division changing section 1i, region selecting section 1j, and specific pattern model generating section 1k.
143 Citations
75 Claims
1-37. (canceled)
38. A data process unit comprising:
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker acquired by the speech data acquiring means and the plurality of pattern models in the acoustic space stored by the acoustic space storing means;
speech data evaluating means for evaluating value of the speech data of the target speaker based on the position calculated by the position calculating means;
evaluation result display means for displaying evaluation results produced by the speech data evaluating means; and
positional relationship information display means for displaying information about positional relationship between the speech data and pattern models around the speech data in the acoustic space based on the calculated position.
Dependent claims: 39-50, 52-56
51. A data process method comprising the steps of:
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker in the acoustic space based on the acquired speech data and the plurality of pattern models in the acoustic space;
evaluating value of the speech data of the target speaker based on the calculated position; and
displaying the evaluation results.
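The steps of claim 51 can be sketched in a few lines of Python. The claim does not say how the position or the value is computed, so everything specific here is an assumption made for illustration: each pattern model is reduced to a single feature vector, the target speaker's position is the mean of the target's feature frames, and speech data is rated as more valuable when few pattern models lie within a hypothetical `radius` of that position. The names `centroid`, `evaluate_value`, `radius`, and `min_neighbors` do not come from the patent.

```python
import math

def centroid(frames):
    """Average equal-length feature vectors into one point (assumed position estimate)."""
    dim = len(frames[0])
    return [sum(frame[i] for frame in frames) / len(frames) for i in range(dim)]

def evaluate_value(position, acoustic_space, radius, min_neighbors):
    """Assumed criterion: data in a sparsely populated region is rated 'high' value."""
    neighbors = sorted(
        name for name, model in acoustic_space.items()
        if math.dist(position, model) <= radius
    )
    rating = "high" if len(neighbors) < min_neighbors else "low"
    return rating, neighbors

# Hypothetical acoustic space: one 2-D vector per speaker's pattern model.
acoustic_space = {"spk_a": [0.0, 0.0], "spk_b": [1.0, 0.0], "spk_c": [5.0, 5.0]}
target_frames = [[0.5, 0.25], [0.5, -0.25]]  # made-up feature frames

position = centroid(target_frames)
rating, neighbors = evaluate_value(position, acoustic_space, radius=1.0, min_neighbors=3)

# Stand-in for the display step: print the evaluation result.
print(f"position={position} value={rating} nearby_models={neighbors}")
```

A real system would more likely hold statistical models rather than single vectors and would use a distance suited to them; the Euclidean distance here only keeps the sketch short.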
57. A data process unit comprising:
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
similar-speaker detecting means for detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models; and
positional relationship information display means for displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers.
Dependent claims: 58, 60, 64, 66-70
59. A data process unit comprising:
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
specific speaker specifying means for specifying a specific speaker among the plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
similarity evaluating means for evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
evaluation result display means for displaying evaluation results produced by the similarity evaluating means; and
positional relationship information display means for displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
Dependent claims: 61-63, 65
71. A data process method comprising the steps of:
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models; and
displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers.
Dependent claims: 72
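One possible reading of the similar-speaker detection step in claim 71 is a nearest-neighbor search in the acoustic space. The sketch below is illustrative only and rests on assumptions the claim does not state: pattern models are plain feature vectors, closeness is Euclidean distance, and "similar" means the `k` nearest models. The function names `detect_similar_speakers` and `show_positional_relationship` and the parameter `k` are hypothetical.

```python
import math

def detect_similar_speakers(position, acoustic_space, k):
    """Return the k pattern models closest to the target position, nearest first."""
    ranked = sorted(
        (math.dist(position, model), name) for name, model in acoustic_space.items()
    )
    return [(name, distance) for distance, name in ranked[:k]]

def show_positional_relationship(similar):
    """Plain-text stand-in for the display step: one line per similar speaker."""
    return [f"{name}: distance {distance:.2f} from target" for name, distance in similar]

# Hypothetical acoustic space: one 2-D vector per speaker's pattern model.
acoustic_space = {"spk_a": [0.0, 0.0], "spk_b": [3.0, 4.0], "spk_c": [6.0, 8.0]}
target_position = [0.0, 0.0]  # assumed to be already calculated

similar = detect_similar_speakers(target_position, acoustic_space, k=2)
for line in show_positional_relationship(similar):
    print(line)
```

A fixed distance threshold instead of a fixed `k` would fit the claim language equally well; the claim leaves that choice open.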
73. A data process method comprising:
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
specifying a specific speaker among the plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
displaying evaluation results; and
displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
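The similarity evaluation in claim 73 compares the target speaker against one chosen speaker rather than searching the whole space. As a rough sketch, assuming vector-valued pattern models and a score of 1 / (1 + distance), neither of which is specified in the claim, it might look like this. `evaluate_similarity` and `positional_relationship` are hypothetical names.

```python
import math

def evaluate_similarity(position, specific_model):
    """Assumed score: 1 / (1 + Euclidean distance); 1.0 means identical positions."""
    return 1.0 / (1.0 + math.dist(position, specific_model))

def positional_relationship(position, specific_model):
    """Per-dimension offset from the target position to the specific speaker's model."""
    return [m - p for p, m in zip(position, specific_model)]

# Hypothetical acoustic space: one 2-D vector per speaker's pattern model.
acoustic_space = {"spk_a": [0.0, 0.0], "spk_b": [3.0, 4.0]}
specific_speaker = "spk_b"    # the speaker chosen in the specifying step
target_position = [0.0, 0.0]  # assumed to be already calculated

model = acoustic_space[specific_speaker]
score = evaluate_similarity(target_position, model)
offset = positional_relationship(target_position, model)

# Stand-in for the two display steps.
print(f"similarity to {specific_speaker}: {score:.3f}, offset: {offset}")
```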
74. A data process unit control program comprising:
an acoustic space storing step of storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
a speech data acquiring step of acquiring speech data of a target speaker;
a position calculating step of calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
a similar-speaker detecting step of detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models;
a positional relationship information display step of displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers;
a speaker specifying step of specifying a specific speaker;
a similarity evaluating step of evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker in the acoustic space; and
an evaluation result display step of displaying evaluation results produced by the similarity evaluating step,
wherein the positional relationship information display step displays information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
75. A data process unit control program comprising:
an acoustic space storing step of storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
a specific speaker specifying step of specifying a specific speaker among the plurality of speakers;
a speech data acquiring step of acquiring speech data of a target speaker;
a position calculating step of calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
a similarity evaluating step of evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
an evaluation result display step of displaying evaluation results produced by the similarity evaluating step; and
a positional relationship information display step of displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
Specification