Data process unit and data process unit control program
First Claim
1. A data process unit comprising:
- data classification means for classifying a plurality of predetermined data on a plurality of objects into a plurality of groups based on a plurality of specific conditions;
pattern model generating means for generating a plurality of pattern models which have 4-dimensional or higher dimensional elements for each group of predetermined data based on the predetermined data classified by the data classification means;
mathematical distance calculating means for calculating mathematical distance among the pattern models generated by the pattern model generating means for the respective groups;
pattern model converting means for converting the plurality of pattern models into the same number of low dimensional vectors corresponding to the pattern models in the lower dimension while maintaining distance relationship among the pattern models, based on the mathematical distance calculated by the mathematical distance calculating means; and
low dimensional vector corresponding to pattern model display means for displaying the plurality of low dimensional vectors corresponding to pattern models as coordinate points in a low dimensional space of the same dimension as the low dimensional vectors corresponding to pattern models while maintaining the distance relationship, based on values of low dimensional elements.
1 Assignment
0 Petitions
Accused Products
Abstract
To provide a data process unit and data process unit control program which are suitable for generating acoustic models for unspecified speakers taking distribution of diversifying feature parameters into consideration under such specific conditions as the type of speaker, speech lexicons, speech styles, and speech environment and which are suitable for providing acoustic models intended for unspecified speakers and adapted to speech of a specific person.
A data process unit 1 comprises a data classification section 1a, data storing section 1b, pattern model generating section 1c, data control section 1d, mathematical distance calculating section 1e, pattern model converting section 1f, pattern model display section 1g, region dividing section 1h, division changing section 1i, region selecting section 1j, and specific pattern model generating section 1k.
-
Citations
75 Claims
-
1. A data process unit comprising:
-
data classification means for classifying a plurality of predetermined data on a plurality of objects into a plurality of groups based on a plurality of specific conditions;
pattern model generating means for generating a plurality of pattern models which have 4-dimensional or higher dimensional elements for each group of predetermined data based on the predetermined data classified by the data classification means;
mathematical distance calculating means for calculating mathematical distance among the pattern models generated by the pattern model generating means for the respective groups;
pattern model converting means for converting the plurality of pattern models into the same number of low dimensional vectors corresponding to the pattern models in the lower dimension while maintaining distance relationship among the pattern models, based on the mathematical distance calculated by the mathematical distance calculating means; and
low dimensional vector corresponding to pattern model display means for displaying the plurality of low dimensional vectors corresponding to pattern models as coordinate points in a low dimensional space of the same dimension as the low dimensional vectors corresponding to pattern models while maintaining the distance relationship, based on values of low dimensional elements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A data process unit comprising:
-
data classification means for classifying a plurality of predetermined data on a plurality of objects into a plurality of groups based on a plurality of specific conditions;
pattern model generating means for generating a plurality of pattern models which have 4-dimensional or higher dimensional elements for each group of predetermined data based on the predetermined data classified by the data classification means;
mathematical distance calculating means for calculating mathematical distance among the pattern models generated by the pattern model generating means for the respective groups;
pattern model converting means for converting the plurality of pattern models into the same number of low dimensional vectors corresponding to the pattern models in the lower dimension while maintaining distance relationship among the pattern models, based on the mathematical distance calculated by the mathematical distance calculating means; and
low dimensional vector corresponding to pattern model display means for displaying the plurality of low dimensional vectors corresponding to pattern models as coordinate points in a low dimensional space of the same dimension as the low dimensional vectors corresponding to pattern models while maintaining the distance relationship, based on values of low dimensional elements, wherein the mathematical distance calculating means uses the occurrence frequency of each pattern unit in the plurality of predetermined data on the plurality of objects when calculating the mathematical distance. - View Dependent Claims (17, 35)
-
-
18. A data process unit comprising:
-
data classification means for classifying a plurality of predetermined data on a plurality of objects into a plurality of groups based on a plurality of specific conditions;
pattern model generating means for generating a plurality of pattern models which have 4-dimensional or higher dimensional elements for each group of predetermined data based on the predetermined data classified by the data classification means;
mathematical distance calculating means for calculating mathematical distance among the pattern models generated by the pattern model generating means for the respective groups;
pattern model converting means for converting the plurality of pattern models into the same number of low dimensional vectors corresponding to the pattern models while maintaining distance relationship among the pattern models, based on the mathematical distance calculated by the mathematical distance calculating means;
low dimensional vector corresponding to pattern model display means for displaying the plurality of low dimensional vectors corresponding to pattern models as coordinate points in a low dimensional space of the same dimension as the low dimensional vectors corresponding to pattern models while maintaining the distance relationship, based on values of low dimensional elements;
region dividing means for automatically dividing the coordinate points of the plurality of low dimensional vectors corresponding to pattern models displayed in the low dimensional space by the low dimensional vector corresponding to pattern model display means into a plurality of regions in the low dimensional space;
regional pattern model generating means for generating regional pattern models for each region based on predetermined data corresponding to coordinate points of the low dimensional vectors corresponding to pattern models contained in the segment regions;
predetermined-data acquiring means for acquiring predetermined data on a new object; and
regional pattern model searching means for calculating likelihood of regional pattern models for respective segment regions in relation to the acquired predetermined data and searching the regional pattern models generated by the regional pattern model generating means for a regional pattern model with recognition performance suitable for recognizing the predetermined data on the new object based on the calculated likelihood. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 36, 37)
-
-
38. A data process unit comprising:
-
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker acquired by the speech data acquiring means and the plurality of pattern models in the acoustic space stored by the acoustic space storing means;
speech data evaluating means for evaluating value of the speech data of the target speaker based on the position calculated by the position calculating means;
evaluation result display means for displaying evaluation results produced by the speech data evaluating means; and
positional relationship information display means for displaying information about positional relationship between the speech data and pattern models around the speech data in the acoustic space based on the calculated position. - View Dependent Claims (39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 52, 53, 54, 55, 56)
-
-
51. A data process method comprising the steps of:
-
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker in the acoustic space based on the acquired speech data and the plurality of pattern models in the acoustic space;
evaluating value of the speech data of the target speaker based on the calculated position; and
displaying the evaluation results.
-
-
57. A data process unit comprising:
-
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
similar-speaker detecting means for detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models; and
positional relationship information display means for displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers. - View Dependent Claims (58, 60, 64, 66, 67, 68, 69, 70)
-
-
59. A data process unit comprising:
-
acoustic space storing means for storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
specific speaker specifying means for specifying a specific speaker among the plurality of speakers;
speech data acquiring means for acquiring speech data of a target speaker;
position calculating means for calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
similarity evaluating means for evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
evaluation result display means for displaying evaluation results produced by the similarity evaluating means; and
positional relationship information display means for displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker. - View Dependent Claims (61, 62, 63, 65)
-
-
71. A data process method comprising the steps of:
-
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models; and
displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers. - View Dependent Claims (72)
-
-
73. A data process unit comprising the steps of:
-
preparing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
specifying a specific speaker among the plurality of speakers;
acquiring speech data of a target speaker;
calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
displaying evaluation results; and
displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
-
-
74. A data process unit control program comprising:
-
an acoustic space storing step of storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
a speech data acquiring step of acquiring speech data of a target speaker;
a position calculating step of calculating position of the speech data of the target speaker in the acoustic space based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
a similar-speaker detecting step of detecting similar speakers who resemble the target speaker in speech out of the plurality of speakers based on the position of the speech data and the plurality of pattern models;
a positional relationship information display step of displaying information about positional relationship between the speech data of the target speaker and pattern models of the similar speakers in the acoustic space based on the position of the speech data and the pattern models of the similar speakers;
a speaker specifying step of specifying a specific speaker;
a similarity evaluating step of evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker in the acoustic space; and
an evaluation result display step of displaying evaluation results produced by the similarity evaluating step, wherein the positional relationship information display step displays information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
-
-
75. A data process unit control program comprising:
-
an acoustic space storing step of storing an acoustic space composed of a plurality of pattern models generated from speech data of a plurality of speakers;
a specific speaker specifying step of specifying a specific speaker among the plurality of speakers;
a speech data acquiring step of acquiring speech data of a target speaker;
a position calculating step of calculating position of the speech data of the target speaker based on the speech data of the target speaker and the plurality of pattern models in the acoustic space;
a similarity evaluating step of evaluating similarity in speech between the specific speaker and the target speaker based on the position of the speech data and the pattern model of the specific speaker;
an evaluation result display step of displaying evaluation results produced by the similarity evaluating step; and
a positional relationship information display step of displaying information about positional relationship between the speech data of the target speaker and pattern model of the specific speaker in the acoustic space based on the position of the speech data and the pattern model of the specific speaker.
-
Specification