RECOGNITION DICTIONARY CREATING DEVICE, VOICE RECOGNITION DEVICE, AND VOICE SYNTHESIZER
First Claim
Patent Images
1. A recognition dictionary creating device comprising:
- an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features;
an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language;
an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns stored in said acoustic standard pattern storage unit to create a phoneme label string of said inputted voice;
a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered;
a language storage unit for storing information showing a language of the phoneme label string which is registered in said user dictionary;
a language switching unit for switching from a language to another language;
a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; and
a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language shown by the information stored in said language storage unit into a phoneme label string expressed in the other language which said language switching unit has switched.
1 Assignment
0 Petitions
Accused Products
Abstract
A recognition dictionary creating device includes a user dictionary in which a phoneme label string of an inputted voice is registered and an interlanguage acoustic data mapping table in which a correspondence between phoneme labels in different languages is defined, and refers to the interlanguage acoustic data mapping table to convert the phoneme label string registered in the user dictionary and expressed in a language set at the time of creating the user dictionary into a phoneme label string expressed in another language which the recognition dictionary creating device has switched.
-
Citations
6 Claims
-
1. A recognition dictionary creating device comprising:
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns stored in said acoustic standard pattern storage unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered; a language storage unit for storing information showing a language of the phoneme label string which is registered in said user dictionary; a language switching unit for switching from a language to another language; a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; and a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language shown by the information stored in said language storage unit into a phoneme label string expressed in the other language which said language switching unit has switched.
-
2. A voice recognition device comprising:
-
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns stored in said acoustic standard pattern storage unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered; a language storage unit for storing information showing a language of the phoneme label string which is registered in said user dictionary; a language switching unit for switching from a language to another language; a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language shown by the information stored in said language storage unit into a phoneme label string expressed in the other language to which said language switching unit has switched; a general dictionary storage unit for storing a general dictionary having a vocabulary expressed by said acoustic standard patterns; a dictionary comparing unit for comparing the phoneme label string of said inputted voice created by said acoustic data matching unit with said general dictionary and said user dictionary to specify a word which is most similar to the phoneme label string of said inputted voice from said general dictionary and said user dictionary; and a recognition result output unit for outputting the word specified by said dictionary comparing unit as a voice recognition result.
-
-
3. A voice synthesizer comprising:
-
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns stored in said acoustic standard pattern storage unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered; a language storage unit for storing information showing a language of the phoneme label string which is registered in said user dictionary; a language switching unit for switching from a language to another language; a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language shown by the information stored in said language storage unit into a phoneme label string expressed in the other language to which said language switching unit has switched; a text input unit for inputting a text; a registered word part detecting unit for detecting a word part corresponding to the phoneme label string registered in said user dictionary from a character string of the text inputted from said text input unit; a registered word replacing unit for replacing said word part detected by said registered word part detecting unit with the phoneme label string acquired from said user dictionary and corresponding to said word part; a general dictionary replacing unit for replacing a part of the character string of said text other than said word part detected by said registered word part detecting unit with a phoneme label string of a corresponding word in said general dictionary; and a voice synthesis unit for creating a synthetic voice of said text from the phoneme label strings of said text which are acquired by said registered word replacing unit and said general dictionary replacing unit.
-
-
4. A recognition dictionary creating device comprising:
-
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic standard pattern setting unit for selecting acoustic standard patterns for a preset language from among the acoustic standard patterns stored in said acoustic standard pattern storage unit; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns for the language which are selected by said acoustic standard pattern setting unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered;
a language switching unit for switching from a language to another language;a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; and a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language selected by said acoustic standard pattern setting unit into a phoneme label string expressed in the other language to which said language switching unit has switched.
-
-
5. A voice recognition device comprising:
-
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic standard pattern setting unit for selecting acoustic standard patterns for a preset language from among the acoustic standard patterns stored in said acoustic standard pattern storage unit; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns for the language which are selected by said acoustic standard pattern setting unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered;
a language switching unit for switching from a language to another language;a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language selected by said acoustic standard pattern setting unit into a phoneme label string expressed in the other language to which said language switching unit has switched; a general dictionary storage unit for storing a general dictionary having a vocabulary expressed by said acoustic standard patterns; a dictionary comparing unit for comparing the phoneme label string of said inputted voice created by said acoustic data matching unit with said general dictionary and said user dictionary to specify a word which is most similar to the phoneme label string of said inputted voice from said general dictionary and said user dictionary; and a recognition result output unit for outputting the word specified by said dictionary comparing unit as a voice recognition result.
-
-
6. A voice synthesizer comprising:
-
an acoustic analysis unit for performing an acoustic analysis on a voice signal of an inputted voice to output a time series of acoustic features; an acoustic standard pattern storage unit for storing acoustic standard patterns showing standard acoustic features for each language; an acoustic standard pattern setting unit for selecting acoustic standard patterns for a preset language from among the acoustic standard patterns stored in said acoustic standard pattern storage unit; an acoustic data matching unit for comparing the time series of acoustic features of said inputted voice which are inputted thereto from said acoustic analysis unit with the acoustic standard patterns for the language which are selected by said acoustic standard pattern setting unit to create a phoneme label string of said inputted voice; a user dictionary storage unit for storing a user dictionary in which said phoneme label string of said inputted voice created by said acoustic data matching unit is registered; a language switching unit for switching from a language to another language; a mapping table storage unit for storing a mapping table in which a correspondence between phoneme labels in different languages is defined; a phoneme label string converting unit for referring to the mapping table stored in said mapping table storage unit to convert the phoneme label string registered in said user dictionary and expressed in the language selected by said acoustic standard pattern setting unit into a phoneme label string expressed in the other language to which said language switching unit has switched; a text input unit for inputting a text; a registered word part detecting unit for detecting a word part corresponding to the phoneme label string registered in said user dictionary from a character string of the text inputted from said text input unit; a registered word replacing unit for replacing said word part detected by said registered word part detecting unit with the phoneme label string acquired from said user dictionary and corresponding to said word part; a general dictionary replacing unit for replacing a part of the character string of said text other than said word part detected by said registered word part detecting unit with a phoneme label string of a corresponding word in said general dictionary; and a voice synthesis unit for creating a synthetic voice of said text from the phoneme label strings of said text which are acquired by said registered word replacing unit and said general dictionary replacing unit.
-
Specification