Device method, and computer program product for calculating score representing correctness of voice
First Claim
1. A voice processing device, comprising:
- a processor that operates as;
a presenting unit that presents text to an operator;
a voice acquisition unit that acquires a voice of the operator reading aloud the text;
an identifying unit that identifies output intervals of phonemes included in the voice of the operator;
a determination unit that determines whether each of time lengths of the output intervals is normal;
a frequency acquisition unit that acquires frequency values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, each context including the phoneme and another phoneme adjacent to at least one side of the phoneme;
a weight calculator that calculates a weight corresponding to each of the phonemes in accordance with a frequency value of the context; and
a score calculator that calculates, as a score representing correctness of the voice of the operator, a value in accordance with a ratio of a first value to a second value, the first value representing a sum of weights corresponding to the phonemes, the second value representing a sum of weights corresponding to the phonemes having the time lengths of the output intervals that are determined as normal.
4 Assignments
0 Petitions
Accused Products
Abstract
According to an embodiment, a voice processor includes a presenting unit to present text to an operator; a voice acquisition unit to acquire a voice of the operator reading aloud the text; an identifying unit to identify output intervals of phonemes included in the voice; a determination unit to determine whether each of time lengths of the output intervals is normal; a frequency acquisition unit to acquire frequency values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, the context including the phoneme and another phoneme adjacent to at least one side of the phoneme; and a score calculator to calculate a score representing correctness of the voice on the basis of the determination results of the time lengths of the output intervals and the frequency values of the contexts acquired respectively corresponding to the phonemes.
-
Citations
8 Claims
-
1. A voice processing device, comprising:
-
a processor that operates as; a presenting unit that presents text to an operator; a voice acquisition unit that acquires a voice of the operator reading aloud the text; an identifying unit that identifies output intervals of phonemes included in the voice of the operator; a determination unit that determines whether each of time lengths of the output intervals is normal; a frequency acquisition unit that acquires frequency values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, each context including the phoneme and another phoneme adjacent to at least one side of the phoneme; a weight calculator that calculates a weight corresponding to each of the phonemes in accordance with a frequency value of the context; and a score calculator that calculates, as a score representing correctness of the voice of the operator, a value in accordance with a ratio of a first value to a second value, the first value representing a sum of weights corresponding to the phonemes, the second value representing a sum of weights corresponding to the phonemes having the time lengths of the output intervals that are determined as normal. - View Dependent Claims (2, 3, 4, 5)
-
-
6. A voice processing method, comprising:
- presenting, by a processor, text to an operator;
acquiring, by the processor, a voice of the operator reading aloud the text;identifying, by the processor, output intervals of phonemes included in the voice of the operator; determining, by the processor, whether each of time lengths of the output intervals is normal; acquiring, by the processor, frequency values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, each context including the corresponding phoneme and another phoneme adjacent to at least one side of the phoneme; calculating, by the processor, a weight corresponding to each of the phonemes in accordance with a frequency value of the context; and calculating, by the processor, as a score representing correctness of the voice of the operator, a value in accordance with a ratio of a first value to a second value, the first value representing a sum of weights corresponding to the phonemes, the second value representing a sum of weights corresponding to the phonemes having the time lengths of the output intervals that are determined as normal.
- presenting, by a processor, text to an operator;
-
7. A computer program product comprising a non-transitory
computer-readable medium containing a voice processing program that causes a computer to function as: -
a presenting unit that presents text to an operator; a voice acquisition unit that acquires a voice of the operator reading aloud the text; an identifying unit that identifies output intervals of phonemes included in the voice of the operator; a determination unit that determines whether each of time lengths of the output intervals is normal; a frequency acquisition unit that acquires frequency values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, each context including the phoneme and another phoneme adjacent to at least one side of the phoneme; a weight calculator that calculates a weight corresponding to each of the phonemes in accordance with a frequency value of the context; and a score calculator that calculates, as a score representing correctness of the voice of the operator, a value in accordance with a ratio of a first value to a second value, the first value representing a sum of weights corresponding to the phonemes, the second value representing a sum of weights corresponding to the phonemes having the time lengths of the output intervals that are determined as normal.
-
-
8. A voice processing device, comprising:
-
a processor that operates as; a presenting unit that presents text to an operator; a voice acquisition unit that acquires a voice of the operator reading aloud the text; an identifying unit that identifies output intervals of phonemes included in the voice of the operator; a determination unit that determines whether each of time lengths of the output intervals is normal; a frequency acquisition unit that acquires frequency Values respectively representing occurrence frequencies of contexts, respectively corresponding to the phonemes, each context including the phoneme and another phoneme adjacent to at least one side of the phoneme; a score calculator that calculates a score representing correctness of the voice of the operator on the basis of the frequency values of the contexts acquired respectively corresponding to the phonemes having the time lengths of the output intervals that are determined as normal; a frequency storage unit that stores therein occurrence frequencies of a plurality of contexts included in voices acquired in the past, as the frequency values; an updating unit that updates the frequency values, which are stored in the frequency storage unit, of the contexts corresponding to the phonemes included in the voice of the operator reading aloud the text in accordance with the score; and a text selector that selects, as the text, one piece of text from among a plurality of pieces of candidate text, wherein the text selector selects the text on the basis of the frequency values of contexts corresponding to a plurality of phonemes included in the pieces of candidate text when the pieces of candidate text are read aloud.
-
Specification