Method, apparatus and program capable of outputting response perceivable to a user as natural-sounding
First Claim
Patent Images
1. A voice processing apparatus comprising:
- at least one processor configured to execute stored instructions to;
receive an inputted voice from a user and perform language analysis on the inputted voice;
obtain a primary response data representative of a response to the inputted voice from a database;
analyze whether the primary response data includes an interjection with 2 or less syllables, wherein the interjection is included in a repetition target;
in a case where the analyzed primary response data is determined to include the interjection with 2 or less syllables, generate a voice sequence from a secondary response data that includes the repetition target repeated at least twice;
synthesize a voice based on the voice sequence; and
output the synthesized voice in digital form that includes the secondary response data.
1 Assignment
0 Petitions
Accused Products
Abstract
A voice synthesizing apparatus includes: a voice inputter (102) configured to input a voice; an obtainer (22) configured to obtain a primary response to the voice inputted by the voice inputter (102); an analyzer (112) configured to analyze whether the primary response includes a repetition target; and a voice synthesizer (24) configured to, in a case where the analyzed primary response is determined to include the repetition target, synthesize a voice from a secondary response that includes the repetition target repeated at least twice to output the voice.
15 Citations
11 Claims
-
1. A voice processing apparatus comprising:
-
at least one processor configured to execute stored instructions to; receive an inputted voice from a user and perform language analysis on the inputted voice; obtain a primary response data representative of a response to the inputted voice from a database; analyze whether the primary response data includes an interjection with 2 or less syllables, wherein the interjection is included in a repetition target; in a case where the analyzed primary response data is determined to include the interjection with 2 or less syllables, generate a voice sequence from a secondary response data that includes the repetition target repeated at least twice; synthesize a voice based on the voice sequence; and output the synthesized voice in digital form that includes the secondary response data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A voice processing method comprising:
-
receiving, by at least one processor, an inputted voice from a user and perform language analysis on the inputted voice; obtaining, by the at least one processor, a primary response data representative of a response to the inputted voice from a database; analyzing, by the at least one processor, whether the primary response data includes an interjection with 2 or less syllables, wherein the interjection is included in a repetition target; in a case where the analyzed primary response data is determined to include the interjection with 2 or less syllables, generating a voice sequence from a secondary response data that includes the repetition target repeated at least twice; synthesizing, by the at least one processor, a voice based on the voice sequence; and outputting the synthesized voice in digital form that includes the secondary response data.
-
-
11. A non-transitory computer readable medium storing executable instructions, the executable instructions when executed by at least one processor performs a voice processing method, the method comprising:
-
receiving an inputted voice from a user and perform language analysis on the inputted voice; obtaining a primary response data representative of a response to the inputted voice from a database; analyzing whether the primary response data includes an interjection with 2 or less syllables, wherein the interjection is included in a repetition target; in a case where the analyzed primary response data is determined to include the interjection with 2 or less syllables, generating a voice sequence from a secondary response data that includes the repetition target repeated at least twice; synthesizing a voice based on the voice sequence; and outputting the synthesize voice in digital form that includes the secondary response data.
-
Specification