Dialogue-sound processing apparatus and method
First Claim
1. Dialogue-sound processing apparatus, comprising:
sound input means for inputting speech fragments of dialogue-sound in sequence;
clue extraction means for extracting a plurality of clues, each clue comprising a word or prosodic feature representing a flow of a dialogue from the speech fragments;
utterance function rule memory means for memorizing a plurality of utterance function rules, each rule defining a relation between one of the clues and an utterance function representing a pragmatic effect for the flow of the dialogue;
utterance function extraction means for assigning the utterance function to the clue extracted by said clue extraction means in accordance with the corresponding utterance function rule; and
discourse structure generation means for generating a discourse structure representing the flow of the dialogue of the speech fragments in accordance with the assigned utterance function.
1 Assignment
0 Petitions
Abstract
A dialogue-sound processing apparatus of the present invention generates a discourse structure representing the flow of a dialogue from fragmentary spoken utterances. In the apparatus, speech fragments of the dialogue-sound are input through a sound input section. A clue extraction section extracts clues from the speech fragments, each clue being a word or prosodic feature representing the flow of the dialogue. An utterance function rule memory section stores utterance function rules, each of which defines a correspondence between a clue and an utterance function representing a pragmatic effect on the flow of the dialogue. An utterance function extraction section assigns an utterance function to each clue in accordance with the corresponding utterance function rule. A discourse structure generation section generates the discourse structure in accordance with the utterance functions assigned to the clues by the utterance function extraction section.
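The sections named in the abstract can be pictured as a small pipeline. The following is a minimal sketch under stated assumptions, not the patented implementation: the clue values ("yes", "anyway", a "rising" pitch label), the utterance function labels, and the rule that a "topic-change" function opens a new discourse segment are all hypothetical, and pre-tokenized text stands in for the speech fragments.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Fragment:
    """One speech fragment: recognized words plus an optional prosodic label."""
    words: Tuple[str, ...]
    prosody: Optional[str] = None


@dataclass(frozen=True)
class Clue:
    kind: str            # "word" or "prosody"
    value: str
    fragment_index: int  # which fragment the clue came from


# Hypothetical utterance function rules: clue -> utterance function.
UTTERANCE_FUNCTION_RULES = {
    ("word", "yes"): "acknowledge",
    ("word", "anyway"): "topic-change",
    ("prosody", "rising"): "query",
}


def extract_clues(fragments: List[Fragment]) -> List[Clue]:
    """Clue extraction: keep only words/prosodic features that a rule covers."""
    clues = []
    for index, fragment in enumerate(fragments):
        for word in fragment.words:
            if ("word", word.lower()) in UTTERANCE_FUNCTION_RULES:
                clues.append(Clue("word", word.lower(), index))
        if fragment.prosody and ("prosody", fragment.prosody) in UTTERANCE_FUNCTION_RULES:
            clues.append(Clue("prosody", fragment.prosody, index))
    return clues


def assign_functions(clues: List[Clue]) -> List[Tuple[Clue, str]]:
    """Utterance function extraction: look up each clue in the rule table."""
    return [(clue, UTTERANCE_FUNCTION_RULES[(clue.kind, clue.value)]) for clue in clues]


def build_discourse_structure(assigned: List[Tuple[Clue, str]]) -> List[List[Tuple[int, str]]]:
    """Discourse structure generation (assumed policy): a 'topic-change'
    function starts a new segment; every other function joins the current one."""
    segments: List[List[Tuple[int, str]]] = [[]]
    for clue, function in assigned:
        if function == "topic-change" and segments[-1]:
            segments.append([])
        segments[-1].append((clue.fragment_index, function))
    return [segment for segment in segments if segment]


fragments = [
    Fragment(("yes",)),
    Fragment(("is", "it", "open"), prosody="rising"),
    Fragment(("anyway", "next")),
    Fragment(("yes",)),
]
structure = build_discourse_structure(assign_functions(extract_clues(fragments)))
```

Here `structure` groups the four fragments into two segments, the second beginning at the fragment that carries the hypothetical "anyway" clue. A flat list of segments is the simplest possible stand-in for a discourse structure; a tree or stack would be an equally plausible choice.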
61 Citations
11 Claims
1. Dialogue-sound processing apparatus, comprising:
sound input means for inputting speech fragments of dialogue-sound in sequence;
clue extraction means for extracting a plurality of clues, each clue comprising a word or prosodic feature representing a flow of a dialogue from the speech fragments;
utterance function rule memory means for memorizing a plurality of utterance function rules, each rule defining a relation between one of the clues and an utterance function representing a pragmatic effect for the flow of the dialogue;
utterance function extraction means for assigning the utterance function to the clue extracted by said clue extraction means in accordance with the corresponding utterance function rule; and
discourse structure generation means for generating a discourse structure representing the flow of the dialogue of the speech fragments in accordance with the assigned utterance function.
Dependent claims: 2, 3, 4, 5, 6, 7, 8
9. Dialogue-sound processing method, comprising the steps of:
inputting speech fragments of the dialogue-sound in sequence;
extracting a plurality of clues, each clue comprising a word or prosodic feature representing a flow of a dialogue from the speech fragments;
memorizing a plurality of utterance function rules, each rule defining a relation between one of the clues and an utterance function representing a pragmatic effect for the flow of the dialogue;
assigning the utterance function to the clue extracted at the extracting step in accordance with the corresponding utterance function rule; and
generating a discourse structure representing the flow of the dialogue of the speech fragments in accordance with the assigned utterance function.
10. Dialogue-sound processing apparatus, comprising:
sound input means for inputting a user's sound signal;
input analysis means for analysing the sound signal and for outputting input-intention information of the sound signal;
problem solution means for solving a problem corresponding to the input-intention information and for outputting response-intention information as a solution result;
output generation means for generating response information to the user in accordance with the response-intention information;
clue extraction means for extracting a plurality of clues, each clue comprising a word or prosodic feature representing a flow of a dialogue from speech fragments in the sound signal;
utterance function rule memory means for memorizing a plurality of utterance function rules, each rule defining a relation between one of the clues and an utterance function representing a pragmatic effect for the flow of the dialogue;
utterance function extraction means for assigning the utterance function to the clue extracted by said clue extraction means in accordance with the corresponding utterance function rule; and
discourse management means for generating a discourse structure representing the flow of the dialogue between the user's sound signal and the response information in accordance with at least one of the assigned utterance function and the input-intention information, at least one of the response-intention information and the response information, and for controlling at least one of the analysis processing of said input analysis means, the solution processing of the problem solution means and the generation processing of the output generation means in accordance with the discourse structure.
11. Dialogue-sound processing method, comprising the steps of:
inputting a user's sound signal;
analysing the sound signal to output input-intention information of the sound signal;
solving a problem corresponding to the input-intention information to output response-intention information as a solution result;
generating response information to the user in accordance with the response-intention information;
extracting a plurality of clues, each clue comprising a word or prosodic feature representing a flow of a dialogue from speech fragments in the sound signal;
memorizing a plurality of utterance function rules, each rule defining a relation between one of the clues and an utterance function representing a pragmatic effect for the flow of the dialogue;
assigning the utterance function to the clue extracted at the extracting step in accordance with the corresponding utterance function rule;
generating a discourse structure representing the flow of the dialogue between the user's sound signal and the response information in accordance with at least one of the assigned utterance function and the input-intention information, at least one of the response-intention information and the response information; and
controlling at least one of the analysis processing at the analysing step, the solution processing at the solving step and the response information-generation processing at the response information-generating step in accordance with the discourse structure.
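The interactive arrangement of claims 10 and 11 adds a control loop: the discourse structure records both the user's turns and the system's responses, and is then used to steer one of the processing stages. The sketch below is one illustrative reading, with several assumptions: text stands in for the sound signal, the clue words ("wait", "okay"), the function labels, the intent names, and the policy of shortening the response after an "interrupt" are all hypothetical, and only the output generation stage is controlled.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical utterance function rules keyed by clue word.
RULES = {"wait": "interrupt", "okay": "acknowledge"}


def analyze(text: str) -> Dict[str, str]:
    """Input analysis: map the input to input-intention information."""
    intent = "ask_hours" if "hours" in text.lower() else "unknown"
    return {"intent": intent}


def solve(input_intention: Dict[str, str]) -> Dict[str, str]:
    """Problem solution: produce response-intention information."""
    if input_intention["intent"] == "ask_hours":
        return {"act": "inform", "content": "open 9 to 5"}
    return {"act": "clarify", "content": "please rephrase"}


def generate(response_intention: Dict[str, str], brief: bool = False) -> str:
    """Output generation: a brief mode drops the framing words."""
    content = response_intention["content"]
    return content if brief else f"Let me answer: {content}."


def extract_functions(text: str) -> List[str]:
    """Clue extraction plus rule lookup, collapsed into one step."""
    words = (word.strip(".,?!") for word in text.lower().split())
    return [RULES[word] for word in words if word in RULES]


@dataclass
class DiscourseManager:
    """Keeps a flat turn list as the discourse structure (an assumption)."""
    turns: List[dict] = field(default_factory=list)

    def record(self, speaker: str, functions: List[str], intention: Dict[str, str]) -> None:
        self.turns.append({"speaker": speaker, "functions": functions, "intention": intention})

    def wants_brief_output(self) -> bool:
        # Control decision: shorten the reply if the latest user turn interrupted.
        for turn in reversed(self.turns):
            if turn["speaker"] == "user":
                return "interrupt" in turn["functions"]
        return False


def dialogue_step(manager: DiscourseManager, user_text: str) -> str:
    input_intention = analyze(user_text)
    manager.record("user", extract_functions(user_text), input_intention)
    response_intention = solve(input_intention)
    response = generate(response_intention, brief=manager.wants_brief_output())
    manager.record("system", [], response_intention)
    return response


manager = DiscourseManager()
first = dialogue_step(manager, "What are your hours?")
second = dialogue_step(manager, "Wait, your hours again?")
```

The second call returns the shortened response because the hypothetical "wait" clue marks the user's turn as an interruption. The claims equally allow the discourse structure to control the analysis or problem-solving stage instead.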
Specification