SPEECH PROCESSING DEVICE AND SPEECH PROCESSING METHOD
First Claim
1. A speech processing device, comprising:
- a speech detector that detects speech of individual speakers from acoustic signals;
an established-conversation calculator that calculates degrees of established conversation of all pairs of the speakers in individual segments defined by dividing a determination time period, on the basis of the detected speech;
a long-time feature calculator that calculates a long-time feature of the degrees of established conversation within the determination time period for each of the pairs; and
a conversational-partner determining unit that extracts a conversation group holding conversation from the speakers, on the basis of the calculated long-time feature.
3 Assignments
0 Petitions
Accused Products
Abstract
A speech processing device which can accurately extract a conversation group from among a plurality of speakers, even when a conversation group formed of three or more people is present. This device (400) comprises: a spontaneous speech detection unit (420) and a direction-specific speech detection unit (430) which separately detect, from a sound signal, uttered speech from the speakers; a conversation establishment level calculation unit (450) which calculates a conversation establishment level for each separated segment of the time being determined, for all of the pairings of two people, on the basis of the detected uttered speech; an extended-period characteristic amount calculation unit (460) which calculates an extended-period characteristic amount for the conversation establishment level of the time being determined, for each pairing; and a conversation-partner determination unit (470) which extracts a conversation group which forms a conversation on the basis of the calculated extended-period characteristic amount.
-
Citations
10 Claims
-
1. A speech processing device, comprising:
-
a speech detector that detects speech of individual speakers from acoustic signals; an established-conversation calculator that calculates degrees of established conversation of all pairs of the speakers in individual segments defined by dividing a determination time period, on the basis of the detected speech; a long-time feature calculator that calculates a long-time feature of the degrees of established conversation within the determination time period for each of the pairs; and a conversational-partner determining unit that extracts a conversation group holding conversation from the speakers, on the basis of the calculated long-time feature. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A speech processing method, comprising:
-
detecting speech of individual speakers from acoustic signals; calculating degrees of established conversation of all pairs of the speakers in individual segments defined by dividing a determination time period, on the basis of the detected speech; calculating a long-time feature of the degrees of established conversation within the determination time period for each of the pairs; and extracting a conversation group holding conversation from the speakers on the basis of the calculated long-time feature.
-
Specification