INFORMATION PROCESSING METHOD, INFORMATION PROCESSING DEVICE, AND RECORDING MEDIUM
First Claim
1. An information processing method performed by a computer, the information processing method comprising:
- detecting at least one speech segment from speech input to a speech input unit;
extracting, from each of the at least one speech segment, a first feature quantity identifying a speaker whose voice is contained in the speech segment;
performing a comparison between the first feature quantity extracted and each of second feature quantities stored in storage and identifying respective voices of registered speakers who are target speakers in speaker recognition; and
determining registered speakers by performing the comparison for each of consecutive speech segments detected in the detecting and, under a predetermined condition, deleting, from the storage, at least one second feature quantity having a degree of similarity less than or equal to a threshold among the second feature quantities stored in the storage, to remove at least one registered speaker identified by the at least one second feature quantity, the degree of similarity being a degree of similarity with the first feature quantity.
1 Assignment
0 Petitions
Accused Products
Abstract
The information processing method in the present disclosure is performed as below. At least one speech segment is detected from speech input to a speech input unit. A first feature quantity is extracted from each speech segment detected, the first feature quantity identifying a speaker whose voice is contained in the speech segment. The first feature quantity extracted is compared with each of second feature quantities stored in storage and identifying the respective voices of registered speakers who are target speakers in speaker recognition. The comparison is performed for each of consecutive speech segments, and under a predetermined condition, among the second feature quantities stored in the storage, at least one second feature quantity whose similarity with the first feature quantity is less than or equal to a threshold is deleted, thereby removing the at least one registered speaker identified by the at least one second feature quantity.
4 Citations
12 Claims
-
1. An information processing method performed by a computer, the information processing method comprising:
-
detecting at least one speech segment from speech input to a speech input unit; extracting, from each of the at least one speech segment, a first feature quantity identifying a speaker whose voice is contained in the speech segment; performing a comparison between the first feature quantity extracted and each of second feature quantities stored in storage and identifying respective voices of registered speakers who are target speakers in speaker recognition; and determining registered speakers by performing the comparison for each of consecutive speech segments detected in the detecting and, under a predetermined condition, deleting, from the storage, at least one second feature quantity having a degree of similarity less than or equal to a threshold among the second feature quantities stored in the storage, to remove at least one registered speaker identified by the at least one second feature quantity, the degree of similarity being a degree of similarity with the first feature quantity. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An information processing device comprising:
-
a detector that detects at least one speech segment from speech input to a speech input unit; a feature quantity extraction unit configured to extract, from each of the at least one speech segment, a first feature quantity identifying a speaker whose voice is contained in the speech segment; a comparator that performs a comparison between the first feature quantity extracted and each of second feature quantities stored in storage and identifying respective registered speakers who are target speakers in speaker recognition; and a registered speaker determination unit configured to perform the comparison for each of consecutive speech segments detected in the detecting and, under a predetermined condition, remove at least one registered speaker identified by at least one second feature quantity having a degree of similarity less than or equal to a threshold among the second feature quantities stored in the storage, the degree of similarity being a degree of similarity with the first feature quantity.
-
-
12. A non-transitory computer-readable recording medium for use in a computer, the recording medium having a program recorded thereon for causing the computer to perform an information processing method, the information processing method comprising:
-
detecting at least one speech segment from speech input to a speech input unit; extracting, from each of the at least one speech segment, a first feature quantity identifying a speaker whose voice is contained in the speech segment; performing a comparison between the first feature quantity extracted and each of second feature quantities stored in storage and identifying respective registered speakers who are target speakers in speaker recognition; and determining registered speakers by performing the comparison for each of consecutive speech segments detected in the detecting and, under a predetermined condition, removing at least one registered speaker identified by at least one second feature quantity having a degree of similarity less than or equal to a threshold among the second feature quantities stored in the storage, the degree of similarity being a degree of similarity with the first feature quantity.
-
Specification