SPEECH SIGNAL EVALUATION APPARATUS, STORAGE MEDIUM STORING SPEECH SIGNAL EVALUATION PROGRAM, AND SPEECH SIGNAL EVALUATION METHOD
Abstract
A speech signal evaluation apparatus includes: an acquisition unit that acquires, as a first frame, a speech signal of a specified length from speech signals; a first detection unit that detects, on the basis of a speech condition, whether the first frame is voiced or unvoiced; a variation calculation unit that, when the first frame is unvoiced, calculates a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame that is unvoiced and precedes the first frame in time; and a second detection unit that detects, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation of the first frame satisfies the non-stationary condition.
17 Claims
1. A speech signal evaluation apparatus comprising:
a memory storing speech signals;
an acquisition unit that acquires, as a first frame, a speech signal of a specified length from the speech signals stored in the memory;
a first detection unit that detects, on the basis of a speech condition indicating a presence of speech, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame satisfies the speech condition;
a variation calculation unit that, when the first frame is unvoiced, calculates a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
a second detection unit that detects, on the basis of a non-stationary condition based on the variation in the spectrum, whether the variation satisfies the non-stationary condition.
Dependent claims: 2
3. A computer-readable medium storing a speech signal evaluation program, which when executed by a computer, causes the computer to execute:
acquiring, as a first frame, a speech signal of a specified length from speech signals stored in a memory;
detecting, on the basis of a speech condition indicating a presence of speech in a frame, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame satisfies the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on the basis of a non-stationary condition based on the variation in the spectrum, whether the variation satisfies the non-stationary condition.
Dependent claims: 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15
16. A speech signal evaluation method executed by a computer, the speech signal evaluation method comprising:
acquiring, as a first frame, a speech signal of a specified length from speech signals stored in a memory;
detecting, on the basis of a speech condition indicating a presence of speech in a frame, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame satisfies the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on the basis of a non-stationary condition based on the variation in the spectrum, whether the variation satisfies the non-stationary condition.
Dependent claims: 17
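For illustration only, the four steps recited in the method claim (acquire a frame, test the speech condition, compute the spectral variation against an earlier unvoiced frame, test the non-stationary condition) might be sketched as follows. The claims do not specify the speech condition, the variation measure, or the non-stationary condition, so everything concrete here is an assumption made for the sketch: a mean-power threshold as the speech condition, the mean absolute difference of magnitude spectra as the variation, a simple threshold as the non-stationary condition, and the most recent unvoiced frame as the "second frame". All function and parameter names are hypothetical.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitude spectrum (bins 0..N/2) of a list of samples."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def is_voiced(frame, power_threshold):
    """Assumed speech condition: mean power exceeds a threshold."""
    return sum(x * x for x in frame) / len(frame) > power_threshold


def spectral_variation(spec_a, spec_b):
    """Assumed variation measure: mean absolute difference per bin."""
    return sum(abs(a - b) for a, b in zip(spec_a, spec_b)) / len(spec_a)


def evaluate(signal, frame_len, power_threshold, variation_threshold):
    """Label each non-overlapping frame of `signal`.

    Labels: 'voiced', 'unvoiced' (no earlier unvoiced frame to compare
    with), 'stationary', or 'non-stationary'.
    """
    labels = []
    prev_unvoiced_spec = None  # spectrum of the assumed "second frame"
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]       # acquire first frame
        if is_voiced(frame, power_threshold):          # speech condition
            labels.append("voiced")
            continue
        spec = magnitude_spectrum(frame)
        if prev_unvoiced_spec is None:
            labels.append("unvoiced")
        elif spectral_variation(spec, prev_unvoiced_spec) > variation_threshold:
            labels.append("non-stationary")            # assumed condition
        else:
            labels.append("stationary")
        prev_unvoiced_spec = spec
    return labels


# Two quiet constant frames, one loud frame, one quiet alternating frame.
signal = [0.01] * 8 + [0.01] * 8 + [1.0, -1.0] * 4 + [0.05, -0.05] * 4
print(evaluate(signal, 8, 0.01, 0.05))
```

In this sketch a voiced frame is skipped rather than used as a comparison reference, which mirrors the claim language requiring the second frame to be unvoiced; the thresholds in the usage line are arbitrary values chosen for the toy signal.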
Specification