Speech signal evaluation apparatus, storage medium storing speech signal evaluation program, and speech signal evaluation method
First Claim
1. A speech signal evaluation apparatus comprising:
a processor; and
a memory storing speech signals and a plurality of instructions, which when executed by the processor, cause the processor to execute,
acquiring, as a first frame, a speech signal of a specified length from the speech signals stored in the memory;
detecting, on the basis of a speech condition indicating a presence of speech, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame does satisfy the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on a basis of a non-stationary condition based on the variation in spectrum, whether the variation satisfies the non-stationary condition, wherein
the variation in the spectrum is calculated on the basis of an absolute value of a difference between the spectrum of the first frame and the spectrum of the second frame at each frequency.
Abstract
A speech signal evaluation apparatus includes: an acquisition unit that acquires, as a first frame, a speech signal of a specified length from speech signals; a first detection unit that detects, on the basis of a speech condition, whether the first frame is voiced or unvoiced; a variation calculation unit that, when the first frame is unvoiced, calculates a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame that is unvoiced and precedes the first frame in time; and a second detection unit that detects, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation of the first frame satisfies the non-stationary condition.
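The acquisition unit and the first detection unit described in the abstract can be illustrated with a minimal sketch. It assumes non-overlapping frames and uses a mean-power threshold as the speech condition; neither choice is stated in the text above, and the names `acquire_frames`, `is_voiced`, and `power_threshold` are hypothetical.

```python
from typing import Iterator, List, Sequence


def acquire_frames(signal: Sequence[float], frame_len: int) -> Iterator[List[float]]:
    """Yield consecutive frames of a specified length (assumed non-overlapping)."""
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        yield list(signal[start:start + frame_len])


def is_voiced(frame: Sequence[float], power_threshold: float = 0.01) -> bool:
    """Assumed speech condition: the frame's mean power exceeds a threshold.

    The source only says that a voiced frame satisfies "a speech condition";
    the power test here is a stand-in for illustration.
    """
    power = sum(x * x for x in frame) / len(frame)
    return power > power_threshold


# Two frames of four samples: a near-silent one followed by a louder one.
signal = [0.0, 0.01, -0.01, 0.0, 0.5, -0.4, 0.6, -0.5]
labels = [is_voiced(frame) for frame in acquire_frames(signal, frame_len=4)]
```

Only frames labeled unvoiced would go on to the variation calculation unit; voiced frames are skipped by that stage.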
16 Claims
1. A speech signal evaluation apparatus comprising:
a processor; and
a memory storing speech signals and a plurality of instructions, which when executed by the processor, cause the processor to execute,
acquiring, as a first frame, a speech signal of a specified length from the speech signals stored in the memory;
detecting, on the basis of a speech condition indicating a presence of speech, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame does satisfy the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on a basis of a non-stationary condition based on the variation in spectrum, whether the variation satisfies the non-stationary condition, wherein
the variation in the spectrum is calculated on the basis of an absolute value of a difference between the spectrum of the first frame and the spectrum of the second frame at each frequency.
Dependent claims: 2
3. A computer-readable non-transitory medium storing a speech signal evaluation program, which when executed by a computer, causes the computer to execute:
acquiring, as a first frame, a speech signal of a specified length from speech signals stored in a memory;
detecting, on the basis of a speech condition indicating a presence of speech in a frame, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame does satisfy the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation satisfies the non-stationary condition, wherein
the variation in the spectrum is calculated on the basis of an absolute value of a difference between the spectrum of the first frame and the spectrum of the second frame at each frequency.
Dependent claims: 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
15. A speech signal evaluation method executed by a computer, the speech signal evaluation method comprising:
acquiring, as a first frame, a speech signal of a specified length from speech signals stored in a memory;
detecting, on the basis of a speech condition indicating a presence of speech in a frame, whether the first frame is voiced or unvoiced, wherein an unvoiced frame does not satisfy the speech condition and a voiced frame does satisfy the speech condition;
calculating, when the first frame is unvoiced, a variation in a spectrum associated with the first frame on the basis of a spectrum of the first frame and a spectrum of a second frame, the second frame being unvoiced and preceding the first frame in time; and
detecting, on the basis of a non-stationary condition based on the variation in spectrum, whether the variation satisfies the non-stationary condition, wherein
the variation in the spectrum is calculated on the basis of an absolute value of a difference between the spectrum of the first frame and the spectrum of the second frame at each frequency.
Dependent claims: 16
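The calculating and second detecting steps recited in the independent claims can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: the per-frequency absolute differences are summed into one number, the non-stationary condition is a simple threshold on that sum, the second frame is taken to be the most recent unvoiced frame, and the spectrum is a magnitude spectrum from a naive DFT. The claims fix none of these details, and all names below are hypothetical.

```python
import cmath
from typing import List, Optional, Sequence


def magnitude_spectrum(frame: Sequence[float]) -> List[float]:
    """Magnitude spectrum via a naive DFT (adequate for short example frames)."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def spectral_variation(current: Sequence[float], previous: Sequence[float]) -> float:
    """Absolute difference at each frequency, summed (the summation is an assumption)."""
    return sum(abs(c - p) for c, p in zip(current, previous))


class NonStationaryDetector:
    """Compares each unvoiced frame with the most recent earlier unvoiced frame."""

    def __init__(self, threshold: float) -> None:
        self.threshold = threshold  # assumed form of the non-stationary condition
        self.prev_unvoiced_spectrum: Optional[List[float]] = None

    def process(self, frame: Sequence[float], voiced: bool) -> Optional[bool]:
        """Return True/False for unvoiced frames that have a predecessor, else None."""
        if voiced:
            return None  # the variation is only calculated for unvoiced frames
        spectrum = magnitude_spectrum(frame)
        previous = self.prev_unvoiced_spectrum
        self.prev_unvoiced_spectrum = spectrum
        if previous is None:
            return None  # no earlier unvoiced frame to compare against
        return spectral_variation(spectrum, previous) > self.threshold


detector = NonStationaryDetector(threshold=1.0)
silence = [0.0, 0.0, 0.0, 0.0]
results = [
    detector.process(silence, voiced=False),               # first unvoiced frame
    detector.process(silence, voiced=False),               # unchanged spectrum
    detector.process([1.0, 1.0, 1.0, 1.0], voiced=True),   # voiced frame is skipped
    detector.process([1.0, -1.0, 1.0, -1.0], voiced=False) # spectrum changed
]
```

Because voiced frames leave the stored spectrum untouched, the last frame in the usage example is compared with the earlier silent frame rather than with the voiced frame between them.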
Specification