ADAPTIVE VOICE INTELLIGIBILITY PROCESSOR
First Claim
1. A method of adjusting a voice intelligibility enhancement, the method comprising:
- receiving an input voice signal;
obtaining a spectral representation of the input voice signal with a linear predictive coding (LPC) process, the spectral representation comprising one or more formant frequencies;
adjusting the spectral representation of the input voice signal with one or more processors to produce an enhancement filter configured to emphasize the one or more formant frequencies;
applying the enhancement filter to a representation of the input voice signal to produce a modified voice signal with enhanced formant frequencies;
detecting an envelope based on the input voice signal;
analyzing the envelope of the modified voice signal to determine one or more temporal enhancement parameters; and
applying the one or more temporal enhancement parameters to the modified voice signal to produce an output voice signal;
wherein at least said applying the one or more temporal enhancement parameters is performed by one or more processors.
6 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for adaptively processing speech to improve voice intelligibility are described. These systems and methods can adaptively identify and track formant locations, thereby enabling formants to be emphasized as they change. As a result, these systems and methods can improve near-end intelligibility, even in noisy environments. The systems and methods can be implemented in Voice-over IP (VoIP) applications, telephone and/or video conference applications (including on cellular phones, smart phones, and the like), laptop and tablet communications, and the like. The systems and methods can also enhance non-voiced speech, which can include speech generated without the vocal track, such as transient speech.
107 Citations
20 Claims
-
1. A method of adjusting a voice intelligibility enhancement, the method comprising:
-
receiving an input voice signal; obtaining a spectral representation of the input voice signal with a linear predictive coding (LPC) process, the spectral representation comprising one or more formant frequencies; adjusting the spectral representation of the input voice signal with one or more processors to produce an enhancement filter configured to emphasize the one or more formant frequencies; applying the enhancement filter to a representation of the input voice signal to produce a modified voice signal with enhanced formant frequencies; detecting an envelope based on the input voice signal; analyzing the envelope of the modified voice signal to determine one or more temporal enhancement parameters; and applying the one or more temporal enhancement parameters to the modified voice signal to produce an output voice signal; wherein at least said applying the one or more temporal enhancement parameters is performed by one or more processors. - View Dependent Claims (2, 3, 4)
-
-
5. A system for adjusting a voice intelligibility enhancement, the system comprising:
-
an analysis module configured to obtain a spectral representation of at least a portion of an input audio signal, the spectral representation comprising one or more formant frequencies; a formant enhancement module configured to generate an enhancement filter configured to emphasize the one or more formant frequencies; the enhancement filter configured to be applied to a representation of the input audio signal with one or more processors to produce a modified voice signal; and a temporal enveloper shaper configured to apply a temporal enhancement to the modified voice signal based at least in part on one or more envelopes of the modified voice signal. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for adjusting a voice intelligibility enhancement, the system comprising:
-
a linear predictive coding analysis module configured to apply a linear predictive coding (LPC) technique to obtain LPC coefficients that correspond to a spectrum of an input voice signal, the spectrum comprising one or more formant frequencies; a mapping module configured to map the LPC coefficients to line spectral pairs; and a formant enhancement module comprising one or more processors, the formant enhancement module configured to modify the line spectral pairs to thereby adjust the spectrum of the input voice signal and produce an enhancement filter configured to emphasize the one or more formant frequencies; the enhancement filter configured to be applied to a representation of the input voice signal to produce a modified voice signal. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification