Format based speech reconstruction from noisy signals
First Claim
1. A method of reconstructing a speech signal from an audible signal using a formant-based codebook, the method comprising:
- detecting one or more formants in an audible signal;
receiving a pitch estimate associated with the one or more detected formants;
selecting one or more codebook tuples from the formant-based codebook based at least on the one or more detected formants, wherein each codebook tuple includes a respective formant spectrum value and a respective one or more formant amplitude values, wherein the respective formant spectrum value is indicative of the spectral location of one or more formants associated with the codebook tuple, and the respective one or more formant amplitude values are indicative of the corresponding amplitudes of the one or more formants associated with the codebook tuple; and
interpolating the spectrum between the corresponding one or more formants associated with the one or more selected codebook tuples to generate a reconstructed speech signal using the received pitch estimate.
0 Assignments
0 Petitions
Accused Products
Abstract
Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
-
Citations
25 Claims
-
1. A method of reconstructing a speech signal from an audible signal using a formant-based codebook, the method comprising:
-
detecting one or more formants in an audible signal; receiving a pitch estimate associated with the one or more detected formants; selecting one or more codebook tuples from the formant-based codebook based at least on the one or more detected formants, wherein each codebook tuple includes a respective formant spectrum value and a respective one or more formant amplitude values, wherein the respective formant spectrum value is indicative of the spectral location of one or more formants associated with the codebook tuple, and the respective one or more formant amplitude values are indicative of the corresponding amplitudes of the one or more formants associated with the codebook tuple; and interpolating the spectrum between the corresponding one or more formants associated with the one or more selected codebook tuples to generate a reconstructed speech signal using the received pitch estimate. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22)
-
-
23. A voice reconstruction device operable to reconstruct a speech signal from an audible signal using a formant based codebook, the device comprising:
-
a formant detection module configured to detect one or more formants in an audible signal; a tuple selection module configured to select one or more codebook tuples from the formant-based codebook based at least on the one or more detected formants, wherein each codebook tuple includes a respective formant spectrum value and a respective one or more formant amplitude values, wherein the respective formant spectrum value is indicative of the spectral location of one or more formants associated with the codebook tuple, and the respective one or more formant amplitude values are indicative of the corresponding amplitudes of the one or more formants associated with the codebook tuple; and a synthesis module configured to interpolate the spectrum between the corresponding one or more formants associated with the one or more selected codebook tuples to generate a reconstructed speech signal using a pitch estimate.
-
-
24. A voice reconstruction device operable to reconstruct a speech signal from an audible signal using a formant based codebook, the device comprising:
-
means for detecting one or more formants in an audible signal; means for selecting one or more codebook tuples from the formant-based codebook based at least on the one or more detected formants, wherein each codebook tuple includes a respective formant spectrum value and a respective one or more formant amplitude values, wherein the respective formant spectrum value is indicative of the spectral location of one or more formants associated with the codebook tuple, and the respective one or more formant amplitude values are indicative of the corresponding amplitudes of the one or more formants associated with the codebook tuple; and means for interpolating the spectrum between the corresponding one or more formants associated with the one or more selected codebook tuples to generate a reconstructed speech signal using a pitch estimate.
-
-
25. A voice reconstruction device operable to reconstruct a speech signal from an audible signal using a formant based codebook, the device comprising:
-
a processor; and a memory including instructions, that when executed by the processor cause the device to; detect one or more formants in an audible signal; select one or more codebook tuples from the formant-based codebook based at least on the one or more detected formants, wherein each codebook tuple includes a respective formant spectrum value and a respective one or more formant amplitude values, wherein the respective formant spectrum value is indicative of the spectral location of one or more formants associated with the codebook tuple, and the respective one or more formant amplitude values are indicative of the corresponding amplitudes of the one or more formants associated with the codebook tuple; and interpolate the spectrum between the corresponding one or more formants associated with the one or more selected codebook tuples to generate a reconstructed speech signal using a pitch estimate.
-
Specification