×

Methods for reconstructing an audio signal

  • US 10,127,918 B1
  • Filed: 05/03/2017
  • Issued: 11/13/2018
  • Est. Priority Date: 05/03/2017
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • receiving input audio data comprising a plurality of audio samples;

    detecting distortion in a first portion of the input audio data associated with a first period of time, the distortion caused by at least one of the plurality of audio samples missing from the input audio data or a magnitude value of one or more of the plurality of audio samples being equal to a saturation threshold value;

    determining that a second portion of the input audio data following the first portion is not distorted, the second portion corresponding to a second period of time that begins at a first time;

    performing, based on a magnitude of signal values of the input audio data, a quantization process to generate first audio data by mapping the signal values of the input audio data to discrete states corresponding to respective quantization intervals;

    generating, based on the first audio data, two or more first audio data predictions corresponding to at least part of the first period of time, the two or more first audio data predictions determined using a first generative model that receives the first audio data as input features and predicts a magnitude of signal values for audio samples recursively in a first direction in time;

    determining a first audio sample in the first audio data corresponding to the first time;

    determining a magnitude value associated with the first audio sample;

    selecting, based on at least the magnitude value associated with the first audio sample, a first data prediction of the two or more first audio data predictions;

    generating, based on the first data prediction, second audio data corresponding to at least part of the first period of time;

    generating, based on at least the first audio data and the second audio data, output audio data, the output audio data including the second audio data followed by a third portion of the first audio data that includes the first audio sample; and

    doing at least one of (a) causing audio corresponding to the output audio data to be output by at least one speaker, or (b) causing a function corresponding to a voice command represented by the output audio data to be executed.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×