Real-time speaker-dependent neural vocoder

  • US 10,770,063 B2
  • Filed: 08/22/2018
  • Issued: 09/08/2020
  • Est. Priority Date: 04/13/2018
  • Status: Active Grant
First Claim

1. A method for generating speech samples, the method comprising:

  • receiving an input tensor;

    splitting said received input tensor into a first portion and a second portion;

    performing a 1×1 convolution respectively on said first portion and said second portion to generate a respective first intermediate result and a second intermediate result;

    summing said first intermediate result and said second intermediate result to generate a third intermediate result;

    applying a post-processing function on said third intermediate result to generate a fourth intermediate result;

    computing an output tensor by summing said received input tensor with said fourth intermediate result;

    recursing by setting said input tensor to said output tensor until said output tensor is of size one in a pre-determined dimension; and

    performing a prediction of a speech sample using said output tensor of size one in a pre-determined dimension.
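
The steps of claim 1 can be sketched in NumPy as follows. This is an illustrative reading of the claim, not the patented implementation, and several details are assumptions the claim does not state: the tensor is laid out as (channels, time) and split into two halves along the time axis; the post-processing function is ReLU, a 1×1 convolution, then ReLU; the residual sum folds the input to the halved length by adding its two halves; and the prediction is a softmax over 256 quantization levels. The names `conv1x1`, `halving_layer`, and `generate_sample` are hypothetical.

```python
import numpy as np


def conv1x1(x, w):
    """1x1 convolution: a per-time-step linear map over channels.

    x has shape (channels_in, time); w has shape (channels_out, channels_in).
    """
    return w @ x


def halving_layer(x, w_left, w_right, w_post):
    """One pass of the claimed steps; halves the time dimension."""
    if x.shape[1] % 2 != 0:
        raise ValueError("time dimension must be even to split into halves")
    half = x.shape[1] // 2
    # Split the input tensor into a first and a second portion (assumed: halves in time).
    left, right = x[:, :half], x[:, half:]
    # 1x1 convolution on each portion, then sum the two intermediate results.
    z = conv1x1(left, w_left) + conv1x1(right, w_right)
    # Post-processing (assumed here: ReLU -> 1x1 convolution -> ReLU).
    post = np.maximum(conv1x1(np.maximum(z, 0.0), w_post), 0.0)
    # Residual sum with the input (assumed: input folded to the halved length).
    return (left + right) + post


def generate_sample(x, layer_weights, w_out):
    """Repeat the layer until the time dimension is one, then predict a sample."""
    i = 0
    while x.shape[1] > 1:
        w_left, w_right, w_post = layer_weights[i]
        x = halving_layer(x, w_left, w_right, w_post)
        i += 1
    # Prediction (assumed: softmax over quantized amplitude classes).
    logits = w_out @ x[:, 0]
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return int(np.argmax(probs)), probs


# Usage: 4 channels, 8 time steps, so 3 layers reduce the time dimension to 1.
rng = np.random.default_rng(0)
channels, steps, classes = 4, 8, 256
weights = [tuple(rng.normal(size=(channels, channels)) * 0.1 for _ in range(3))
           for _ in range(3)]
w_out = rng.normal(size=(classes, channels))
sample, probs = generate_sample(rng.normal(size=(channels, steps)), weights, w_out)
```

Because each pass halves the time dimension, an input of length N (a power of two under these assumptions) needs log2(N) passes before the prediction step.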
