Real-time voice masking in a computer network

  • US 9,947,341 B1
  • Filed: 01/18/2017
  • Issued: 04/17/2018
  • Est. Priority Date: 01/19/2016
  • Status: Active Grant
First Claim

1. A communication system configured to support real-time voice masking, the system comprising:

  • a first client computer configured to receive over a computer network a first set of instructions that control the first client computer to:

    receive an audio signal representing a portion of speech;

    split the audio signal into a plurality of overlapping segments;

    generate a frequency domain representation of a current signal segment in the plurality of overlapping segments, wherein the frequency domain representation comprises components corresponding to a plurality of frequency bins;

    generate, from the frequency domain representation of the current signal segment, a polar representation comprising a magnitude component and a phase component for each of the frequency bins;

    generate a refined frequency domain representation of the current signal segment based on a comparison, for each of the frequency bins, between a first phase component from the current signal segment and a second phase component from a prior signal segment;

    calculate an initial cepstrum from the refined frequency domain representation;

    calculate a spectral envelope from the initial cepstrum using iterative smoothing with a resolution lower than a resolution of the frequency domain representation, wherein the iterative smoothing terminates after a predetermined number of iterations or a predetermined degree of convergence is reached;

    calculate an excitation spectrum from the refined frequency domain representation and the spectral envelope;

    rescale the spectral envelope based on a formant adjustment parameter to obtain a modified spectral envelope, wherein the spectral envelope is distinct from the current signal segment, the frequency domain representation, and the initial cepstrum;

    calculate a modified frequency domain representation by combining the modified spectral envelope and the excitation spectrum;

    synthesize a modified signal segment from the modified frequency domain representation; and

    transmit the modified signal segment over the computer network;

  • a second client computer configured to receive over the computer network a second set of instructions that control the second client computer to play audio signal segments received over the computer network; and

  • a server configured to receive the modified signal segment from the first client computer and transmit the modified signal segment to the second client computer.
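
The signal-processing steps recited for the first client computer can be illustrated with a minimal sketch. It is not the patented implementation. It makes several assumptions: the iterative smoothing is modeled as a "true envelope" style cepstral liftering loop, the formant adjustment is modeled as a linear stretch of the envelope along the frequency axis, and the phase-comparison refinement, windowing, overlap-add synthesis, and network transport are left out. All function names and parameters (`split_overlapping`, `smooth_envelope`, `warp_envelope`, `mask_segment`, `n_coeffs`, `formant_factor`) are hypothetical.

```python
import cmath
import math

def split_overlapping(signal, size, hop):
    """Split samples into segments of `size` starting every `hop` samples (hop < size overlaps)."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, hop)]

def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform, adequate for a short demo segment."""
    n = len(x)
    sign = 1 if inverse else -1
    out = [sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n) for t in range(n))
           for k in range(n)]
    return [v / n for v in out] if inverse else out

def smooth_envelope(log_mag, n_coeffs, max_iters=20, tol=0.05):
    """Assumed smoothing scheme: repeatedly lifter the cepstrum to low quefrencies."""
    n = len(log_mag)
    target = list(log_mag)
    env = target
    for _ in range(max_iters):                  # stop after a fixed number of iterations...
        cep = dft(target, inverse=True)         # cepstrum of the current target spectrum
        kept = [c if q < n_coeffs or q > n - n_coeffs else 0.0
                for q, c in enumerate(cep)]     # keep only low-resolution detail
        env = [v.real for v in dft(kept)]
        if max(m - e for m, e in zip(log_mag, env)) < tol:
            break                               # ...or once the envelope covers the peaks
        target = [max(m, e) for m, e in zip(log_mag, env)]
    return env

def warp_envelope(env, factor):
    """Stretch a symmetric, even-length envelope along the frequency axis by `factor`."""
    half = len(env) // 2                        # index of the Nyquist bin

    def at(pos):
        pos = min(max(pos, 0.0), float(half))   # clamp to [0, Nyquist]
        lo = int(pos)
        hi = min(lo + 1, half)
        frac = pos - lo
        return env[lo] * (1 - frac) + env[hi] * frac  # linear interpolation

    lower = [at(k / factor) for k in range(half + 1)]
    return lower + lower[-2:0:-1]               # mirror to restore symmetry

def mask_segment(segment, formant_factor, n_coeffs=8):
    """Separate envelope and excitation, rescale the envelope, and resynthesize."""
    spectrum = dft(segment)
    phases = [cmath.phase(c) for c in spectrum]                  # polar form: phase
    log_mag = [math.log(max(abs(c), 1e-9)) for c in spectrum]    # polar form: log magnitude
    log_env = smooth_envelope(log_mag, n_coeffs)
    excitation = [m - e for m, e in zip(log_mag, log_env)]       # division in the log domain
    new_env = warp_envelope(log_env, formant_factor)
    modified = [cmath.rect(math.exp(e + x), p)
                for e, x, p in zip(new_env, excitation, phases)]
    return [v.real for v in dft(modified, inverse=True)]
```

In this sketch a `formant_factor` of 1.0 leaves the segment unchanged up to rounding error, while other values move the envelope peaks and keep the excitation (and so the pitch structure) in place. A practical system would use an FFT library in place of the naive `dft` helper.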
