Compressing and using a concatenative speech database in text-to-speech systems

  • US 7,035,794 B2
  • Filed: 03/30/2001
  • Issued: 04/25/2006
  • Est. Priority Date: 03/30/2001
  • Status: Expired due to Fees
First Claim

1. A method, comprising:

  • receiving input text at a client device;

  • analyzing the input text to determine diphones;

  • sending a request to a server for diphone waveform data based on the determined diphones;

  • locating the requested diphone waveform data by searching a concatenative diphone waveform database at the server;

  • generating a set of compressed diphone residuals and Linear Predictive Coding (LPC) coefficients by compressing results of the searched diphone waveform database;

  • storing the set of compressed diphone residuals and the LPC coefficients in a compressed packet;

  • transmitting the compressed packet to the client device; and

  • upon receiving the compressed packet, the client device decompresses the compressed packet back to diphone waveform data available for use in a text-to-speech synthesizer.
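
The client/server flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it is hypothetical: the letter-pair "diphone" analysis, the synthetic tone database, the JSON request, the LPC order of 4, and the length-prefixed packet layout are all assumptions made for illustration. The sketch also assumes lossless integer residual coding (chosen so the round trip can be checked exactly), which the claim itself does not require.

```python
import json
import math
import struct
import zlib

ORDER = 4  # LPC order; an arbitrary choice for this sketch


def text_to_diphones(text):
    """Toy analysis: treat letters as phones and pair neighbours ('#' = silence)."""
    phones = ["#"] + [c for c in text.lower() if c.isalpha()] + ["#"]
    return [a + "-" + b for a, b in zip(phones, phones[1:])]


def toy_database(names):
    """Hypothetical database: a short synthetic two-tone waveform per diphone."""
    return {
        name: [int(8000 * math.sin(0.05 * (i + 1) * t) + 3000 * math.sin(0.31 * t))
               for t in range(200)]
        for i, name in enumerate(names)
    }


def lpc_coefficients(x, order=ORDER):
    """Levinson-Durbin recursion on the autocorrelation of x."""
    r = [sum(x[i] * x[i + k] for i in range(len(x) - k)) for k in range(order + 1)]
    a = [0.0] * (order + 1)
    err = float(r[0])
    for i in range(1, order + 1):
        if err <= 0:
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1 - k * k
    return a[1:]


def predict(a, history):
    """Integer prediction from the most recent samples (newest last)."""
    return int(round(sum(c * s for c, s in zip(a, reversed(history)))))


def compress(samples):
    """Encode one waveform as LPC coefficients plus zlib-packed residuals."""
    a = lpc_coefficients(samples)
    residual = [s - predict(a, samples[max(0, n - ORDER):n])
                for n, s in enumerate(samples)]
    body = struct.pack("<%dd%di" % (ORDER, len(residual)), *a, *residual)
    return zlib.compress(body)


def decompress(blob):
    """Rebuild the waveform by running the same predictor over the residuals."""
    body = zlib.decompress(blob)
    count = (len(body) - 8 * ORDER) // 4
    fields = struct.unpack("<%dd%di" % (ORDER, count), body)
    a, residual = fields[:ORDER], fields[ORDER:]
    samples = []
    for e in residual:
        samples.append(e + predict(a, samples[-ORDER:]))
    return samples


def server_handle(request, database):
    """Search the database and build one length-prefixed compressed packet."""
    blobs = [compress(database[name]) for name in json.loads(request)]
    return b"".join(struct.pack("<I", len(b)) + b for b in blobs)


def client_unpack(packet):
    """Split the packet and decompress each entry back to waveform samples."""
    waves, pos = [], 0
    while pos < len(packet):
        (size,) = struct.unpack_from("<I", packet, pos)
        waves.append(decompress(packet[pos + 4:pos + 4 + size]))
        pos += 4 + size
    return waves


diphones = text_to_diphones("hi")             # client: analyze input text
database = toy_database(diphones)             # server-side store (synthetic)
packet = server_handle(json.dumps(diphones), database)
waves = client_unpack(packet)                 # client: waveforms for synthesis
```

Because the decoder runs the same integer predictor with the same coefficients as the encoder, the recovered samples match the database entries exactly in this sketch. A lossy scheme would instead quantize the residuals before packing them.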
