Systems and methods for performing phonemic synthesis
Abstract
Systems and methods for performing phonemic synthesis which operate to generate an output data set of acoustic parameters from a received textual data set wherein the output data set represents patterns of transition from one speech excitation state to another. The textual data set is converted to a plurality of phonetic data sets, at least one phone descriptor is assigned to each of the phonemic data sets, and the output data set is generated by processing the phonetic data sets as a non-linear function of a speech excitation control variable whereby the collective contributions of the phonetic data sets are determined for each pattern of transition from one speech excitation state to another. The speech excitation control variable represents selected portions of a human vocal system.
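The idea in the abstract, several excitation parameters derived as nonlinear functions of one control variable and then evaluated along a transition between two speech states, can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the range of the control variable `x` (0 to 1), the parameter names `voicing`, `aspiration` and `open_quotient`, the curve shapes, and the linear blending of the two states' control values.

```python
import math


def excitation_parameters(x):
    """Map one control variable x in [0, 1] to several source parameters.

    Hypothetical curves: the source text only states that the parameters
    are nonlinear functions of a single excitation control variable.
    """
    x = min(1.0, max(0.0, x))  # clamp to the assumed range
    return {
        # Sigmoid that falls from about 1 to about 0 as x rises.
        "voicing": 1.0 / (1.0 + math.exp(12.0 * (x - 0.5))),
        # Quadratic that rises slowly at first, then faster.
        "aspiration": x ** 2,
        # Quarter sine wave rising from 0.5 to 0.9.
        "open_quotient": 0.5 + 0.4 * math.sin(0.5 * math.pi * x),
    }


def transition(x_from, x_to, n_frames):
    """Blend two speech states frame by frame and derive parameters per frame.

    Each frame combines the contributions of both states into one control
    value, then derives every acoustic parameter from that single value.
    """
    if n_frames < 2:
        raise ValueError("n_frames must be at least 2")
    frames = []
    for i in range(n_frames):
        w = i / (n_frames - 1)             # weight of the target state
        x = (1.0 - w) * x_from + w * x_to  # combined control value
        frames.append(excitation_parameters(x))
    return frames


# Five frames moving from an assumed voiced state (x = 0) to an assumed
# voiceless, aspirated state (x = 1).
frames = transition(0.0, 1.0, 5)
for frame in frames:
    print({name: round(value, 3) for name, value in frame.items()})
```

Because every parameter is computed from the same `x`, the parameters stay mutually consistent throughout the transition, which is the practical appeal of driving them from a single variable rather than interpolating each one independently.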
16 Citations
20 Claims
1. A processing system for generating an output data set for use in phonemic synthesis to produce patterns of transition from one speech excitation state to another, said output data set including a plurality of output data subsets, said processing system comprising:
means for receiving a textual data set, said textual data set including at least one textual data subset;
at least one memory storage device operable to store a plurality of processing system instructions; and
at least one processing unit for generating said output data set by retrieving and executing at least one of said processing unit instructions from said memory storage device, said processing unit operable to:
transform said received textual data set into a phonetic data set, said phonetic data set including a plurality of phonetic data subsets wherein each of said phonetic data subsets represents a particular speech state, said transformation modelling a number of acoustic parameters affecting the excitation sources of the vocal tract by deriving said parameters as nonlinear functions of a single excitation control variable; and
interpolate said phonetic data set as a function of the single excitation control variable to generate said output data set whereby said phonetic data subsets are combined to determine their collective contributions to each one of said output data subsets.
Dependent Claims (2, 3, 4, 5, 6, 7, 8)
9. A processing system comprising:
an input port for receiving a textual data set including a plurality of textual data subsets; and
at least one processing unit for generating an output data set representing a sequence of speech sounds, said processing unit operable to:
calculate an excitation control variable as a function of selected physical changes of a human vocal system as said human vocal system transitions from one speech excitation state to another; and
process said textual data set as a function of said excitation control variable to generate said output data set and model a number of acoustic parameters affecting the excitation sources of the vocal tract by deriving said parameters as nonlinear functions of the excitation control variable, whereby said textual data subsets are converted to a plurality of phonetic data sets which are combined together to determine their collective contributions to each one of said speech sounds.
Dependent Claims (10, 11, 12, 13, 14, 15)
16. A method for generating an output data set of acoustic parameters from a received textual data set, said output data set representative of patterns of transition from one speech excitation state to another, said method comprising the steps of:
converting said received textual data set to a phonetic data set, said phonetic data set including a plurality of phonetic data subsets wherein each of said phonetic data subsets represents a particular speech state;
assigning at least one phone descriptor to each of said phonemic data subsets and converting each said assigned phone descriptor to time series;
producing a speech excitation control variable representative of selected portions of a human vocal system;
generating said output data set of acoustic parameters by processing said phonetic data set with a number of acoustic parameters affecting the excitation sources of the vocal tract derived from a non-linear function of said speech excitation variable whereby the collective contributions of the phonetic data subsets are determined for each pattern of transition from one speech excitation state to another.
Dependent Claims (17, 18, 19, 20)
Specification