System and method of processing structured text for text-to-speech synthesis
Abstract
A method for processing structured text is provided. Tokenized text is created from structured text in accordance with a predetermined set of tokenizer rules set forth in a token pattern knowledge base (13), in which each tokenizer rule defines a simplex constituent. Parsed text is created from tokenized text in accordance with a predetermined set of parser rules set forth in a parser rule knowledge base (15), in which each parser rule defines a complex constituent. Processed text is created from parsed text in accordance with a predetermined set of interpreter rules set forth in an interpretation knowledge base (17), in which each interpreter rule defines a message element corresponding to a simplex or complex constituent, whereby the processed text identifies and provides an interpretation of the message elements of the corresponding structured text for a useful purpose, such as text-to-speech synthesis.
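The three stages described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the rule contents, the names `TOKENIZER_RULES`, `PARSER_RULES`, `INTERPRETER_RULES`, and `Constituent`, and the choice of an e-mail address as the structured text. The sketch only shows how three rule sets could be applied in sequence to produce speakable message elements.

```python
import re
from dataclasses import dataclass


@dataclass
class Constituent:
    kind: str              # rule name that produced this constituent
    text: str              # source text covered by the constituent
    children: tuple = ()   # empty for simplex constituents


# Hypothetical token pattern knowledge base: each rule defines a simplex constituent.
TOKENIZER_RULES = [
    ("WORD", re.compile(r"[A-Za-z0-9]+")),
    ("AT", re.compile(r"@")),
    ("DOT", re.compile(r"\.")),
]

# Hypothetical parser rule knowledge base: each rule defines a complex
# constituent as a fixed sequence of constituent kinds.
PARSER_RULES = [
    ("DOMAIN", ("WORD", "DOT", "WORD")),
    ("EMAIL", ("WORD", "AT", "DOMAIN")),
]

# Hypothetical interpretation knowledge base: each rule turns a constituent
# into a message element (here, a word to be spoken).
INTERPRETER_RULES = {
    "WORD": lambda c: c.text,
    "AT": lambda c: "at",
    "DOT": lambda c: "dot",
}


def tokenize(text):
    """Stage 1: structured text -> tokenized text (simplex constituents)."""
    tokens, pos = [], 0
    while pos < len(text):
        for kind, pattern in TOKENIZER_RULES:
            m = pattern.match(text, pos)
            if m:
                tokens.append(Constituent(kind, m.group()))
                pos = m.end()
                break
        else:
            raise ValueError(f"no tokenizer rule matches at position {pos}")
    return tokens


def parse(tokens):
    """Stage 2: tokenized text -> parsed text (complex constituents)."""
    items = list(tokens)
    for kind, pattern in PARSER_RULES:
        i = 0
        while i + len(pattern) <= len(items):
            window = items[i:i + len(pattern)]
            if tuple(c.kind for c in window) == pattern:
                text = "".join(c.text for c in window)
                items[i:i + len(pattern)] = [Constituent(kind, text, tuple(window))]
            i += 1
    return items


def interpret(constituents):
    """Stage 3: parsed text -> processed text (message elements)."""
    spoken = []
    for c in constituents:
        rule = INTERPRETER_RULES.get(c.kind)
        if rule is not None:
            spoken.append(rule(c))
        else:
            # No direct rule: fall back to interpreting the sub-constituents.
            spoken.extend(interpret(c.children))
    return spoken


def process(text):
    return interpret(parse(tokenize(text)))


print(" ".join(process("bob@example.com")))
```

In this sketch the rules are plain data, so the same three functions could handle other kinds of structured text by swapping in different rule sets, which mirrors the separation between processing steps and knowledge bases described in the abstract.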
20 Claims
1. A method of processing a structured text comprising the steps of:
creating, from the structured text, a tokenized text including simplex constituents constructed in accordance with a predetermined set of tokenizer rules of a token pattern knowledge base, each tokenizer rule defining a simplex constituent;
creating, from the tokenized text, a parsed text including complex constituents constructed in accordance with a predetermined set of parser rules of a parser rule knowledge base, each parser rule defining a complex constituent; and
creating, from the parsed text, a processed text including message elements constructed in accordance with a predetermined set of interpreter rules of an interpretation knowledge base, each interpreter rule defining a message element.
Dependent claims: 2-19.
20. A program for processing a structured text stored on a computer readable medium comprising:
a computer readable program code for creating a tokenized text, from the structured text, including simplex constituents constructed in accordance with a predetermined set of tokenizer rules of a token pattern knowledge base;
a computer readable program code for creating a parsed text, from the tokenized text, including complex constituents constructed in accordance with a predetermined set of parser rules of a parser rule knowledge base; and
a computer readable program code for creating a processed text, from the parsed text, including message elements constructed in accordance with a predetermined set of interpreter rules of an interpretation knowledge base.
Specification