Hierarchical approach for the statistical vowelization of Arabic text
First Claim
1. A method for converting an input text given in an incomplete language into speech, wherein a computer-aided graphem-phonem conversion is used, characterized by the steps of:
- a) using statistical methods for enriching said input text with missing information, b) subjecting the enriched input text to said grapheme-phoneme conversion to produce a phonetic description of said input text, c) converting said phonetic description into synthetic speech.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to the field of computer-aided text and speech processing, and in particular to a method and respective system for converting an input text given in an incomplete language, for example a language, in which unvowelized text is used, into speech, wherein a computer-aided grapheme-phoneme conversion is used. In order to improve completion of the text, it is proposed to a) use statistical methods including decision trees and stochastic language models for enriching, i.e. completing said input text with missing information—which may be desired for a full understanding of the input text b) subjecting the completed input text to said grapheme-phoneme conversion to produce synthetic speech.
Advantageously, the text is completed according to a model hierarchy giving higher priority to longer chunks of text, ie sentences (310, 315, 320) then multiword phrases (330, 335, 340), then words (350, 355, 360) and finally character groups (370, 375, 380, 390).
-
Citations
17 Claims
-
1. A method for converting an input text given in an incomplete language into speech, wherein a computer-aided graphem-phonem conversion is used, characterized by the steps of:
-
a) using statistical methods for enriching said input text with missing information, b) subjecting the enriched input text to said grapheme-phoneme conversion to produce a phonetic description of said input text, c) converting said phonetic description into synthetic speech. - View Dependent Claims (3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 16, 17)
-
-
2. A method for training a speech recognizer with an input text and corresponding speech data, wherein said input text is given in an incomplete language, characterized by the steps of:
-
a) using statistical methods for enriching an input word of said input text with missing information, b) subjecting the enriched input text to said grapheme-phoneme conversion to produce a phonetic description of said input text, c) training acoustic Hidden Markov Models for the recognition of words from said input text.
-
-
13. A computer system having a functional component for converting an input text given in an incomplete language into speech, wherein a computer-aided graphem-phonem conversion is used, characterized by comprising a functional vowelization program component using statistical methods for enriching said input text with missing information and having access to:
-
a) a database comprising language models for words or characters and/or for classes of words or characters, b) a database comprising language models for sentences and/or phrases. - View Dependent Claims (14)
-
-
15. A text server computer system having a functional component for training a speech recognizer with an input text and corresponding speech data, wherein said input text is given in an incomplete language, characterized by comprising a functional vowelization program component using statistical methods for enriching an input word of said input text with missing information and having access to:
-
a) a database comprising language models for words or characters and/or for classes of words or characters, b) a database comprising language models for sentences and/or phrases.
-
Specification