Hierarchical approach for the statistical vowelization of Arabic text
First Claim
1. A method of supplementing an input text given in an incomplete language with missing information, the method comprising:
- enriching said input text given in an incomplete language with the missing information using at least one processor programmed to implement a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text.
2 Assignments
0 Petitions
Accused Products
Abstract
The present invention relates to the field of computer-aided text and speech processing, and in particular to a method and respective system for converting an input text given in an incomplete language, for example a language, in which unvowelized text is used, into speech, wherein a computer-aided grapheme-phoneme conversion is used. In order to improve completion of the text, it is proposed to
- a) use statistical methods including decision trees and stochastic language models for enriching, i.e. completing said input text with missing information—which may be desired for a full understanding of the input text
- b) subjecting the completed input text to said grapheme-phoneme conversion to produce synthetic speech.
Advantageously, the text is completed according to a model hierarchy giving higher priority to longer chunks of text, ie sentences (310, 315, 320) then multiword phrases (330, 335, 340), then words (350, 355, 360) and finally character groups (370, 375, 380, 390).
-
Citations
20 Claims
-
1. A method of supplementing an input text given in an incomplete language with missing information, the method comprising:
enriching said input text given in an incomplete language with the missing information using at least one processor programmed to implement a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
14. A method for training a speech recognizer with an input text given in an incomplete language and corresponding speech data, the method comprising:
-
enriching an input word of said input text given in an incomplete language with missing information using at least one processor programmed to implement a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text; subjecting the enriched representation of the input text to grapheme-phoneme conversion to produce a phonetic description of said input text; and using said phonetic description to train at least one acoustic model to recognize words from said input text. - View Dependent Claims (15)
-
-
16. A computer system, comprising:
at least one processor programmed to; enrich an input text given in an incomplete language, with missing information using a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text; and convert the enriched representation of the input text into speech. - View Dependent Claims (17)
-
18. A text server computer system, comprising:
at least one processor programmed to; train a speech recognizer with an input text given in an incomplete language and corresponding speech data; and enrich an input word of said input text with missing information using a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text.
-
19. A non-transitory computer usable medium, encoded with a plurality of instructions-that, when executed by a computer, perform a method of supplementing an input text given in an incomplete language with missing information, the method comprising:
enriching said input text with the missing information using a first statistical method configured to operate on a first type of linguistic unit and a second statistical method configured to operate on a second type of linguistic unit, wherein the enriching comprises applying the first statistical method to the input text to generate an intermediate result, and after the first statistical method has been applied, applying the second statistical method to at least a portion of the intermediate result to generate an enriched representation of the input text. - View Dependent Claims (20)
Specification