Method for compressing dictionary data
First Claim
1. A method for pre-processing a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units, the method comprising:
aligning said sequence of character units and said sequence of phoneme units using a statistical algorithm so that the alignment between said character units and said phoneme units is determined; and
interleaving said aligned sequence of character units and said aligned sequence of phoneme units by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
Abstract
The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
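The interleaving step can be illustrated with a minimal Python sketch. It assumes the alignment has already produced two sequences of equal length, with a hypothetical placeholder unit `_` standing for "no phoneme", and that each phoneme unit is placed directly after its character unit (one possible choice of the "predetermined location"). The function names, the placeholder, and the example phoneme symbols are illustrative assumptions and do not come from the patent text.

```python
from typing import List, Sequence, Tuple

# Hypothetical placeholder for a character that produces no phoneme.
EPSILON = "_"


def interleave(chars: Sequence[str], phonemes: Sequence[str]) -> List[str]:
    """Merge two aligned sequences into one: char, phoneme, char, phoneme, ..."""
    if len(chars) != len(phonemes):
        raise ValueError("aligned sequences must have the same length")
    merged: List[str] = []
    for char_unit, phoneme_unit in zip(chars, phonemes):
        merged.append(char_unit)      # character unit first
        merged.append(phoneme_unit)   # its phoneme unit directly after it
    return merged


def deinterleave(units: Sequence[str]) -> Tuple[List[str], List[str]]:
    """Recover the two aligned sequences from the interleaved one."""
    return list(units[0::2]), list(units[1::2])


# Toy entry with made-up phoneme symbols; "g" and "h" map to no phoneme.
entry_chars = list("night")
entry_phonemes = ["n", "aI", EPSILON, EPSILON, "t"]
merged_entry = interleave(entry_chars, entry_phonemes)
print(merged_entry)
```

Because the position of each phoneme unit relative to its character unit is fixed, the two original sequences can be recovered from the interleaved one, as `deinterleave` shows.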
189 Citations
15 Claims
1. A method for pre-processing a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units, the method comprising:
aligning said sequence of character units and said sequence of phoneme units using a statistical algorithm so that the alignment between said character units and said phoneme units is determined; and
interleaving said aligned sequence of character units and said aligned sequence of phoneme units by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
Dependent claims: 2, 3, 4, 5, 6, 7
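The claim does not say which statistical algorithm performs the alignment. As one possible illustration only, the sketch below uses dynamic programming to find the most probable monotone alignment in which each character unit maps to either exactly one phoneme unit or a hypothetical "no phoneme" placeholder `_`. The probability table, its toy values, the `floor` parameter, and the function name are all assumptions; in practice such a table would have to be estimated from the dictionary itself. The sketch also assumes an entry has no more phoneme units than character units.

```python
import math
from typing import Dict, List, Sequence, Tuple

EPSILON = "_"  # hypothetical "no phoneme" placeholder


def align(chars: Sequence[str], phonemes: Sequence[str],
          prob: Dict[Tuple[str, str], float], floor: float = 1e-6) -> List[str]:
    """Pad `phonemes` with EPSILON to len(chars), maximizing the product of
    prob[(char, phoneme)] over all order-preserving alignments."""
    n, m = len(chars), len(phonemes)
    if m > n:
        raise ValueError("sketch assumes no more phonemes than characters")

    def log_p(c: str, p: str) -> float:
        return math.log(prob.get((c, p), floor))  # unseen pairs get `floor`

    neg_inf = float("-inf")
    # best[i][j]: best log-probability for first i chars and first j phonemes
    best = [[neg_inf] * (m + 1) for _ in range(n + 1)]
    # used[i][j]: True if char i consumed phoneme j on the best path
    used = [[False] * (m + 1) for _ in range(n + 1)]
    best[0][0] = 0.0
    for i in range(1, n + 1):
        c = chars[i - 1]
        for j in range(0, min(i, m) + 1):
            if best[i - 1][j] > neg_inf:          # char i emits no phoneme
                score = best[i - 1][j] + log_p(c, EPSILON)
                if score > best[i][j]:
                    best[i][j], used[i][j] = score, False
            if j >= 1 and best[i - 1][j - 1] > neg_inf:  # char i emits phoneme j
                score = best[i - 1][j - 1] + log_p(c, phonemes[j - 1])
                if score > best[i][j]:
                    best[i][j], used[i][j] = score, True

    # Trace the best path back from the final cell.
    aligned: List[str] = []
    i, j = n, m
    while i > 0:
        if used[i][j]:
            aligned.append(phonemes[j - 1])
            j -= 1
        else:
            aligned.append(EPSILON)
        i -= 1
    aligned.reverse()
    return aligned


# Toy probabilities, invented for illustration only.
toy_prob = {("n", "n"): 0.9, ("i", "aI"): 0.8, ("g", EPSILON): 0.9,
            ("h", EPSILON): 0.9, ("t", "t"): 0.9}
aligned_phonemes = align(list("night"), ["n", "aI", "t"], toy_prob)
print(aligned_phonemes)
```

The output has one phoneme unit per character unit, which is the form the interleaving step needs.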
8. A computer program product loadable into the memory of a data processing device, comprising a code which is executable in the data processing device causing the data processing device to:
retrieve from the memory a pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units;
align said sequence of character units and said sequence of phoneme units using a statistical algorithm; and
interleave said aligned sequence of character units and said aligned sequence of phoneme units by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
9. A data processing device comprising memory for storing a pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units, wherein
the data processing device is configured to retrieve from the memory a pronunciation dictionary comprising at least one entry;
the data processing device is configured to align said sequence of character units and said sequence of phoneme units using a statistical algorithm; and
the data processing device is configured to interleave said aligned sequence of character units and said aligned sequence of phoneme units by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
Dependent claims: 10, 11, 12, 13, 14, 15
Specification