METHOD AND APPARATUS FOR GENERATING MULTILINGUAL TRANSCRIPTION GROUPS
First Claim
1. ) A computer readable storage medium comprising a data structure containing a speech recognition dictionary suitable for use in a speech recognizer, said speech recognition dictionary comprising a set of vocabulary items, the set of vocabulary items including a sub-set of vocabulary items, each vocabulary item in the sub-set being associated to a group of transcriptions belonging to languages selected from a pool of available languages, each vocabulary item in said sub-set of vocabulary items having characteristics allowing to divide the pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group, the group of transcriptions associated with each vocabulary item from the sub-set of vocabulary items being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item.
10 Assignments
0 Petitions
Accused Products
Abstract
The invention relates to a method and apparatus for generating transcriptions suitable for use in a speech-processing device. The invention provides processing the vocabulary item to derive a characteristic from the vocabulary item allowing to divide a pool of available languages in a first sub-group and a second sub-group. The vocabulary item manifests a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group. The invention further provides processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item. The group of transcriptions is then released for use by a speech-processing device.
17 Citations
20 Claims
- 1. ) A computer readable storage medium comprising a data structure containing a speech recognition dictionary suitable for use in a speech recognizer, said speech recognition dictionary comprising a set of vocabulary items, the set of vocabulary items including a sub-set of vocabulary items, each vocabulary item in the sub-set being associated to a group of transcriptions belonging to languages selected from a pool of available languages, each vocabulary item in said sub-set of vocabulary items having characteristics allowing to divide the pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group, the group of transcriptions associated with each vocabulary item from the sub-set of vocabulary items being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item.
-
6. ) A method for generating a group of transcriptions suitable for use in a speech processing device, said method comprising:
-
providing a vocabulary item;
processing the vocabulary item to derive a characteristic from said vocabulary item allowing to divide a pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group;
processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item;
storing the group of transcriptions in a format suitable for use by a speech processing device. - View Dependent Claims (7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18)
-
-
12. ) An apparatus for generating a group of transcriptions suitable for use in a speech processing device, said apparatus comprising:
-
an input for receiving signal conveying data representative of a vocabulary item;
a processing unit coupled to said input, said processing unit being operative for;
(a) processing the vocabulary item to derive a characteristic from said vocabulary item allowing to divide a pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group;
(b) processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item;
an output coupled to said processing unit for releasing a signal representative of the group of transcriptions.
-
-
19. ) A computer readable storage medium comprising a program element suitable for execution by a computing apparatus for generating a group of transcriptions, said computing apparatus comprising:
-
a memory unit for storing an electronic representation of a vocabulary item;
a processor operatively connected to said memory unit, said program element when executing on said processor being operative for;
(a) processing the vocabulary item to derive a characteristic from said vocabulary item allowing to divide a pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group;
(b) processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item;
(c) releasing an electronic representation of the group of transcriptions in a format suitable for use by a speech processing device.
-
-
20. ) An apparatus for generating a group of transcriptions suitable for use in a speech processing device, said apparatus comprising:
-
means for receiving data elements representative of a vocabulary item;
means for processing the vocabulary item to derive a characteristic from said vocabulary item allowing to divide a pool of available languages in a first sub-group and a second sub-group, the vocabulary item manifesting a higher probability of belonging to any one of the languages in the first sub-group than belonging to any language in the second sub-group;
means for processing the vocabulary item to generate a group of transcriptions, the group of transcriptions being characterized by the absence of at least one transcription belonging to a language in the second sub-group of languages established for the vocabulary item;
means for releasing the group of transcriptions in a format suitable for use by a speech processing device.
-
Specification