Automated transformation from American English to British English
First Claim
1. A method of generating a British English phonemic transcription from a word and an American English phonemic transcription of the word using a computer system:
- providing the word to a first computer program, the first computer program transforming the spelling of the word according to British English spelling conventions;
providing the word to a second computer program, the second computer program for lexically transforming the word to a semantically equivalent word used in British English and marking the word when no semantic equivalent is found and when the semantic equivalent cannot be identified; and
applying phoneme conversion rules with a third computer program to transform the American English phonemic transcription to the British English phonemic transcription.
3 Assignments
0 Petitions
Accused Products
Abstract
A method of transforming a voice application program designed for US English speakers to a voice application program for UK English speakers using a computer system is described. In one embodiment, scripts and grammars associated with the voice application program are converted from US-to-UK English. The process includes spelling normalization, lexical normalization, and pronunciation conversion (including where appropriate accounting for stress shifts). The result is necessary word pronunciations for speech recognition of UK English speaker (especially for proper nouns) as well as a script that has been conformed to use UK English spelling and lexical conventions. Additionally, the script can be annotated with pronunciations as a part of the process. Further, in one embodiment a web based interface to the conversion process is provided either standalone or as part of a voice application development environment.
189 Citations
22 Claims
-
1. A method of generating a British English phonemic transcription from a word and an American English phonemic transcription of the word using a computer system:
-
providing the word to a first computer program, the first computer program transforming the spelling of the word according to British English spelling conventions;
providing the word to a second computer program, the second computer program for lexically transforming the word to a semantically equivalent word used in British English and marking the word when no semantic equivalent is found and when the semantic equivalent cannot be identified; and
applying phoneme conversion rules with a third computer program to transform the American English phonemic transcription to the British English phonemic transcription. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 15, 16, 18, 19, 20, 21, 22)
-
-
13. An apparatus for transforming a voice application prepared for American English speakers for use by British English speakers, the apparatus comprising:
-
a first means for analyzing the voice application to identify one or more scripts and one or more grammars, the one or more scripts corresponding to textual material for presentation by one of a text-to-speech system and a human voice talent, the one or more grammars corresponding to descriptions of words that the voice application must be capable of responding to in one or more states;
a second means for automatically performing spelling and lexical normalization of the one or more scripts and the one or more grammars from American English to British English, the normalization producing one or more predetermined markings, the markings being indicative of words likely to require manual review;
a third means for permitting manual review of the normalized scripts and grammars; and
a fourth means for performing phonemic conversions for words in the scripts and the grammars from American English pronunciations to British English pronunciations
-
-
17. A method of extending the CPA to indicate word stress in a phonemic transcription represented in the computer phonetic alphabet (CPA), the method comprising:
-
selecting at least two lower ASCII characters;
selecting a first of the two characters to indicate primary word stress and a second of the two characters to indicate secondary word stress; and
representing word stress to computer programs in CPA phonemic transcriptions by placing the two characters in front of the CPA symbol for the start of the stressed portion of the word.
-
Specification