Abbreviation and acronym/initialism expansion procedures for a text to speech reader
First Claim
1. A computer system for converting a text message into audio signals, including:
- a text to speech converter that translates text into audio signals;
a text reader control procedure for receiving said text message, said text message having a sequence of words;
a classifier procedure, coupled to said text reader control procedure, that classifies specified words in said text message with regard to syntax, punctuation and membership in predefined classes of words, said predefined classes of words including numbers, measurement units, geographic designations, and date/time values;
said classifier procedure generating classification values for said words in said text message; and
an abbreviation expanding procedure that receives said classification values and said text message, identifies abbreviation words in said text message that match predefined abbreviations and identifying a set of potential expansion values for each said identified abbreviation word, evaluates based on said classification values for words neighboring said identified words and in accordance with a predefined set of rules which, if any, of said potential expansion values is applicable, and substituting said potential expansion value for said identified abbreviation word when said evaluation yields a success value;
wherein said text reader control procedure includes instructions for receiving expansion values from said abbreviation expanding procedure, and passing to said text to speech converter for each word in said text message said expansion value, if any, generated by said abbreviation expanding procedure for said word.
6 Assignments
0 Petitions
Accused Products
Abstract
An improved text-to-speech synthesizer that employs a text to speech converter, a text reader control procedure, a classifier procedure, an abbreviation expansion procedure, and an acronym/initialism expanding procedure is herein described. A classifier procedure is used to classify generate classification values for each word in the text message with regard to syntax, punctuation and membership in predefined classes of words, the predefined classes of words including number, measurement units, geographic designations, and date/time values. An abbreviation expansion procedure evaluates, based on the classification values for words neighboring the identified words, which, if any, of the potential expansion values is applicable, and substitutes the potential expansion for the identified abbreviation word when evaluation yields a success value. An acronym/initialism expanding procedure identifies words in the text message that are acronyms and initialisms, parses pronounceable syllables within the identified words and generates a substitute string that can consist of any combination of letters, numbers, pronounceable syllables or multiple letter identifiers.
-
Citations
13 Claims
-
1. A computer system for converting a text message into audio signals, including:
-
a text to speech converter that translates text into audio signals; a text reader control procedure for receiving said text message, said text message having a sequence of words; a classifier procedure, coupled to said text reader control procedure, that classifies specified words in said text message with regard to syntax, punctuation and membership in predefined classes of words, said predefined classes of words including numbers, measurement units, geographic designations, and date/time values;
said classifier procedure generating classification values for said words in said text message; andan abbreviation expanding procedure that receives said classification values and said text message, identifies abbreviation words in said text message that match predefined abbreviations and identifying a set of potential expansion values for each said identified abbreviation word, evaluates based on said classification values for words neighboring said identified words and in accordance with a predefined set of rules which, if any, of said potential expansion values is applicable, and substituting said potential expansion value for said identified abbreviation word when said evaluation yields a success value; wherein said text reader control procedure includes instructions for receiving expansion values from said abbreviation expanding procedure, and passing to said text to speech converter for each word in said text message said expansion value, if any, generated by said abbreviation expanding procedure for said word. - View Dependent Claims (5)
-
-
2. A computer system for converting a text message into audio signals, including:
-
a text to speech converter that translates text into audio signals; a text reader control procedure for receiving said text message, said text message having a sequence of words; an acronym/initialism expanding procedure, coupled to said text reader control procedure, that identifies words in said text message that are acronyms and initialisms, parsing pronounceable syllables within said identified words and generating a substitute string comprising a sequence of units, each unit selected from the set consisting of a letter, number, pronounceable syllable and multiple letter identifier;
said acronym/initialism expanding procedure including instructions for separating pronounceable syllables and individual letters not pronounceable as syllables from an identified word in said text message in accordance with predefined word parsing criteria;wherein said text reader control procedure includes instructions for receiving said substitute strings from said acronym/initialism expanding procedure, and passing to said text to speech converter for each word in said text message said substitute string, if any, generated by said acronym/initialism expanding procedure.
-
-
3. A computer system for converting a text message into audio signals, including:
-
a text to speech converter that translates text into audio signals; a text reader control procedure for receiving said text message, said text message having a sequence of words; a classifier procedure, coupled to said text reader control procedure, that classifies specified words in said text message with regard to syntax, punctuation and membership in predefined classes of words, said predefined classes of words including numbers, measurement units, geographic designations, and date/time values;
said classifier procedure generating classification values for said words in said text message;an abbreviation expanding procedure that receives said classification values and said text message, identifies abbreviation words in said text message that match predefined abbreviations and identifying a set of potential expansion values for each said identified abbreviation word, evaluates based on said classification values for words neighboring said identified words which, if any, of said potential expansion values is applicable, and substituting said potential expansion value for said identified abbreviation word when said evaluation yields a success value; and an acronym/initialism expanding procedure, coupled to said classifier procedure, that identifies words in said text message that are acronyms and initialisms, parsing pronounceable syllables within said identified words and generating a substitute string comprising a sequence of units, each unit selected from the set consisting of a letter, number, pronounceable syllable and multiple letter identifier;
said acronym/initialism expanding procedure including instructions for separating pronounceable syllables and individual letters not pronounceable as syllables from an identified word in said text message in accordance with predefined word parsing criteria;wherein said text reader control procedure includes instructions for receiving expansion values from said abbreviation expanding procedure and said substitute strings from said acronym/initialism expanding procedure, and passing to said text to speech converter for each word in said text message (A) said expansion value, if any, generated by said abbreviation expanding procedure for said word, (B) said substitute string, if any, generated by said acronym/initialism expanding procedure, and (C) said word otherwise. - View Dependent Claims (4)
-
-
6. A memory for storing data for access by programs being executed on a data processing system, said memory comprising:
-
a text to speech converter that translates text into audio signals; a text reader control procedure for receiving said text message, said text message having a sequence of words; a classifier procedure, coupled to said text reader control procedure, that classifies specified words in said text message with regard to syntax, punctuation and membership in predefined classes of words, said predefined classes of words including numbers, measurement units, geographic designations, and date/time values;
said classifier procedure generating classification values for said words in said text message; andan abbreviation expanding procedure that receives said classification values and said text message, identifies abbreviation words in said text message that match predefined abbreviations and identifying a set of potential expansion values for each said identified abbreviation word, evaluates based on said classification values for words neighboring said identified words and in accordance with a predefined set of rules which, if any, of said potential expansion values is applicable, and substituting said potential expansion value for said identified abbreviation word when said evaluation yields a success value; wherein said text reader control procedure includes instructions for receiving expansion values from said abbreviation expanding procedure, and passing to said text to speech converter for each word in said text message said expansion value, if any, generated by said abbreviation expanding procedure for said word. - View Dependent Claims (7, 8)
-
-
9. A memory for storing data for access by programs being executed on a data processing system, said memory comprising:
-
a text to speech converter that translates text into audio signals; a text reader control procedure for receiving said text message, said text message having a sequence of words; an acronym/initialism expanding procedure, coupled to said text reader control procedure, that identifies words in said text message that are acronyms and initialisms, parsing pronounceable syllables within said identified words and generating a substitute string comprising a sequence of units, each unit selected from the set consisting of a letter, number, pronounceable syllable and multiple letter identifier;
said acronym/initialism expanding procedure including instructions for separating pronounceable syllables and individual letters not pronounceable as syllables from an identified word in said text message in accordance with predefined word parsing criteria;wherein said text reader control procedure includes instructions for receiving said substitute strings from said acronym/initialism expanding procedure, and passing to said text to speech converter for each word in said text message said substitute string, if any, generated by said acronym/initialism expanding procedure.
-
-
10. A method of converting a text message into audio signals, comprising the steps of:
-
receiving said text message, said text message having a sequence of words; classifying specified words in said text message with regard to syntax, punctuation and membership in predefined classes of words, said predefined classes of words including numbers, measurement units, geographic designations, and date/time values;
said classifying step generating classification values for said words in said text message; andidentifying abbreviation words in said text message that match predefined abbreviations and identifying a set of potential expansion values for each said identified abbreviation word; evaluating based on said classification values for words neighboring said identified words and in accordance with a predefined set of rules which, if any, of said potential expansion values is applicable, and substituting in said received text message said potential expansion value for said identified abbreviation word when said evaluation yields a success value; and translating said received text message with said substituted expansion values, if any, into audio signals. - View Dependent Claims (11, 12)
-
-
13. A method of converting a text message into audio signals, comprising the steps of:
-
receiving said text message, said text message having a sequence of words; identifying words in said text message that are acronyms and initialisms, parsing pronounceable syllables within said identified words and generating a substitute string comprising a sequence of units, each unit selected from the set consisting of a letter, number, pronounceable syllable and multiple letter identifier;
said substitute string generating step including separating pronounceable syllables and individual letters not pronounceable as syllables from an identified word in said text message in accordance with predefined word parsing criteria;substituting in said received text message for each word in said text, said substitute string, if any, generated by said acronyms and initialisms identifying, parsing and generating steps; and translating said received text message with said substitute strings, if any, into audio signals.
-
Specification