System and method for processing out of vocabulary compound words
First Claim
Patent Images
1. A computer-implemented method for out-of-vocabulary compound word handling comprising:
- storing a plurality of compound word rules and compound word dictionaries in a database;
evaluating membership criteria associated with a received compound word, wherein membership criteria includes at least one of dictionary based or part of speech (POS) based criteria;
applying one or more filtering rules to the received compound word;
identifying the received compound word as an out-of-vocabulary compound word based upon, at least in part, the one or more filtering rules, wherein the one or more filtering rules includes determining whether one of a first word and a last word of the received compound word is one of the first word and the last word in the compound word dictionaries;
predicting a compound prominence pattern associated with the received compound word based upon, at least in part, identifying the received compound word as the out-of-vocabulary compound word, wherein the compound prominence pattern includes a default compound prominence when a no confidence situation exists for predicting the compound prominence pattern associated with the received compound word in the out-of-vocabulary compound word handling for text-to-speech synthesis, wherein the default compound prominence includes one of right side prominence and left side prominence for a portion of the compound word; and
generating an output representative of the received compound word based upon, at least in part, the default compound prominence.
7 Assignments
0 Petitions
Accused Products
Abstract
A system and method for out-of-vocabulary compound word handling is provided. Embodiments may include storing a plurality of compound word rules and compound word dictionaries in a database. Embodiments may also include evaluating membership criteria associated with a received compound word, wherein membership criteria includes at least one of dictionary based or part of speech (POS) based criteria. Embodiments may further include applying one or more filtering rules to the received compound word.
-
Citations
20 Claims
-
1. A computer-implemented method for out-of-vocabulary compound word handling comprising:
-
storing a plurality of compound word rules and compound word dictionaries in a database; evaluating membership criteria associated with a received compound word, wherein membership criteria includes at least one of dictionary based or part of speech (POS) based criteria; applying one or more filtering rules to the received compound word; identifying the received compound word as an out-of-vocabulary compound word based upon, at least in part, the one or more filtering rules, wherein the one or more filtering rules includes determining whether one of a first word and a last word of the received compound word is one of the first word and the last word in the compound word dictionaries; predicting a compound prominence pattern associated with the received compound word based upon, at least in part, identifying the received compound word as the out-of-vocabulary compound word, wherein the compound prominence pattern includes a default compound prominence when a no confidence situation exists for predicting the compound prominence pattern associated with the received compound word in the out-of-vocabulary compound word handling for text-to-speech synthesis, wherein the default compound prominence includes one of right side prominence and left side prominence for a portion of the compound word; and generating an output representative of the received compound word based upon, at least in part, the default compound prominence. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A non-transitory computer-readable storage medium having stored thereon instructions, which when executed by a processor result in one or more operations for out-of-vocabulary compound word handling, the operations comprising:
-
storing a plurality of compound word rules and compound word dictionaries in a database; evaluating membership criteria associated with a received compound word, wherein membership criteria includes at least one of dictionary based or part of speech (POS) based criteria; applying one or more filtering rules to the received compound word; identifying the received compound word as an out-of-vocabulary compound word based upon, at least in part, the one or more filtering rules, wherein the one or more filtering rules includes determining whether one of a first word and a last word of the received compound word is one of the first word and the last word in the compound word dictionaries; predicting a compound prominence pattern associated with the received compound word based upon, at least in part, identifying the received compound word as the out-of-vocabulary compound word, wherein the compound prominence pattern includes a default compound prominence when a no confidence situation exists for predicting the compound prominence pattern associated with the received compound word in the out-of-vocabulary compound word handling for text-to-speech synthesis, wherein the default compound prominence includes one of right side prominence and left side prominence for a portion of the compound word; and generating an output representative of the received compound word based upon, at least in part, the default compound prominence. - View Dependent Claims (12, 13, 14, 15, 16, 17, 19, 20)
-
-
18. A system configured to perform out-of-vocabulary compound word handling comprising:
one or more processors configured to allow for storing a plurality of compound word rules and compound word dictionaries in a database, the one or more processors further configured to evaluate membership criteria associated with a received compound word, wherein membership criteria includes at least one of dictionary based or part of speech (POS) based criteria, the one or more processors further configured to identify the received compound word as an out-of-vocabulary compound word based upon, at least in part, the one or more filtering rules, wherein the one or more filtering rules determining whether one of a first word and a last word of the received compound word is one of the first word and the last word in the compound word dictionaries, the one or more processors further configured to apply one or more filtering rules to the received compound word, the one or more processors further configured to predict a compound prominence pattern associated with the received compound word based upon, at least in part, identifying the received compound word as the out-of-vocabulary compound word, wherein the compound prominence pattern includes a default compound prominence when a no confidence situation exists for predicting the compound prominence pattern associated with the received compound word in the out-of-vocabulary compound word handling for text-to-speech synthesis, wherein the default compound prominence includes one of right side prominence and left side prominence for a portion of the compound word, the one or more processors further configured to generate an output representative of the received compound word based upon, at least in part, the default compound prominence.
Specification