SYSTEM AND METHOD FOR TEXT NORMALIZATION IN NOISY CHANNELS
First Claim
1. A method for text normalization in a plurality of noisy channels, performed on a computing device having a processor, memory, and one or more code sets stored in the memory and executing in the processor, the method comprising:
receiving, by the processor, a text entry and channel origin data of the text entry;
determining, by the processor, whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry;
if the text entry is determined to have a matching IV entry:
outputting, by the processor, the matching IV entry; and
if the text entry is determined to be an OOV entry:
implementing, by the processor, a channel-specific error-type adapter framework based on the channel origin data;
wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated;
normalizing, by the processor, the text entry using the channel-specific error-type adapter framework; and
outputting one or more candidate normalized forms of the text entry.
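The flow recited in the claim can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the tiny `VOCABULARY`, the channel names `"sms"` and `"ocr"`, the two adapter functions, and their lookup tables are all hypothetical. The sketch only shows the control flow: return an in-vocabulary match directly, otherwise pick an adapter by channel origin and return its candidate normalized forms.

```python
import re
from typing import Callable, Dict, List

# Hypothetical in-vocabulary (IV) lexicon; a real system would use a full dictionary.
VOCABULARY = {"great", "tomorrow", "see", "you", "hello", "soon"}

# Hypothetical channel-specific resources (illustrative only).
SMS_ABBREVIATIONS = {"gr8": "great", "2moro": "tomorrow", "u": "you"}
OCR_CONFUSIONS = str.maketrans({"0": "o", "1": "l", "5": "s"})


def sms_adapter(token: str) -> List[str]:
    """Assumed SMS error types: abbreviations and elongated letters."""
    candidates: List[str] = []
    if token in SMS_ABBREVIATIONS:
        candidates.append(SMS_ABBREVIATIONS[token])
    # Collapse runs of three or more identical characters to one, then to two.
    for replacement in (r"\1", r"\1\1"):
        collapsed = re.sub(r"(.)\1{2,}", replacement, token)
        if collapsed in VOCABULARY and collapsed not in candidates:
            candidates.append(collapsed)
    return candidates


def ocr_adapter(token: str) -> List[str]:
    """Assumed OCR error type: digits misread in place of letters."""
    fixed = token.translate(OCR_CONFUSIONS)
    return [fixed] if fixed in VOCABULARY else []


# Adapter selection keyed by channel origin data.
ADAPTERS: Dict[str, Callable[[str], List[str]]] = {
    "sms": sms_adapter,
    "ocr": ocr_adapter,
}


def normalize(text_entry: str, channel_origin: str) -> List[str]:
    """Return the IV match, or candidate normalized forms for an OOV entry."""
    token = text_entry.lower()
    if token in VOCABULARY:          # IV entry: output the match as-is
        return [token]
    adapter = ADAPTERS.get(channel_origin)
    if adapter is None:              # unknown channel: no candidates in this sketch
        return []
    return adapter(token)            # OOV entry: channel-specific normalization


if __name__ == "__main__":
    print(normalize("gr8", "sms"))
    print(normalize("he110", "ocr"))
    print(normalize("he110", "sms"))
```

In this sketch the same OOV token can normalize differently depending on its channel: `he110` yields a candidate under the hypothetical OCR adapter but none under the SMS adapter, which is the point of selecting the adapter from the channel origin data.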
Abstract
Systems and methods for text normalization in a plurality of noisy channels receive a text entry and channel origin data of the text entry; determine whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry; if the text entry is determined to have a matching IV entry, output the matching IV entry, and if the text entry is determined to be an OOV entry, implement a channel-specific error-type adapter framework based on the channel origin data, wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated; normalize the text entry using the channel-specific error-type adapter framework; and output one or more candidate normalized forms of the text entry.
24 Citations
20 Claims
1. A method for text normalization in a plurality of noisy channels, performed on a computing device having a processor, memory, and one or more code sets stored in the memory and executing in the processor, the method comprising:
receiving, by the processor, a text entry and channel origin data of the text entry;
determining, by the processor, whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry;
if the text entry is determined to have a matching IV entry:
outputting, by the processor, the matching IV entry; and
if the text entry is determined to be an OOV entry:
implementing, by the processor, a channel-specific error-type adapter framework based on the channel origin data;
wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated;
normalizing, by the processor, the text entry using the channel-specific error-type adapter framework; and
outputting one or more candidate normalized forms of the text entry.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for text normalization in a plurality of noisy channels, comprising:
a processor;
a memory; and
one or more code sets stored in the memory and executing in the processor, which, when executed, configure the processor to:
receive a text entry and channel origin data of the text entry;
determine whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry;
if the text entry is determined to have a matching IV entry:
output the matching IV entry; and
if the text entry is determined to be an OOV entry:
implement a channel-specific error-type adapter framework based on the channel origin data;
wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated;
normalize the text entry using the channel-specific error-type adapter framework; and
output one or more candidate normalized forms of the text entry.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A method for text normalization in a plurality of noisy channels, performed on a computing device having a processor, memory, and one or more code sets stored in the memory and executing in the processor, the method comprising:
receiving, by the processor, a text entry and channel origin data of the text entry;
determining, by the processor, whether the text entry matches an in-vocabulary (IV) entry or whether the text entry is an out-of-vocabulary (OOV) entry;
wherein the matching IV entry is outputted when the text entry is determined to have a matching IV entry, wherein a channel-specific error-type adapter framework is implemented based on the channel origin data when the text entry is determined to be an OOV entry, and wherein the channel-specific error-type adapter framework is optimized for a specific channel from which the text entry originated;
normalizing, by the processor, the text entry using the channel-specific error-type adapter framework; and
outputting one or more candidate normalized forms of the text entry.
Dependent claims: 20
Specification