Automatic Text Correction
First Claim
1. A method of generating text transformation rules (210, 212, 214) for an automatic text correction by making use of at least one erroneous training text (204) and a corresponding correct reference text (200), the method comprising the steps of:
- comparing the at least one erroneous training text with the correct reference text, deriving a set of text transformation rules (210, 212, 214) by making use of deviations between the training text and the reference text, the deviations being detected by means of the comparison, evaluating the set of text transformation rules by applying each transformation rule to the training text, selecting of at least one of the set of evaluated text transformation rules for the automatic text correction.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides a method of generating text transformation rules for speech to text transcription systems. The text transformation rules are generated by means of comparing an erroneous text generated by a speech to text transcription system with a correct reference text. Comparison of erroneous and reference text allows to derive a set of text transformation rules that are evaluated by means of a strict application to the training text and successive comparison with the reference text. Evaluation of text transformation rules provides a sufficient approach to determine which of the automatically generated text transformation rules provide an enhancement or degradation of the erroneous text. In this way only those text transformation rules of the set of text transformation rules are selected that guarantee an enhancement of the erroneous text. In this way systematic errors of an automatic speech recognition or natural language process system can be effectively compensated.
212 Citations
14 Claims
-
1. A method of generating text transformation rules (210, 212, 214) for an automatic text correction by making use of at least one erroneous training text (204) and a corresponding correct reference text (200), the method comprising the steps of:
-
comparing the at least one erroneous training text with the correct reference text, deriving a set of text transformation rules (210, 212, 214) by making use of deviations between the training text and the reference text, the deviations being detected by means of the comparison, evaluating the set of text transformation rules by applying each transformation rule to the training text, selecting of at least one of the set of evaluated text transformation rules for the automatic text correction. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A text correction system (404) making use of text transformation rules (210, 212, 214) for correcting erroneous text, the text correction system being adapted to generate the text transformation rules by making use of at least one erroneous training text (204) and a corresponding correct reference text (200), the text correction system comprising:
-
means for comparing the at least one erroneous training text with the correct reference text, means for deriving a set of text transformation rules by making use of deviations between the training text and the reference text, the deviations being detected by means of the comparison, means for evaluating the set of text transformation rules by applying each transformation rule to the training text, means for selecting of at least one of the set of evaluated text transformation rules for the text correction system.
-
-
13. A computer program product for generating text transformation rules for a text correction system (404), the computer program product being adapted to process at least one erroneous training text (204) and a corresponding correct reference text (200), the computer program product comprising program means being operable to:
-
compare the at least one erroneous training text with the correct reference text, derive a set of text transformation rules (210, 212, 214) by making use of deviations between the training text and the reference text, the deviations being detected by means of the comparison, evaluate the set of text transformation rules by applying each transformation rule to the training text, select at least one of the set of evaluated text transformation rules for the text correction system.
-
-
14. A speech to text transformation system for transcribing speech into text, the speech to text transformation system having a text correction module (404) making use of text transformation rules (210, 212, 214) for correcting errors of the text and having a rule generation module (414) for generating the text transformation rules by making use of at least one erroneous training text being generated by the speech to text transformation system and a corresponding correct reference text, the speech to text transformation system comprising:
-
a storage module (408) for storing the reference and the training text, a comparator module (412) for comparing the at least one erroneous training text with the correct reference text, a transformation rule generator (414) for deriving a set of text transformation rules, the transformation rule generator being adapted to make use of deviations between the training text and the reference text, the deviation being detected by means of the processing module, an evaluator (410) being adapted to evaluate the set of text transformation rules by applying each transformation rule to the training text, a selection module (420) for selecting of at least one of the set of evaluated text transformation rules for the text correction module.
-
Specification