Improving speech recognition through text-based linguistic post-processing
First Claim
1. A method for improving speech recognition, comprising the steps of:
- collecting text data generated from a speech recognition system;
collecting a corresponding true transcription of the speech recognition text data;
aligning the text data generated from the speech recognition system with the corresponding true transcription of text data, wherein the aligning is text-based;
generating a plurality of correction rules from differences in alignment between the text data generated from the speech recognition system and the corresponding true transcription of text data; and
applying the plurality of correction rules to new text data generated from a speech recognition system.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention discloses a method and system for improving speech recognition. In this invention, there is a training phase where text data generated from a speech recognition system is collected and aligned with a corresponding true transcription of the speech recognition text data. A preliminary set of correction rules are generated and observed against a corpus of fully verified text data. Rules that are applicable are validated, while invalid rules are updated. The updated rules are then applied to the parallel sample of speech recognition text data and corresponding text data, as well as the corpus of text data. The rules are examined again to determine their validity. This process continues until all of the rules have been validated or until no further progress is made. The finalized correction rules are then put into a production phase.
93 Citations
18 Claims
-
1. A method for improving speech recognition, comprising the steps of:
-
collecting text data generated from a speech recognition system; collecting a corresponding true transcription of the speech recognition text data; aligning the text data generated from the speech recognition system with the corresponding true transcription of text data, wherein the aligning is text-based; generating a plurality of correction rules from differences in alignment between the text data generated from the speech recognition system and the corresponding true transcription of text data; and applying the plurality of correction rules to new text data generated from a speech recognition system. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for improving speech recognition, comprising:
-
a text aligner for aligning text data generated from a speech recognition system with a corresponding true transcription of the speech recognition text data, wherein the aligning is text-based; a rule generator coupled to the text aligner for generating a plurality of correction rules from differences in alignment between the speech recognition text data and the corresponding true transcription of text data; and a rule administrator for applying the plurality of correction rules to new text data generated from a speech recognition system. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification