Select a recognition error by comparing the phonetic
Abstract
A correction device (4) for a speech recognition device (2) is provided, with which the replacement of incorrectly recognized words (FETI) of the recognized text (ETI) is especially simple to execute. The correction device (4) is based on the recognition that the phoneme sequences of incorrectly recognized words and the spoken words actually to be recognized are very similar, and automatically marks words in the recognized text (ETI) which show a phoneme sequence similar to that of a correction word (KWI) put in by the user.
13 Claims
1. A correction device (4) for correcting a text (ETI) recognized by a speech recognition device (2) for a spoken text (GTI), where the recognized text (ETI) for spoken words of the spoken text (GTI) includes correctly recognized words and incorrectly recognized words (FETI), the correction device comprising:
input means (13) for receiving at least one manually input correction word (KWI), in order to replace at least one of the incorrectly recognized words (FETI) with the at least one correction word (KWI);
transcription means (16) for phonetically transcribing at least the input correction word (KWI) into a phoneme sequence (PT(KWI));
search means (17) for finding the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences (PT(ETI)) of the words of the recognized text based on an adjustable search area, wherein a step-wise expansion of the search area is performed when the search means does not find the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in the phoneme sequences (PT(ETI)) of the words of the recognized text, and for issuing position information (PI) which identifies the position of at least one word within the recognized text (ETI) whose phoneme sequence essentially matches the phoneme sequence (PT(KWI)) of the at least one correction word (KWI); and
output means (17) for issuing said position information (PI) so as to enable a marking of the at least one word identified by the position information (PI) in the recognized text information (ETI).
8. A correction method for correcting a text (ETI) recognized by a speech recognition device (2) for a spoken text (GTI), the recognized text (ETI) for spoken words of the spoken text (GTI) including correctly recognized words and incorrectly recognized words (FETI), the method comprising the following steps:
receiving at least one manually entered correction word (KWI), so as to replace at least one of the incorrectly recognized words (FETI) with the at least one correction word (KWI);
phonetically transcribing at least the input correction word (KWI) into a phoneme sequence (PT(KWI));
searching for the phoneme sequence of the at least one correction word (KWI) in phoneme sequences (PT(ETI)) of the words of the recognized text (ETI) based on an adjustable search area, wherein a step-wise expansion of the search area is performed when the searching step does not find the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in the phoneme sequences (PT(ETI)) of the words of the recognized text, and issuing position information (PI) which identifies the position of at least one word within the recognized text (ETI) whose phoneme sequence essentially matches the phoneme sequence of the at least one correction word (KWI); and
issuing the position information (PI) so as to enable marking of the at least one word identified by the position information (PI) in the recognized text information (ETI).
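The searching step of claim 8 can be illustrated with a minimal sketch. It is not the patented implementation: the function name `find_with_expansion`, the symmetric radius, the `step` parameter, and the ARPABET-style phoneme symbols are all assumptions made for illustration, and exact equality stands in for the "essentially matches" test.

```python
from typing import Optional, Sequence, Tuple

Phonemes = Tuple[str, ...]  # one word's phoneme sequence, e.g. ("K", "AE", "T")


def find_with_expansion(
    correction: Phonemes,
    text_phonemes: Sequence[Phonemes],
    center: int,
    start_radius: int = 2,
    step: int = 2,
) -> Optional[int]:
    """Return position information (a word index) for the correction word.

    Hypothetical helper: the search area starts as `start_radius` words on
    each side of `center` and is widened by `step` words per round until a
    match is found or the area covers the whole recognized text.
    """
    if step <= 0:
        raise ValueError("step must be positive so the area can grow")
    n = len(text_phonemes)
    radius = start_radius
    while True:
        lo = max(0, center - radius)
        hi = min(n, center + radius + 1)
        # Scan the current search area in reading order.
        for i in range(lo, hi):
            if text_phonemes[i] == correction:
                return i
        if lo == 0 and hi == n:
            return None  # whole text searched, nothing found
        radius += step  # step-wise expansion of the search area


# Recognized text "the cat sat on the mat" as toy phoneme sequences.
recognized = [
    ("DH", "AH"), ("K", "AE", "T"), ("S", "AE", "T"),
    ("AA", "N"), ("DH", "AH"), ("M", "AE", "T"),
]
position = find_with_expansion(("M", "AE", "T"), recognized, center=1, start_radius=1)
```

In this sketch the first two rounds find nothing, and the third round, which covers the whole text, returns the index of the matching word.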
9. A correction method as claimed in claim 8, wherein the following further process step is performed:
interrupting a synchronous reproduction, in which the spoken words of the spoken text (GTI) are acoustically reproduced and the recognized words of the recognized text (ETI) are synchronously optically marked for the spoken words (GTI), when a correction word (KWI) is manually put in.
10. A correction method as claimed in claim 9, wherein the following further process step is performed:
terminating the interruption of the synchronous reproduction when the replacement of the at least one word identified by the position information (PI) with the at least one correction word (KWI) has been confirmed by manual input of a confirmation.
11. A correction method as claimed in claim 9, wherein the following further process step is performed:
searching for the phoneme sequence of the at least one correction word (KWI) in the phoneme sequences of the words contained in a search area of the recognized text (ETI), the search area being defined by a number of M words before and N words behind the last marked word in the recognized text (ETI) before the interruption of the synchronous reproduction.
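As a small illustration of the search area in claim 11, the word indices it covers might be computed as follows. The function name `search_area` is hypothetical, and including the last marked word itself in the area is an assumption, since the claim only names the M words before and the N words behind it.

```python
def search_area(last_marked: int, m: int, n: int, text_len: int) -> range:
    """Word indices from M words before to N words behind `last_marked`.

    Assumption: the last marked word itself belongs to the area. The bounds
    are clipped to the start and end of the recognized text.
    """
    start = max(0, last_marked - m)
    stop = min(text_len, last_marked + n + 1)
    return range(start, stop)


# Word 10 was marked last when playback was interrupted; M = 3, N = 2.
area = search_area(last_marked=10, m=3, n=2, text_len=20)
```

Here `area` covers word indices 7 through 12. Near the beginning or end of the text the area is simply truncated.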
12. A correction method as claimed in claim 8, wherein the following further process step is performed:
searching for the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences determined by the speech recognition device (2) from the spoken words of the spoken text (GTI).
13. A correction method as claimed in claim 8, wherein the following further process step is performed:
searching for essentially matching phoneme sequences, the phonemes that differ from the compared phoneme sequences but sound similar being ignored.
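One way to picture the matching of claim 13 is to map similar-sounding phonemes to a shared class before comparing, so that differences within a class are ignored. The sketch below makes that assumption explicit: the table `SIMILAR_CLASSES`, its groupings, and the function names are invented for illustration and are not taken from the patent.

```python
from typing import Dict, Tuple

Phonemes = Tuple[str, ...]

# Hypothetical grouping of similar-sounding phonemes (ARPABET-style symbols).
SIMILAR_CLASSES: Dict[str, str] = {
    "P": "P/B", "B": "P/B",
    "T": "T/D", "D": "T/D",
    "S": "S/Z", "Z": "S/Z",
    "M": "M/N", "N": "M/N",
}


def normalize(seq: Phonemes) -> Phonemes:
    """Replace each phoneme by its similarity class, if it has one."""
    return tuple(SIMILAR_CLASSES.get(p, p) for p in seq)


def essentially_match(a: Phonemes, b: Phonemes) -> bool:
    """True if the sequences differ only by similar-sounding phonemes."""
    return normalize(a) == normalize(b)


bat_vs_pad = essentially_match(("B", "AE", "T"), ("P", "AE", "D"))
bat_vs_cat = essentially_match(("B", "AE", "T"), ("K", "AE", "T"))
```

With this table, "bat" and "pad" essentially match because B/P and T/D fall into the same classes, while "bat" and "cat" do not. A real system would presumably tune such a table to the language and to the recognizer's typical confusions.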
Specification