Select a recognition error by comparing the phonetic

US 6,735,565 B2
Filed: 09/13/2002
Issued: 05/11/2004
Est. Priority Date: 09/17/2001
Status: Expired due to Term

Abstract

A correction device (4) for a speech recognition device (2) is provided, with which the replacement of incorrectly recognized words (FETI) of the recognized text (ETI) is especially simple to execute. The correction device (4) is based on the recognition that the phoneme sequences of incorrectly recognized words and the spoken words actually to be recognized are very similar, and automatically marks words in the recognized text (ETI) which show a phoneme sequence similar to that of a correction word (KWI) put in by the user.
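
The idea in the abstract can be illustrated with a minimal sketch. It assumes a toy pronunciation lexicon and uses the ratio from Python's `difflib.SequenceMatcher` as a stand-in similarity measure; the patent prescribes neither. The names `LEXICON`, `transcribe`, `similarity`, `mark_candidates` and the 0.5 threshold are hypothetical.

```python
from difflib import SequenceMatcher

# Hypothetical toy lexicon: word -> phoneme sequence (ARPAbet-like symbols).
# A real system would use a pronunciation dictionary or a
# grapheme-to-phoneme model instead.
LEXICON = {
    "i": ["AY"],
    "want": ["W", "AA", "N", "T"],
    "to": ["T", "UW"],
    "wreck": ["R", "EH", "K"],
    "a": ["AH"],
    "nice": ["N", "AY", "S"],
    "beach": ["B", "IY", "CH"],
    "speech": ["S", "P", "IY", "CH"],
}


def transcribe(word):
    """Phonetically transcribe a word via the toy lexicon (assumption)."""
    return LEXICON[word.lower()]


def similarity(seq_a, seq_b):
    """Similarity of two phoneme sequences in [0, 1] (stand-in measure)."""
    return SequenceMatcher(None, seq_a, seq_b).ratio()


def mark_candidates(recognized_words, correction_word, threshold=0.5):
    """Return positions of recognized words that sound like the correction word."""
    target = transcribe(correction_word)
    return [
        position
        for position, word in enumerate(recognized_words)
        if similarity(transcribe(word), target) >= threshold
    ]


recognized = "i want to wreck a nice beach".split()
positions = mark_candidates(recognized, "speech")
print(positions, [recognized[p] for p in positions])
```

With this lexicon, only "beach" shares enough phonemes with the typed correction word "speech" to pass the threshold, so it is the word that would be marked for replacement.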

13 Claims

1. A correction device (4) for correcting a text (ETI) recognized by a speech recognition device (2) for a spoken text (GTI), where the recognized text (ETI) for spoken words of the spoken text (GTI) includes correctly recognized words and incorrectly recognized words (FETI), the correction device comprising:
- input means (13) for receiving at least one manually input correction word (KWI), in order to replace at least one of the incorrectly recognized words (FETI) with the at least one correction word (KWI);
  
  transcription means (16) for phonetically transcribing at least the input correction word (KWI) into a phoneme sequence (PT(KWI));
  
  search means (17) for finding the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences (PT(ETI)) of the words of the recognized text based on an adjustable search area, wherein a step-wise expansion of the search area is performed when the search means does not find the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences (PT(ETI)) of the words of the recognized text, and for issuing position information (PI) which identifies the position of at least one word within the recognized text (ETI) whose phoneme sequence essentially matches the phoneme sequence (PT(KWI)) of the at least one correction word (KWI); and
  
  output means (17) for issuing said position information (PI) so as to enable a marking of the at least one word identified by the position information (PI) in the recognized text information (ETI).
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. A correction device (4) as claimed in claim 1, wherein the correction device (4) is designed for interrupting a synchronous reproduction, in which the spoken words (GTI) of the spoken text are acoustically reproduced and the recognized words of the recognized text (ETI) for the spoken words (GTI) are synchronously optically marked, when a correction word (KWI) is manually input.
  - 3. A correction device (4) as claimed in claim 2, wherein the correction device (4) is designed for terminating the interruption of the synchronous reproduction when the replacement of the at least one word identified by the position information (PI) with the at least one correction word (KWI) has been confirmed by manual input of a confirmation.
  - 4. A correction device (4) as claimed in claim 2, wherein the search means (17) are designed so as to search for the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in the phoneme sequences (PT(ETI)) of the words contained in a search area of the recognized text, said search area being defined by a number of M words before and N words behind the last marked word in the recognized text (ETI) before the interruption of the synchronous reproduction.
  - 5. A correction device (4) as claimed in claim 1, wherein the search means (17) are designed so as to search for the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences determined by the speech recognition device (2) from the spoken words of the spoken text (GTI).
  - 6. A correction device (4) as claimed in claim 5, wherein the correction device (4) is designed so as to form part of the speech recognition device (2).
  - 7. A correction device (4) as claimed in claim 1, wherein the search means (17) are designed for ignoring phonemes that differ from the compared phoneme sequences but sound similar in the search for essentially matching phoneme sequences.
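
The adjustable search area of claims 1 and 4 (M words before and N words behind the last marked word, widened step by step when nothing is found) might look roughly like the sketch below. The function name `search_with_expansion`, the default values of `m`, `n` and `step`, and the use of exact tuple equality in place of "essentially matches" are all assumptions made for brevity.

```python
def search_with_expansion(word_phonemes, target, last_marked, m=2, n=1, step=2):
    """Find positions whose phoneme sequence equals `target`.

    The search starts in a window of `m` words before and `n` words behind
    `last_marked` and is widened by `step` on each side until a match is
    found or the whole recognized text has been covered.
    """
    total = len(word_phonemes)
    while True:
        lo = max(0, last_marked - m)
        hi = min(total - 1, last_marked + n)
        hits = [i for i in range(lo, hi + 1) if word_phonemes[i] == target]
        if hits:
            return hits  # position information for the marked word(s)
        if lo == 0 and hi == total - 1:
            return []  # whole text searched, nothing found
        m += step  # step-wise expansion of the search area
        n += step


# Phoneme sequences of the recognized words (hypothetical example data).
text = [
    ("R", "EH", "K"), ("AH",), ("N", "AY", "S"), ("B", "IY", "CH"),
    ("T", "UW"), ("AY",), ("W", "AA", "N", "T"),
]
print(search_with_expansion(text, ("AY",), last_marked=6))
print(search_with_expansion(text, ("R", "EH", "K"), last_marked=6))
```

Starting near the last marked word reflects the likely case that the user types the correction shortly after hearing the misrecognized word; the expansion only pays its cost when that assumption fails.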

8. A correction method for correcting a text (ETI) recognized by a speech recognition device (2) for a spoken text (GTI), the recognized text (ETI) for spoken words of the spoken text (GTI) including correctly recognized words and incorrectly recognized words (FETI), the method comprising the following steps:
- receiving at least one manually entered correction word (KWI), so as to replace at least one of the incorrectly recognized words (FETI) with the at least one correction word (KWI);
  
  phonetically transcribing at least the input correction word (KWI) into a phoneme sequence (PT(KWI));
  
  searching for the phoneme sequence of the at least one correction word (KWI) in phoneme sequences (PT(ETI)) of the words of the recognized text (ETI) based on an adjustable search area, wherein a step-wise expansion of the search area is performed when the searching step does not find the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in the phoneme sequences (PT(ETI)) of the words of the recognized text, and issuing position information (PI) which identifies the position of at least one word within the recognized text (ETI) whose phoneme sequence essentially matches the phoneme sequence of the at least one correction word (KWI); and
  
  issuing the position information (PI) so as to enable marking of the at least one word identified by the position information (PI) in the recognized text information (ETI).
- View Dependent Claims (9, 10, 11, 12, 13)
- - 9. A correction method as claimed in claim 8, wherein the following further process step is performed:
  - 10. A correction method as claimed in claim 9, wherein the following further process step is performed:
    - terminating the interruption of the synchronous reproduction when the replacement of the at least one word identified by the position information (PI) with the at least one correction word (KWI) has been confirmed by manual input of a confirmation.
  - 11. A correction method as claimed in claim 9, wherein the following further process step is performed:
    - searching for the phoneme sequence of the at least one correction word (KWI) in the phoneme sequences of the words contained in a search area of the recognized text (ETI), the search area being defined by a number of M words before and N words behind the last marked word in the recognized text (ETI) before the interruption of the synchronous reproduction.
  - 12. A correction method as claimed in claim 8, wherein the following further process step is performed:
    - searching for the phoneme sequence (PT(KWI)) of the at least one correction word (KWI) in phoneme sequences determined by the speech recognition device (2) from the spoken words of the spoken text (GTI).
  - 13. A correction method as claimed in claim 8, wherein the following further process step is performed:
    - searching for essentially matching phoneme sequences, the phonemes that differ from the compared phoneme sequences but sound similar being ignored.
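
Claims 7 and 13 describe ignoring phonemes that differ but sound similar. One way to sketch this is to map each phoneme to a representative of its "sounds alike" class before comparing. The `SIMILAR` table below (voiced consonants folded onto their unvoiced counterparts) and the names `normalize` and `essentially_matches` are hypothetical; the claims do not say which phonemes count as similar.

```python
# Hypothetical equivalence table: each voiced consonant is folded onto its
# unvoiced counterpart so that the pair compares as equal.
SIMILAR = {"B": "P", "D": "T", "G": "K", "Z": "S", "V": "F"}


def normalize(phonemes):
    """Replace every phoneme by the representative of its similarity class."""
    return tuple(SIMILAR.get(p, p) for p in phonemes)


def essentially_matches(seq_a, seq_b):
    """True if the sequences are equal once similar-sounding phonemes are merged."""
    return normalize(seq_a) == normalize(seq_b)


print(essentially_matches(["B", "IY", "CH"], ["P", "IY", "CH"]))        # beach vs. peach
print(essentially_matches(["B", "IY", "CH"], ["S", "P", "IY", "CH"]))   # beach vs. speech
```

This variant tolerates substitutions within a class but not insertions or deletions, so "beach" and "peach" match while "beach" and "speech" do not; a graded similarity score would be needed to catch the latter.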

Current Assignee: Nuance Communications Austria Gmbh (Microsoft Corporation)
Original Assignee: Koninklijke Philips Electronics N.V. (Koninklijke Philips N.V.)
Inventors: Gschwendtner, Wolfgang
Primary Examiner(s): Dorvil, Richemond
Assistant Examiner(s): Ham, Qi
Application Number: US10/242,930
Publication Number: US 20030061043A1
Time in Patent Office: 606 Days
Field of Search: 704/231, 704/251, 704/235, 704/254, 704/531, 704/260, 704/270
US Class Current: 704/254
CPC Class Codes:
G10L 15/08   Speech classification or se...
G10L 2015/025   Phonemes, fenemes or fenone...
G10L 2015/221   Announcement of recognition...
G10L 2015/225   Feedback of the input speech
