PRONUNCIATION ACCURACY IN SPEECH RECOGNITION
First Claim
1. A method for improving reading accuracy in speech recognition using processing by a computer, the method comprising computer-executed steps of:
- obtaining a plurality of candidate word strings from speech recognition results;
determining a reading of each of the plurality of candidate word strings;
determining a total value of a speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and
selecting a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string.
1 Assignment
0 Petitions
Accused Products
Abstract
A reading accuracy-improving system includes: a reading conversion unit for retrieving a plurality of candidate word strings from speech recognition results to determine the reading of each candidate word string; a reading score calculating unit for determining the speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and a candidate word string selection unit for selecting a candidate to output from the plurality of candidate word strings on the basis of the reading score and speech recognition score corresponding to each candidate word string.
-
Citations
18 Claims
-
1. A method for improving reading accuracy in speech recognition using processing by a computer, the method comprising computer-executed steps of:
-
obtaining a plurality of candidate word strings from speech recognition results; determining a reading of each of the plurality of candidate word strings; determining a total value of a speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and selecting a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A reading accuracy-improving computer system, comprising:
-
a computer having a processor and a computer-readable storage device; a program embodied on the storage device for execution by the processor, the program having a plurality of program modules, the program modules including; an obtaining module configured to obtain a plurality of candidate word strings from speech recognition results; a first determining module configured to determine a reading of each of the plurality of candidate word strings; a second determining module configured to determine a total value of a speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and a selecting module configured to select a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string. - View Dependent Claims (8, 9, 10, 11, 12)
-
-
13. A reading-accuracy improving non-transitory computer program product, comprising a computer-readable storage medium having program code embodied therewith, the program code executable by a processor of a computer to perform a method comprising:
-
obtaining, by the processor, a plurality of candidate word strings from speech recognition results; determining, by the processor, a reading of each of the plurality of candidate word strings; determining, by the processor, a total value of a speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and selecting, by the processor, a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string. - View Dependent Claims (14, 15, 16, 17)
-
-
18. The non-transitory computer program product of claim 18, wherein selecting a candidate includes selecting, by the processor, all of a plurality of candidate word strings as candidates to be outputted, and rescoring the candidate word strings, by the processor, on the basis of the speech recognition score of each candidate word string and the corresponding reading score.
Specification