Pronunciation accuracy in speech recognition
First Claim
1. A method for improving reading accuracy in speech recognition using processing by a computer, the method comprising computer-executed steps of:
program instructions to obtain a plurality of candidate word strings from speech recognition results, wherein the speech recognition results contain a speech recognition score for each of the plurality of candidate word strings;
program instructions to determine a reading of each of the plurality of candidate word strings, wherein two or more candidate word strings have the same reading, and wherein the two or more candidate word strings having the same reading are homophones;
program instructions to determine a reading score for each candidate word string, wherein the reading score for each of the two or more candidate word strings with the same reading is based on a total value of the speech recognition scores for the two or more candidate word strings with the same reading, wherein determining the total value of the speech recognition scores for the two or more candidate word strings with the same reading includes computer-executed steps of:
program instructions to determine two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, partial tolerable different readings being a first predetermined value, the partial difference in readings range being a predetermined range of values; and
program instructions to calculate the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading wherein the speech recognition scores for the two or more candidate word strings with partial tolerable different readings are to be treated as having the same reading, wherein calculating the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading includes:
program instructions to receive a conversion table, wherein the conversion table includes word strings, wherein the word strings include word notations and phoneme strings, and wherein the conversion table contains word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, and wherein determining the two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading is based on the conversion table;
program instructions to receive N-best lists from a plurality of speech recognition systems, wherein the N-best lists contain the two or more candidate word strings;
program instructions to determine a match between the two or more candidate word strings within the N-best lists and the conversion table; and
program instructions to convert the matched ones of the two or more candidate word strings according to the conversion table; and
program instructions to select a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string, wherein program instructions to select the candidate includes a computer system-executed step selected from the group consisting of:
program instructions to weight and add together the speech recognition score and the corresponding reading score for each candidate word string to obtain a new score, and program instructions to select the candidate word string with the highest new score; and
program instructions to select a candidate word string with the highest speech recognition score from among the one or more candidate word strings with the highest reading score.
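The conversion-table step of the claim can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the table contents, the phoneme strings, the function names `normalize_reading` and `merge_nbest`, and the choice to represent the table as a plain mapping from a reading variant to a canonical reading are all hypothetical. The sketch shows how readings that differ only within a tolerated range could be converted to one reading before the speech recognition scores from several N-best lists are totalled.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (word notation, phoneme string, speech recognition score)
Candidate = Tuple[str, str, float]

# Hypothetical conversion table: a reading variant mapped to the reading
# it is treated as being the same as. Entries are illustrative only.
CONVERSION_TABLE: Dict[str, str] = {
    "k o o h i i": "k o: h i:",
}


def normalize_reading(reading: str, table: Dict[str, str]) -> str:
    """Return the converted reading if the table has a match, else the reading itself."""
    return table.get(reading, reading)


def merge_nbest(nbest_lists: List[List[Candidate]],
                table: Dict[str, str]) -> Dict[str, float]:
    """Total the recognition scores per reading across all N-best lists,
    after converting tolerated variants to a common reading."""
    totals: Dict[str, float] = defaultdict(float)
    for nbest in nbest_lists:
        for _notation, reading, score in nbest:
            totals[normalize_reading(reading, table)] += score
    return dict(totals)


# Two N-best lists, as if from two recognizers (made-up data).
system_a = [("coffee", "k o: h i:", 0.5), ("kohi", "k o o h i i", 0.25)]
system_b = [("coffee", "k o: h i:", 0.125), ("copy", "k o p i:", 0.0625)]
totals = merge_nbest([system_a, system_b], CONVERSION_TABLE)
# totals["k o: h i:"] is 0.875: the variant "k o o h i i" was folded in.
```

A plain dictionary lookup is the simplest way to model the match-and-convert step; a real table keyed on word notations as well as phoneme strings, as the claim describes, would need a richer key.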
Abstract
A reading accuracy-improving system includes: a reading conversion unit for retrieving a plurality of candidate word strings from speech recognition results to determine the reading of each candidate word string; a reading score calculating unit for determining the speech recognition score for each of one or more candidate word strings with the same reading to determine a reading score; and a candidate word string selection unit for selecting a candidate to output from the plurality of candidate word strings on the basis of the reading score and speech recognition score corresponding to each candidate word string.
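The three units described in the abstract can be sketched in a few lines of Python. This is a hedged illustration only: the function names, the sample words and readings, and the weight value of 0.5 are assumptions made for the example, and the source does not specify how the weights are chosen. The sketch totals the speech recognition scores of candidates that share a reading, then applies the two selection options named in the claims.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# (candidate word string, reading, speech recognition score)
Candidate = Tuple[str, str, float]


def reading_scores(candidates: List[Candidate]) -> Dict[str, float]:
    """Reading score: total recognition score of all candidates sharing a reading."""
    totals: Dict[str, float] = defaultdict(float)
    for _word, reading, score in candidates:
        totals[reading] += score
    return dict(totals)


def select_weighted(candidates: List[Candidate], weight: float = 0.5) -> str:
    """Option 1: weighted sum of recognition score and reading score.
    The weight is an assumed parameter, not a value from the source."""
    totals = reading_scores(candidates)
    best = max(candidates,
               key=lambda c: weight * c[2] + (1.0 - weight) * totals[c[1]])
    return best[0]


def select_by_reading_first(candidates: List[Candidate]) -> str:
    """Option 2: among candidates with the highest reading score,
    pick the one with the highest recognition score."""
    totals = reading_scores(candidates)
    top = max(totals.values())
    pool = [c for c in candidates if totals[c[1]] == top]
    return max(pool, key=lambda c: c[2])[0]


# Made-up N-best list: "light" has the best single score, but the three
# homophones reading "R AY T" together outweigh it.
nbest = [
    ("light", "L AY T", 0.375),
    ("right", "R AY T", 0.3125),
    ("write", "R AY T", 0.1875),
    ("rite", "R AY T", 0.125),
]
print(select_weighted(nbest))          # prints "right"
print(select_by_reading_first(nbest))  # prints "right"
```

In this example a selection based on the recognition score alone would output "light"; pooling the scores of the homophones shifts the choice to the reading "R AY T", which is the effect the abstract describes.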
16 Citations
12 Claims
1. A method for improving reading accuracy in speech recognition using processing by a computer (set out in full under First Claim above). Dependent claims: 2, 3, 4
5. A computer program product for improving reading accuracy in speech recognition, the computer program product comprising:
one or more non-transitory computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising:
program instructions to obtain a plurality of candidate word strings from speech recognition results, wherein the speech recognition results contain a speech recognition score for each of the plurality of candidate word strings;
program instructions to determine a reading of each of the plurality of candidate word strings, wherein two or more candidate word strings have the same reading, and wherein the two or more candidate word strings having the same reading are homophones;
program instructions to determine a reading score for each candidate word string, wherein the reading score for each of the two or more candidate word strings with the same reading is based on a total value of the speech recognition scores for the two or more candidate word strings with the same reading, wherein determining the total value of the speech recognition scores for the two or more candidate word strings with the same reading includes computer-executed steps of:
program instructions to determine two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, partial tolerable different readings being a first predetermined value, the partial difference in readings range being a predetermined range of values; and
program instructions to calculate the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading wherein the speech recognition scores for the two or more candidate word strings with partial tolerable different readings are to be treated as having the same reading, wherein calculating the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading includes:
program instructions to receive a conversion table, wherein the conversion table includes word strings, wherein the word strings include word notations and phoneme strings, and wherein the conversion table contains word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, and wherein determining the two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading is based on the conversion table;
program instructions to receive N-best lists from a plurality of speech recognition systems, wherein the N-best lists contain the two or more candidate word strings;
program instructions to determine a match between the two or more candidate word strings within the N-best lists and the conversion table; and
program instructions to convert the matched ones of the two or more candidate word strings according to the conversion table; and
program instructions to select a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string, wherein program instructions to select the candidate includes a computer program product-executed step selected from the group consisting of:
program instructions to weight and add together the speech recognition score and the corresponding reading score for each candidate word string to obtain a new score, and program instructions to select the candidate word string with the highest new score; and
program instructions to select a candidate word string with the highest speech recognition score from among the one or more candidate word strings with the highest reading score.
Dependent claims: 6, 7, 8
9. A computer system for improving reading accuracy in speech recognition, the computer system comprising:
one or more computer processors;
one or more computer-readable storage media;
program instructions stored on the computer-readable storage media for execution by at least one of the one or more processors, the program instructions comprising:
program instructions to obtain a plurality of candidate word strings from speech recognition results, wherein the speech recognition results contain a speech recognition score for each of the plurality of candidate word strings;
program instructions to determine a reading of each of the plurality of candidate word strings, wherein two or more candidate word strings have the same reading, and wherein the two or more candidate word strings having the same reading are homophones;
program instructions to determine a reading score for each candidate word string, wherein the reading score for each of the two or more candidate word strings with the same reading is based on a total value of the speech recognition scores for the two or more candidate word strings with the same reading, wherein determining the total value of the speech recognition scores for the two or more candidate word strings with the same reading includes computer-executed steps of:
program instructions to determine two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, partial tolerable different readings being a first predetermined value, the partial difference in readings range being a predetermined range of values; and
program instructions to calculate the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading wherein the speech recognition scores for the two or more candidate word strings with partial tolerable different readings are to be treated as having the same reading, wherein calculating the reading score total value of the speech recognition scores for the two or more candidate word strings with the same reading includes:
program instructions to receive a conversion table, wherein the conversion table includes word strings, wherein the word strings include word notations and phoneme strings, and wherein the conversion table contains word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading, and wherein determining the two or more candidate word strings with partial tolerable different readings allowing for a partial difference in readings between two or more candidate word strings treated as having the same reading is based on the conversion table;
program instructions to receive N-best lists from a plurality of speech recognition systems, wherein the N-best lists contain the two or more candidate word strings;
program instructions to determine a match between the two or more candidate word strings within the N-best lists and the conversion table; and
program instructions to convert the matched ones of the two or more candidate word strings according to the conversion table; and
program instructions to select a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string, wherein program instructions to select the candidate includes a computer system-executed step selected from the group consisting of:
program instructions to weight and add together the speech recognition score and the corresponding reading score for each candidate word string to obtain a new score, and program instructions to select the candidate word string with the highest new score; and
program instructions to select a candidate word string with the highest speech recognition score from among the one or more candidate word strings with the highest reading score.
Dependent claims: 10, 11, 12
Specification