×

Pronunciation accuracy in speech recognition

  • US 9,384,730 B2
  • Filed: 04/14/2014
  • Issued: 07/05/2016
  • Est. Priority Date: 05/30/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method for improving reading accuracy in speech recognition using processing by a computer, the method comprising computer-executed steps of:

  • obtaining a plurality of candidate word strings from speech recognition results, wherein the speech recognition results contain a speech recognition score for each of the plurality of candidate word strings;

    determining a reading of each of the plurality of candidate word strings, wherein two or more candidate word strings have the same reading, and wherein the two or more candidate word strings having the same reading are homophones;

    determining a reading score for each candidate word string, wherein the reading score for each of the two or more candidate word strings with the same reading is based on a total value of the speech recognition scores for the two or more candidate word strings with the same reading, and wherein determining the total value of the speech recognition scores for the two or more candidate word strings with the same reading includes computer-executed steps of;

    determining two or more candidate word strings with partial tolerable different readings to be treated as having the same reading; and

    calculating the total value of the speech recognition scores for the two or more candidate word strings with the same reading includes speech recognition scores for the two or more candidate word strings with partial tolerable different readings to be treated as having the same reading; and

    providing a conversion table containing word strings with partial tolerable different readings to be treated as having the same reading, andwherein determining the two or more candidate word strings with partial tolerable different readings to be treated as having the same reading is based on the conversion table; and

    selecting a candidate among the plurality of candidate word strings to output on the basis of the reading score and the speech recognition score corresponding to each word string, wherein selecting the candidate includes a computer-executed step selected from the group consisting of(a) weighting and adding together the speech recognition score and the corresponding reading score for each candidate word string to obtain a new score, and selecting the candidate word string with the highest new score; and

    (b) selecting a candidate word string with the highest speech recognition score from among the one or more candidate word strings with the highest reading score.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×