Speech and language translation of an utterance
First Claim
1. A method comprising:
- performing, by computer processing hardware, operations of;
receiving an utterance spoken in a first language;
partitioning a spoken sentence in the utterance into multiple segments, a given segment of the multiple segments including multiple words spoken in the first language;
converting the given segment of the multiple segments into multiple candidate textual phrases in a second language, further comprising;
performing a speech-to-text translation of the given segment into a set of candidate textual expressions in the first language by translating the given segment into at least a first candidate textual expression and a second candidate textual expression in the first language; and
wherein performing the language translation includes;
identifying that the first candidate textual expression translates into a first candidate textual phrase and a second candidate textual phrase; and
identifying that the second candidate textual expression translates into a third candidate textual phrase and a fourth candidate textual phrase, the first candidate textual phrase being identical to the third candidate textual phrase; and
for each respective candidate textual expression in the set;
performing a language translation of the respective candidate textual expression into multiple candidate textual phrases in the second language;
producing a confidence metric for each respective candidate textual phrase of the multiple candidate textual phrases in the second language, the confidence metric indicating a confidence that the respective candidate textual phrase is an accurate translation of the given segment of the utterance into the second language;
producing a confidence value for each of the candidate textual expressions in the first language;
producing a confidence value for each of the candidate textual phrases in the second language; and
generating a confidence metric for the first candidate textual phrase based on a sum of a first term and a second term, the first term being a product of a confidence value for the first candidate textual expression multiplied by a confidence value for the first candidate textual phrase, the second term being a product of a confidence value for the second candidate textual expression multiplied by a confidence value for the third candidate textual phrase.
2 Assignments
0 Petitions
Accused Products
Abstract
According to example configurations, a speech-processing system parses an uttered sentence into segments. The speech-processing system translates each of the segments in the uttered sentence into candidate textual expressions (i.e., phrases of one or more words) in a first language. The uttered sentence can include multiple phrases or candidate textual expressions. Additionally, the speech-processing system translates each of the candidate textual expressions into candidate textual phrases in a second language. Based at least in part on a product of confidence values associated with the candidate textual expressions in the first language and confidence values associated with the candidate textual phrases in the second language, the speech-processing system produces a confidence metric for each of the candidate textual phrases in the second language. The confidence metric can indicate degree to which the candidate textual phrase in the second language is an accurate translation of a respective segment in the utterance.
-
Citations
8 Claims
-
1. A method comprising:
performing, by computer processing hardware, operations of; receiving an utterance spoken in a first language; partitioning a spoken sentence in the utterance into multiple segments, a given segment of the multiple segments including multiple words spoken in the first language; converting the given segment of the multiple segments into multiple candidate textual phrases in a second language, further comprising; performing a speech-to-text translation of the given segment into a set of candidate textual expressions in the first language by translating the given segment into at least a first candidate textual expression and a second candidate textual expression in the first language; and wherein performing the language translation includes; identifying that the first candidate textual expression translates into a first candidate textual phrase and a second candidate textual phrase; and identifying that the second candidate textual expression translates into a third candidate textual phrase and a fourth candidate textual phrase, the first candidate textual phrase being identical to the third candidate textual phrase; and for each respective candidate textual expression in the set; performing a language translation of the respective candidate textual expression into multiple candidate textual phrases in the second language; producing a confidence metric for each respective candidate textual phrase of the multiple candidate textual phrases in the second language, the confidence metric indicating a confidence that the respective candidate textual phrase is an accurate translation of the given segment of the utterance into the second language; producing a confidence value for each of the candidate textual expressions in the first language; producing a confidence value for each of the candidate textual phrases in the second language; and generating a confidence metric for the first candidate textual phrase based on a sum of a first term and a second term, the first term being a product of a confidence value for the first candidate textual expression multiplied by a confidence value for the first candidate textual phrase, the second term being a product of a confidence value for the second candidate textual expression multiplied by a confidence value for the third candidate textual phrase. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. A method comprising:
-
performing, by computer processing hardware, operations of; parsing an uttered sentence into segments; translating each of the segments into candidate textual expressions in a first language; translating each of the candidate textual expressions into candidate textual phrases in a second language; and producing, based at least in part on a product of confidence values associated with the candidate textual expressions in the first language and confidence values associated with the candidate textual phrases in the second language, a confidence metric for each of the candidate textual phrases in the second language, producing the confidence metric including; executing separate translation paths in which a given segment of the utterance translates into a common candidate textual phrase in the second language, the separate translation paths including a first translation path and a second translation path; the first translation path including;
a translation of the given segment of the utterance into a first candidate textual expression in the first language and a subsequent translation of the first candidate textual expression in the first language to the common candidate textual phrase in the second language; andthe second translation path including;
a translation of the given segment of the utterance into a second candidate textual expression in the first language and a subsequent translation of the second candidate textual expression in the first language to the common candidate textual phrase in the second language. - View Dependent Claims (8)
-
Specification