Spoken translation system using meta information strings
First Claim
Patent Images
1. A system, comprising:
- a speech receiving part, receiving a segment of speech signal in a source language to be processed;
a computer part, operating to process the segment of speech signal comprising;
processing in a first information channel the segment of speech signal in the source language using a statistical machine translation training, comprising;
recognizing speech in the processed segment of speech signal in the source language,converting the recognized speech into text in the source language, andconverting the text in the source language into a lattice in a target language;
processing in a second information channel the segment of speech signal in the source language using an information transfer training, the second information channel independent and separate from the first information channel, the processing in the second information channel comprising;
extracting, from the segment of speech signal, meta information associated with the recognized speech, wherein the meta information includes at least one non-textual aspect of the recognized speech,obtaining descriptors in the source language from the meta information that includes at least one non-textual aspect, andtransforming the obtained descriptors in the source language into descriptors in the target language; and
an output part producing an output in the target language comprising combining the lattice in the target language and the obtained descriptors in the second language using lattice rescoring.
1 Assignment
0 Petitions
Accused Products
Abstract
Spoken translation system which detects both speech from the information and also detects meta information streams from the information. A first aspect produces an enriched training corpus of information for use in the machine translation. A second aspect uses two different extraction techniques, and combines them by lattice rescoring.
74 Citations
15 Claims
-
1. A system, comprising:
-
a speech receiving part, receiving a segment of speech signal in a source language to be processed; a computer part, operating to process the segment of speech signal comprising; processing in a first information channel the segment of speech signal in the source language using a statistical machine translation training, comprising; recognizing speech in the processed segment of speech signal in the source language, converting the recognized speech into text in the source language, and converting the text in the source language into a lattice in a target language; processing in a second information channel the segment of speech signal in the source language using an information transfer training, the second information channel independent and separate from the first information channel, the processing in the second information channel comprising; extracting, from the segment of speech signal, meta information associated with the recognized speech, wherein the meta information includes at least one non-textual aspect of the recognized speech, obtaining descriptors in the source language from the meta information that includes at least one non-textual aspect, and transforming the obtained descriptors in the source language into descriptors in the target language; and an output part producing an output in the target language comprising combining the lattice in the target language and the obtained descriptors in the second language using lattice rescoring. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer-implemented method, comprising:
-
processing in a first information channel, at a computer comprising a processor, a segment of speech signal in a source language using a statistical machine translation training, the processing in the first information channel comprising; recognizing speech in the processed segment of speech signal in the source language, converting the recognized speech into text in the source language, and converting the text in the source language into a lattice in a target language; processing, at the computer, the segment of speech signal in the source language using an information transfer training in a second information channel independent and separate from the first information channel, the processing in the second information channel comprising; extracting, from the segment of speech signal, meta information associated with the recognized speech, wherein the meta information includes at least one non-textual aspect of the recognized speech, obtaining descriptors in the source language from the meta information that includes at least one non-textual aspect, and transforming the obtained descriptors in the source language into descriptors in the target language; and generating an output in the target language comprising combining the lattice in the target language and the descriptors in the target language using a lattice rescoring system. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
Specification