Spoken Translation System Using Meta Information Strings
Abstract
A spoken translation system that detects both speech and meta information streams in the input information. A first aspect produces an enriched training corpus for use in machine translation. A second aspect uses two different extraction techniques and combines them by lattice rescoring.
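As a rough illustration of how the outputs of two extraction techniques might be combined by rescoring, the sketch below interpolates two log-scores over a flat list of candidate hypotheses. This is a simplification: it operates on an n-best list rather than a true lattice, and the names `Hypothesis`, `score_a`, `score_b`, `rescore`, and `weight_a` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    text: str
    score_a: float  # log-score from a first (hypothetical) extraction technique
    score_b: float  # log-score from a second (hypothetical) extraction technique


def rescore(hyps: List[Hypothesis], weight_a: float = 0.5) -> List[Hypothesis]:
    """Return hypotheses sorted best-first by an interpolated log-score.

    Assumption: a linear interpolation of the two log-scores stands in for
    rescoring; the patent abstract does not specify the combination rule.
    """
    if not 0.0 <= weight_a <= 1.0:
        raise ValueError("weight_a must lie in [0, 1]")

    def combined(h: Hypothesis) -> float:
        return weight_a * h.score_a + (1.0 - weight_a) * h.score_b

    return sorted(hyps, key=combined, reverse=True)


candidates = [
    Hypothesis("first candidate", score_a=-1.0, score_b=-5.0),
    Hypothesis("second candidate", score_a=-2.0, score_b=-2.0),
]
best = rescore(candidates)[0]
```

With equal weights, the second candidate ranks first because its combined score (-2.0) exceeds that of the first (-3.0); setting `weight_a=1.0` would rank by the first technique alone.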
63 Citations
23 Claims
1. A method, comprising:
processing a segment of speech to be recognized to recognize speech therein, and also to recognize meta information associated with the recognized speech, wherein the meta information includes at least one non-textual aspect of the recognized speech; and
producing an output that represents both the text recognized by said processing, and the at least one non-textual aspect.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system, comprising:
a speech receiving part, receiving speech to be processed; and
a computer part, operating to process a segment of speech to be recognized and to recognize speech therein, and also to recognize meta information associated with the recognized speech, wherein the meta information includes at least one non-textual aspect of the recognized speech, and producing an output indicative of the recognized speech and the meta information; and
an output part, receiving said output from said computer part, and producing an output that represents both the text recognized by said processing, and the at least one non-textual aspect.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18, 19, 20
21. A method, comprising:
processing an audio version indicative of a segment of speech to be recognized, to recognize speech therein, and also to recognize additional information associated with the recognized speech, wherein the additional information includes at least one of keywords, prominence information, and/or emotional information; and
producing an output that represents both the text recognized by said processing, and the additional information.
Dependent claims: 22, 23
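To make the idea of an output that carries both recognized text and additional information more concrete, here is a minimal sketch of one possible representation. It assumes a simple record with fields for keywords, per-word prominence, and an emotion label; the class name `EnrichedResult`, its field names, and the JSON serialization are illustrative assumptions, not the format described in the patent.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, List, Optional


@dataclass
class EnrichedResult:
    """Hypothetical container pairing recognized text with meta information."""
    text: str
    keywords: List[str] = field(default_factory=list)
    # Assumed convention: word -> prominence value in [0, 1]
    prominence: Dict[str, float] = field(default_factory=dict)
    emotion: Optional[str] = None

    def to_json(self) -> str:
        # Serialize the text and the meta information together as one output
        return json.dumps(asdict(self))


result = EnrichedResult(
    text="please call the doctor now",
    keywords=["doctor"],
    prominence={"now": 0.9},
    emotion="urgent",
)
decoded = json.loads(result.to_json())
```

Keeping the text and the meta information in a single record means a downstream consumer, such as a translation stage, receives both together rather than having to realign two separate streams.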
Specification