Translation and capture architecture for output of conversational utterances
First Claim
1. A system that facilitates data translation, comprising:
- a processor; and
memory coupled to the processor;
an input component stored in the memory and executable on the processor that processes input data received from a plurality of sources that represent context and content, wherein the input component processes the input data to determine which linguistic language is represented by the context and the content, andextracts text from the input data to generate query terms and employs the query terms with a search engine to determine a first linguistic language;
wherein the context is established based on at least one of video, image data or scanned document indicia, and the determined linguistic language comprises the first linguistic language;
a translation component stored in the memory and executable on the processor that translates the processed input data into a translated output in a second linguistic language which includes at least one of text or audible signals for perception by a recipient, the first linguistic language being different from the second linguistic language; and
a feedback component stored in the memory and executable on the processor that receives, in response to the translated output, feedback in the second linguistic language from the recipient relating to accuracy of the translation, wherein the feedback is employed as additional input data to provide a new translated output in the second linguistic language, and wherein the feedback is employed as additional input data for establishing the context.
2 Assignments
0 Petitions
Accused Products
Abstract
Architecture that combines capture and translation of concepts, goals, needs, locations, objects, locations, and items (e.g., sign text) into complete conversational utterances that take a translation of the item, and morph it with fluidity into sets of sentences that can be echoed to a user, and that the user can select to communicate speech (or textual utterances). A plurality of modalities that process images, audio, video, searches and cultural context, for example, which are representative of at least context and/or content, and can be employed to glean additional information regarding a communications exchange to facilitate more accurate and efficient translation. Gesture recognition can be utilized to enhance input recognition, urgency, and/or emotional interaction, for example. Speech can be used for document annotation. Moreover, translation (e.g., speech to speech, text to speech, speech to text, handwriting to speech, text or audio, . . . ) can be significantly improved in combination with this architecture.
-
Citations
17 Claims
-
1. A system that facilitates data translation, comprising:
-
a processor; and memory coupled to the processor; an input component stored in the memory and executable on the processor that processes input data received from a plurality of sources that represent context and content, wherein the input component processes the input data to determine which linguistic language is represented by the context and the content, and extracts text from the input data to generate query terms and employs the query terms with a search engine to determine a first linguistic language; wherein the context is established based on at least one of video, image data or scanned document indicia, and the determined linguistic language comprises the first linguistic language; a translation component stored in the memory and executable on the processor that translates the processed input data into a translated output in a second linguistic language which includes at least one of text or audible signals for perception by a recipient, the first linguistic language being different from the second linguistic language; and a feedback component stored in the memory and executable on the processor that receives, in response to the translated output, feedback in the second linguistic language from the recipient relating to accuracy of the translation, wherein the feedback is employed as additional input data to provide a new translated output in the second linguistic language, and wherein the feedback is employed as additional input data for establishing the context. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A computer-implemented method of translating data, comprising:
-
receiving one or more input data from one or more sensing sources, wherein the one or more sensing sources comprise at least one of audio, video, global positioning, or image sensing sources; generating context data of at least one of the one or more input data, the generating context data including; extracting text from at least one of the one or more input data to generate query terms and employing the query terms with a search engine to determine a first linguistic language; translating one or more results from the search engine into a translated output in a second linguistic language; presenting the translated output to a recipient in the second linguistic language that is understandable by the recipient; receiving a user feedback in the second linguistic language from the recipient, wherein the user feedback includes an indication that the translation is successful or unsuccessful; establishing the context of the at least one of the one or more input data based on the user feedback; and employing the established context as an additional input for translating the content. - View Dependent Claims (11, 12, 13, 14, 15)
-
-
16. A method comprising:
-
under control of one or more processors configured with executable instructions; receiving one or more inputs from one or more sensing sources, the one or more inputs comprising image data and/or video data, the image data and/or video comprise a gesture; determining context of at least one of the one or more inputs;
the determining comprising using the gesture to identify or narrow information to determine the context;extracting text from the one or more inputs to generate query terms and employing the query terms with a search engine to determine a first linguistic language; translating an input of the one or more inputs into a translated output in a second linguistic language based at least upon the context of the at least one of the one or more inputs; receiving a user feedback of the translated output; and producing a new translated output based at least upon the user feedback. - View Dependent Claims (17)
-
Specification