VISUALIZING AUTOMATIC SPEECH RECOGNITION AND MACHINE
First Claim
1. An automated speech processing method comprising:
- using a speech-to-text (STT) engine for receiving an audio input and for converting the audio input to text data in a source language;
using a machine translation (MT) engine for receiving the text data from the STT engine and for translating the text data to text data in a target language;
using a caption engine for rendering the text data in the target language on a display device; and
applying different visualization schemes to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine.
3 Assignments
0 Petitions
Accused Products
Abstract
An automated speech processing method, system and computer program product are disclosed. In one embodiment, a speech-to-text (STT) engine is used for converting an audio input to text data in a source language, and a machine translation (MT) engine is used for translating this text data to text data in a target language. In this embodiment, the text data in the target language is rendered on a display device, and different visualization schemes are applied to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. In one embodiment, the defined characteristics include a defined confidence value representing the accuracy of the rendered text. For example, this confidence value may be based on both the accuracy of the conversion of the audio input and the accuracy of the translation of the text data to the target language.
30 Citations
20 Claims
-
1. An automated speech processing method comprising:
-
using a speech-to-text (STT) engine for receiving an audio input and for converting the audio input to text data in a source language; using a machine translation (MT) engine for receiving the text data from the STT engine and for translating the text data to text data in a target language; using a caption engine for rendering the text data in the target language on a display device; and applying different visualization schemes to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. An automated speech processing system comprising:
-
a speech-to-text (STT) engine for receiving an audio input and for converting the audio input to text data in a source language; a machine translation (MT) engine for receiving the text data from the STT engine and for translating the text data to text data in a target language; a caption engine for rendering the text data in the target language on a display device, and for applying different visualization schemes to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. - View Dependent Claims (12, 13, 14, 15)
-
-
16. An article of manufacture comprising:
-
at least one tangible computer readable medium having computer readable program code logic to execute machine instructions in one or more processing units for processing speech, said computer readable program code logic, when executing, performing the following; receiving an audio input at a speech-to-text (STT) engine and converting the audio input to text data in a source language; translating the text data, using a machine translation (MT) engine, to text data in a target language; rendering the text data in the target language on a display device; and applying different visualization schemes to different parts of the rendered text data based on defined characteristics of the STT engine and the MT engine. - View Dependent Claims (17, 18, 19, 20)
-
Specification