INTERACTIVE SPEECH RECOGNITION
First Claim
1. A computer program product tangibly embodied on a computer-readable storage medium and including executable code that causes at least one data processing apparatus to:
- obtain audio data associated with a first utterance;
obtain, via a device processor, a text result associated with a first speech-to-text translation of the first utterance based on an audio signal analysis associated with the audio data, the text result including a plurality of selectable text alternatives corresponding to at least one word;
initiate a display of at least a portion of the text result that includes a first one of the text alternatives; and
receive a selection indication indicating a second one of the text alternatives.
2 Assignments
0 Petitions
Accused Products
Abstract
A first plurality of audio features associated with a first utterance may be obtained. A first text result associated with a first speech-to-text translation of the first utterance may be obtained based on an audio signal analysis associated with the audio features, the first text result including at least one first word. A first set of audio features correlated with at least a first portion of the first speech-to-text translation associated with the at least one first word may be obtained. A display of at least a portion of the first text result that includes the at least one first word may be initiated. A selection indication may be received, indicating an error in the first speech-to-text translation, the error associated with the at least one first word.
21 Citations
20 Claims
-
1. A computer program product tangibly embodied on a computer-readable storage medium and including executable code that causes at least one data processing apparatus to:
-
obtain audio data associated with a first utterance; obtain, via a device processor, a text result associated with a first speech-to-text translation of the first utterance based on an audio signal analysis associated with the audio data, the text result including a plurality of selectable text alternatives corresponding to at least one word; initiate a display of at least a portion of the text result that includes a first one of the text alternatives; and receive a selection indication indicating a second one of the text alternatives. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method comprising:
-
obtaining a first plurality of audio features associated with a first utterance; obtaining, via a device processor, a first text result associated with a first speech-to-text translation of the first utterance based on an audio signal analysis associated with the audio features, the first text result including at least one first word; obtaining a first set of audio features correlated with at least a first portion of the first speech-to-text translation associated with the at least one first word; initiating a display of at least a portion of the first text result that includes the at least one first word; and receiving a selection indication indicating an error in the first speech-to-text translation, the error associated with the at least one first word. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. A system comprising:
-
an input acquisition component that obtains a first plurality of audio features associated with a first utterance; a speech-to-text component that obtains, via a device processor, a first text result associated with a first speech-to-text translation of the first utterance based on an audio signal analysis associated with the audio features, the first text result including at least one first word; a clip correlation component that obtains a first correlated portion of the first plurality of audio features associated with the first speech-to-text translation to the at least one first word; a result delivery component that initiates an output of the first text result and the first correlated portion of the first plurality of audio features; and a correction request acquisition component that obtains a correction request that includes an indication that the at least one first word is a first speech-to-text translation error, and the first correlated portion of the first plurality of audio features. Docket No. 333249.01 - View Dependent Claims (17, 18, 19, 20)
-
Specification