Audio renderings for expressing non-audio nuances
First Claim
1. A method of enhancing audio renderings of non-audio data sources, comprising:
- detecting a nuance of a non-audio data source;
locating an audio cue corresponding to the detected nuance; and
associating the located audio cue with the detected nuance for playback to a listener, wherein detecting a nuance of a non-audio data source detects a plurality of nuances of the non-audio data source, locating an audio cue locates audio cues for each of the detected nuances, and associating the located audio cue with the detected nuance for playback to a listener associates each of the located audio cues with the respective detected nuance, and further comprising;
creating an audio rendering of the non-audio data source; and
mixing the associated audio cues in with the audio rendering to generate integrated sounds therefrom to the listener.
8 Assignments
0 Petitions
Accused Products
Abstract
Methods, systems, computer program products, and methods of doing business by adapting audio renderings of non-audio messages (for example, e-mail messages that are processed by a text-to-speech translator) to reflect various nuances of the non-audio information. Audio cues are provided for this purpose, which are sounds that are “mixed” in with the audio rendering as a separate (background) audio stream. Audio cues may reflect information such as the topical structure of a text file, or changes in paragraphs. Or, audio cues may be used to signal nuances such as changes in the color or font of the source text. Audio cues may also be advantageously used to reflect information about the translation process with which the audio rendering of a text file was created, such as using varying background tones to convey the degree of certainty in the accuracy of translating text to audio using a text-to-speech translation system, or of translating audio to text using a voice recognition system, or of translating between languages, and so forth. Stylesheets, such as those encoded in the Extensible Stylesheet Language (“XSL”), may optionally be used to customize the audio cues. For example, a user-specific stylesheet customization may be performed to override system-wide default audio cues for a particular user, enabling her to hear a different background sound for messages on a particular topic than other users will hear.
-
Citations
55 Claims
-
1. A method of enhancing audio renderings of non-audio data sources, comprising:
-
detecting a nuance of a non-audio data source; locating an audio cue corresponding to the detected nuance; and associating the located audio cue with the detected nuance for playback to a listener, wherein detecting a nuance of a non-audio data source detects a plurality of nuances of the non-audio data source, locating an audio cue locates audio cues for each of the detected nuances, and associating the located audio cue with the detected nuance for playback to a listener associates each of the located audio cues with the respective detected nuance, and further comprising; creating an audio rendering of the non-audio data source; and mixing the associated audio cues in with the audio rendering to generate integrated sounds therefrom to the listener. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. A system for enhancing audio renderings of non-audio data sources, comprising:
-
means for detecting one or more nuances of a non-audio data source; means for locating an audio cue corresponding to each of the detected nuances; means for associating the located audio cues with their respective detected nuances for playback to a listener; means for creating an audio rendering of the non-audio data source, wherein the non-audio segment is associated with the nuance; and means for mixing the associated audio cues in with the audio rendering to generate integrated sounds therefrom to the listener. - View Dependent Claims (22, 23, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37)
-
-
38. A computer program product for enhancing audio renderings of non-audio data sources, the computer program product embodied on one or more computer-readable media and comprising:
-
computer-readable program code that is configured to detect one or more nuances of a non-audio data source; computer-readable program code that is configured to locate an audio cue corresponding to each of the detected nuances; computer-readable program code that is configured to associate the located audio cues with their respective detected nuances for playback to a listener; computer-readable program code that is configured to create an audio rendering of a non-audio segment of the non-audio data source, wherein the non-audio segment is associated with the nuance; and computer-readable program code that is configured to mix the associated audio cue with the audio rendering of the segment to generate integrated sounds therefrom to the listener. - View Dependent Claims (24, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55)
-
Specification