Text oriented, user-friendly editing of a voicemail message
First Claim
1. A computer-implemented method for a unified messaging system (UMS) running on one or more network nodes comprising:
- creating a Multi-Layered Voicemail (MLVM) message in a UMS mailbox of a recipient, the MLVM message being left by a caller from an endpoint, the MLVM message including an audio layer comprising audio data corresponding to speech of the caller, a text layer comprising text data generated from a translation of the audio data, and a mapping layer that maps each word in the text layer to a corresponding segment of audio data in the audio layer;
receiving input from a graphical user interface (GUI) running on the endpoint, the input comprising edits made by the caller via the GUI to one or more words of the text layer;
changing one or more segments of audio data in the audio layer in correspondence with the edits to the one or more words of the text layer, wherein the changing comprises;
in response to receiving the edits to the one or more words of the text layer;
automatically searching one or more databases for a user'"'"'s previous recording of one or more sound patterns that correspond to the edits to the one or more words of the text layer;
generating one or more new segments of audio data based on the one or more sound patterns and the edits to the one or more words of the text layer; and
inserting the one or more new segments of audio data at a position in the audio layer corresponding to a location where the edits to the one or more words of the text layer were added to the text layer;
storing the MLVM message with the changed one or more segments of audio data in the UMS mailbox of the recipient.
1 Assignment
0 Petitions
Accused Products
Abstract
A system in one embodiment includes a server associated with a unified messaging system (UMS). The server records speech of a user as an audio data file, translates the audio data file into a text data file, and maps each word within the text data file to a corresponding segment of audio data in the audio data file. A graphical user interface (GUI) of a message editor running on an endpoint associated with the user displays the text data file on the endpoint and allows the user to identify a portion of the text data file for replacement. The server being further operable to record new speech of the user as new audio data and to replace one or more segments of the audio data file corresponding to the portion of the text with the new audio data.
-
Citations
7 Claims
-
1. A computer-implemented method for a unified messaging system (UMS) running on one or more network nodes comprising:
-
creating a Multi-Layered Voicemail (MLVM) message in a UMS mailbox of a recipient, the MLVM message being left by a caller from an endpoint, the MLVM message including an audio layer comprising audio data corresponding to speech of the caller, a text layer comprising text data generated from a translation of the audio data, and a mapping layer that maps each word in the text layer to a corresponding segment of audio data in the audio layer; receiving input from a graphical user interface (GUI) running on the endpoint, the input comprising edits made by the caller via the GUI to one or more words of the text layer; changing one or more segments of audio data in the audio layer in correspondence with the edits to the one or more words of the text layer, wherein the changing comprises; in response to receiving the edits to the one or more words of the text layer; automatically searching one or more databases for a user'"'"'s previous recording of one or more sound patterns that correspond to the edits to the one or more words of the text layer; generating one or more new segments of audio data based on the one or more sound patterns and the edits to the one or more words of the text layer; and inserting the one or more new segments of audio data at a position in the audio layer corresponding to a location where the edits to the one or more words of the text layer were added to the text layer; storing the MLVM message with the changed one or more segments of audio data in the UMS mailbox of the recipient.
-
-
2. A non-transitory computer-readable storage medium encoded with a computer program, when executed, the computer program operable to:
-
record speech of a caller as an audio data file, the speech comprising a voicemail message for a recipient; translating the speech into a text file for display on a graphical user interface (GUI) running on an endpoint associated with the caller; map each word of the text file to a corresponding segment of audio data of the recorded speech; receive input from the GUI that identifies a location within the text file and adds additional text to the text file at the location; in response to receiving the input; automatically search one or more databases for a user'"'"'s previous recording of one or more sound patterns that correspond to the additional text; generate one or more new segments of audio data based on the one or more sound patterns and the additional text; and insert the one or more new segments of audio data at a position in the audio data file corresponding to the location where the additional text was added to the text file.
-
-
3. A system comprising:
one or more network nodes running one or more application programs that implement a unified messaging system (UMS), the one of the nodes including; one or more processors; and a memory comprising one or more instructions executable at the processors, the one or more processors being operable, when executing the instructions, to; create a Multi-Layered Voicemail (MLVM) message from speech left by a caller associated with an endpoint, the MLVM message including an audio layer comprising audio data corresponding to the speech, a text layer comprising text data generated from a translation of the audio data, and a mapping layer that maps each word in the text layer to a corresponding segment of audio data in the audio layer; receive input from a graphical user interface (GUI) running on the endpoint, the input comprising one or more edits to the text layer; in response to receiving the input; automatically search one or more databases for a user'"'"'s previous recording of one or more sound patterns that correspond to additional text included in the input; generate one or more new segments of audio data based on the one or more sound patterns and the additional text; and insert the one or more new segments of audio data at a position in the audio layer corresponding to a location where the additional text was added to the text data file; and store the MLVM message with the one or more segments of audio data in the UMS mailbox of the recipient.
-
4. A computer-implemented method for a unified messaging system (UMS) comprising:
-
recording, by the UMS, speech of a user as an audio data file, the speech comprising a voicemail message intended for a recipient; translating, by the UMS, the audio data file into a text data file; mapping, by the UMS, each word within the text data file to a corresponding segment of audio data in the audio data file; editing one or more segments of the audio data file to produce an edited voicemail message, the editing being based on input commands received from the user, the input commands including operations on one or more words in the text data file which correspond to the one or more segments in the audio data file, the editing comprising replacing a segment in the audio data file corresponding to a selected word in the text data file with a new segment which comprises new speech of the user; and sending the edited voicemail message to the recipient; and wherein the editing comprises; receiving typed input from the user which adds additional text to the text data file; and in response to receiving the typed input from the user; automatically searching one or more databases for a user'"'"'s previous recording of one or more sound patterns that correspond to the additional text; generating one or more new segments of audio data based on the one or more sound patterns and the additional text; and inserting the one or more new segments of audio data at a position in the audio data file corresponding to a location where the additional text was added to the text data file. - View Dependent Claims (5, 6, 7)
-
Specification