Method and apparatus for modifying digital messages containing at least audio
First Claim
1. A computer-implemented method comprising steps of:
obtaining a digital representation of a message having at least audio from a mailbox of a user in a server;
automatically converting, by a processor, speech contained in the audio of the message to corresponding text;
automatically converting, by the processor, the audio of the message to a corresponding spectral diagram, the spectral diagram being a frequency domain representation of the audio of the message;
presenting a unified display on a computing device screen containing both the text and the spectral diagram to the user;
automatically editing the text in response to one or more user inputs in relation to the presentation of the text;
using the spectral diagram to remove sound in the audio that does not correspond to the text;
automatically editing, by the processor, the digital representation of the message, including the audio of the message, in a manner corresponding to the editing of the text;
annotating the text in response to one or more user inputs in relation to the presentation of the text;
annotating the digital representation of the message, including the audio of the message, in a manner corresponding to the annotating of the text; and
storing, in the server, the edited and annotated digital representation of the message.
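The claim describes the spectral diagram only as "a frequency domain representation of the audio of the message" and does not say how it is computed. As an illustration, the following minimal sketch assumes a short-time Fourier transform over Hann-windowed frames, computed with NumPy. The function name `spectrogram`, the frame and hop sizes, and the 1 kHz test tone are all hypothetical choices made for this example, not details taken from the patent.

```python
import numpy as np


def spectrogram(samples, frame_len=256, hop=128):
    """Return a (frames, bins) magnitude spectrogram via a short-time FFT.

    Assumes len(samples) >= frame_len. Frame and hop sizes are illustrative.
    """
    samples = np.asarray(samples, dtype=float)
    window = np.hanning(frame_len)
    n_frames = 1 + (len(samples) - frame_len) // hop
    # Slice the signal into overlapping frames and apply the window to each.
    frames = np.stack([
        samples[i * hop: i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft keeps only the non-negative frequency bins of each real-valued frame.
    return np.abs(np.fft.rfft(frames, axis=1))


# Hypothetical input: one second of a 1 kHz tone sampled at 8 kHz.
rate = 8000
t = np.arange(rate) / rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = spectrogram(tone)
peak_bin = int(spec.mean(axis=0).argmax())   # strongest bin, averaged over time
peak_hz = peak_bin * rate / 256              # convert bin index to hertz
```

Each row of `spec` is one time slice and each column one frequency bin, which is the kind of grid a display could render next to the transcript.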
Abstract
A voice and/or video message for a user in the form of a voicemail or a video mail is edited in accordance with editing of text corresponding to audio of the message. Speech contained in the audio of the voice or video message is automatically converted to corresponding text and presented to the user via a graphical user interface. In response to various user inputs in relation to the presentation of the corresponding text, the corresponding text is modified and the digital representation of the voice or video message is modified in a manner corresponding to the modification of the corresponding text. New versions of the modified voice or video message including the modified digital representation of the voice or video message can be created and saved.
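One way to picture how an edit to the text could drive a matching edit to the audio is to assume the speech-to-text step also yields a sample range for each recognized word. The sketch below relies on that assumption. The `Word` record, the `delete_words` helper, and the sample data are hypothetical and are not drawn from the patent, which does not specify the data structures used.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: int  # index of the word's first audio sample
    end: int    # one past the index of the word's last audio sample


def delete_words(samples, words, indices_to_delete):
    """Drop the chosen words from the transcript and cut the matching audio.

    Assumes `words` is sorted by start time and the ranges do not overlap.
    """
    doomed = set(indices_to_delete)
    kept_samples = []
    kept_words = []
    cursor = 0    # read position in the original audio
    removed = 0   # number of samples cut so far
    for i, w in enumerate(words):
        if i in doomed:
            # Keep the audio up to the deleted word, then skip past it.
            kept_samples.extend(samples[cursor:w.start])
            cursor = w.end
            removed += w.end - w.start
        else:
            # Shift surviving words left by the amount of audio already cut.
            kept_words.append(Word(w.text, w.start - removed, w.end - removed))
    kept_samples.extend(samples[cursor:])
    return kept_samples, kept_words


# Hypothetical message: ten samples and three recognized words.
audio = list(range(10))
transcript = [Word("call", 0, 3), Word("um", 3, 5), Word("me", 5, 9)]

new_audio, new_transcript = delete_words(audio, transcript, [1])
new_text = " ".join(w.text for w in new_transcript)
```

Deleting the filler word "um" from the text removes samples 3 and 4 from the audio and shifts the timing of "me" so the transcript and the audio stay aligned.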
24 Claims
1. A computer-implemented method comprising steps of:
obtaining a digital representation of a message having at least audio from a mailbox of a user in a server;
automatically converting, by a processor, speech contained in the audio of the message to corresponding text;
automatically converting, by the processor, the audio of the message to a corresponding spectral diagram, the spectral diagram being a frequency domain representation of the audio of the message;
presenting a unified display on a computing device screen containing both the text and the spectral diagram to the user;
automatically editing the text in response to one or more user inputs in relation to the presentation of the text;
using the spectral diagram to remove sound in the audio that does not correspond to the text;
automatically editing, by the processor, the digital representation of the message, including the audio of the message, in a manner corresponding to the editing of the text;
annotating the text in response to one or more user inputs in relation to the presentation of the text;
annotating the digital representation of the message, including the audio of the message, in a manner corresponding to the annotating of the text; and
storing, in the server, the edited and annotated digital representation of the message.
Dependent claims: 2, 3, 4, 5, 6, 19, 20
7. A server, comprising:
an interface for network communication;
a processor coupled to the interface;
a program for the processor; and
storage for the program and for a mailbox of a user, wherein execution of the program by the processor causes the server to perform functions, including functions to:
process a digital representation of a message in the mailbox of the user to convert speech contained in audio of the message to corresponding text;
process the digital representation of the message to convert the audio of the message to a corresponding spectral diagram, the spectral diagram being a frequency domain representation of the audio of the message;
present a unified display on a computing device screen containing both the text and the spectral diagram to the user;
automatically edit the text in response to one or more user inputs in relation to the presentation of the text;
use the spectral diagram to remove sound in the audio that does not correspond to the text;
automatically edit, by the processor, the digital representation of the message, including the audio of the message, in a manner corresponding to the editing of the text;
annotate the text in response to one or more user inputs in relation to the presentation of the text;
annotate the digital representation of the message, including the audio of the message, in a manner corresponding to the annotating of the text; and
store, in the server, the edited and annotated digital representation of the message.
Dependent claims: 8, 9, 10, 11, 12, 13, 21, 22
14. A user terminal device, comprising:
an interface for network communication;
a processor coupled to the interface;
a program for the processor; and
storage for the program and for a digital representation of a message having at least audio, obtained from a mailbox of a user in a server, wherein execution of the program by the processor causes the user terminal device to perform functions, including functions to:
process the digital representation of the message to convert speech contained in the audio of the message to corresponding text;
process the digital representation of the message to convert the audio of the message to a corresponding spectral diagram, the spectral diagram being a frequency domain representation of the audio of the message;
present a unified display on a computing device screen containing both the text and the spectral diagram to the user;
automatically edit the text in response to one or more user inputs in relation to the presentation of the text;
use the spectral diagram to remove sound in the audio that does not correspond to the text;
automatically edit, by the processor, the digital representation of the message, including the audio of the message, in a manner corresponding to the editing of the text;
annotate the text in response to one or more user inputs in relation to the presentation of the text;
annotate the digital representation of the message, including the audio of the message, in a manner corresponding to the annotating of the text; and
store, in the server, the edited and annotated digital representation of the message.
Dependent claims: 15, 16, 17, 18, 23, 24
Specification