Methods for authoring and interacting with multimedia representations of documents
First Claim
Patent Images
1. A method comprising:
- extracting a plurality of visual, audible, and audiovisual document elements from content contained within an input document, wherein the input document comprises a single file;
displaying at least one visual, audible, or audiovisual document elements from the extracted visual, audible, and audiovisual document elements of the input document in a graphical user interface;
automatically selecting two or more of the extracted visual, audible, or audiovisual document elements for inclusion in a multimedia thumbnail representation of the input document based on a time attribute and an information attribute associated with each of the extracted visual, audible, and audiovisual document elements to maximize a total of information content provided by the selected visual, audible, or audiovisual document elements in a given amount of time, wherein the automatically selecting comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using one or more characteristics of a device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein automatically selecting two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises;
selecting audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, andfilling the one or more time intervals of the audible channel with the selected audible document elements; and
automatically generating the multimedia representation based on the automatically selected one or more visual, audible, or audiovisual document elements.
1 Assignment
0 Petitions
Accused Products
Abstract
A method, apparatus and article of manufacture for assisting a user with authoring a multimedia representation of documents. In one embodiment, the method comprises displaying one or more visual, audible, or audiovisual document elements from an input document in graphical user interface; and selecting one or more elements and for inclusion in a multimedia representation.
138 Citations
37 Claims
-
1. A method comprising:
-
extracting a plurality of visual, audible, and audiovisual document elements from content contained within an input document, wherein the input document comprises a single file; displaying at least one visual, audible, or audiovisual document elements from the extracted visual, audible, and audiovisual document elements of the input document in a graphical user interface; automatically selecting two or more of the extracted visual, audible, or audiovisual document elements for inclusion in a multimedia thumbnail representation of the input document based on a time attribute and an information attribute associated with each of the extracted visual, audible, and audiovisual document elements to maximize a total of information content provided by the selected visual, audible, or audiovisual document elements in a given amount of time, wherein the automatically selecting comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using one or more characteristics of a device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein automatically selecting two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises; selecting audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and filling the one or more time intervals of the audible channel with the selected audible document elements; and automatically generating the multimedia representation based on the automatically selected one or more visual, audible, or audiovisual document elements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 37)
-
-
11. A method comprising:
-
extracting a plurality of visual, audible, and audiovisual document elements from content contained within an input document, wherein the input document comprises a single file; automatically selecting two or more visual, audible, or audiovisual document elements from the extracted visual, audible, and audiovisual document elements of the input document based on a time attribute and an information attribute associated with each of the extracted visual, audible, and audiovisual document elements to maximize a total of information content provided by the selected visual, audible, and audiovisual document elements in a given amount of time when included in a multimedia representation of the input document, wherein the automatically selecting comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting at least one visual, audible, or audiovisual document element based on each time attribute and each information attribute, wherein each time attribute is calculated, using one or more characteristics of a device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein automatically selecting two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises; selecting audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and filling the one or more time intervals of the audible channel with the selected audible document elements; and automatically generating the multimedia representation based on the automatically selected one or more visual, audible, or audiovisual document elements; and displaying an interactive user interface to playback a combination of the multimedia representation and the input document under user control. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. An apparatus comprising:
-
a memory to store an input document; and a processor coupled with the memory to execute a a multimedia thumbnail generator to extract a plurality of visual, audible, and audiovisual document elements from content contained within the input document and automatically generate a multimedia representation from one or more visual, audible, or audiovisual document elements automatically selected from extracted document elements from the input document based on one or more optimizing constraints to maximize a total of information content that is provided by the selected document element in a given amount of time, wherein the one or more optimizing constraints comprises one or more characteristics of a device and one or more application constraints, wherein the automatic selection comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using one or more characteristics of a device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, wherein each information attribute is calculated, using the one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein the automatic selection of two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises the processor configured to; select audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and fill the one or more time intervals of the audible channel with the selected audible document elements, and a graphical user interface, coupled to the multimedia thumbnail generator, to present the automatically selected visual, audible, and audiovisual document elements and to receive a user input to edit the multimedia representation. - View Dependent Claims (21, 22, 23, 24, 25, 26)
-
-
27. An apparatus comprising:
-
a memory to store an input document; and a processor coupled with the memory to execute a a multimedia thumbnail generator to analyze the input document and extract a plurality of visual, audible, and audiovisual document elements from the input document and automatically generate a multimedia representation from one or more visual, audible, or audiovisual document elements automatically selected from extracted document elements from the input document based on one or more optimizing constraints and a user input, wherein the one or more optimizing constraints are utilized to maximize a total of information content provided by the selected document elements in a given amount of time, and comprise one or more characteristics of a device displaying a graphical user interface and one or more application constraints, wherein the automatic selection comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using one or more characteristics of a device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using the one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein the automatic selection of two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises the processor configured to; select audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and fill the one or more time intervals of the audible channel with the selected audible document elements, and a graphical user interface, coupled to the multimedia thumbnail generator, to present the multimedia representation and to switch between the presentation of the multimedia representation and presentation of the input document under user control. - View Dependent Claims (28, 29, 30, 31)
-
-
32. A non-transitory computer readable storage medium with instructions thereon which, when executed by a system, cause the system to perform a method comprising:
-
extracting a plurality of visual, audible, and audiovisual document elements from content within an input document, wherein the input document comprises a single file;
automatically selecting two or more visual, audible, or audiovisual document elements from the extracted visual, audible, and audiovisual document elements of the input document for inclusion into a multimedia representation of the input document based on one or more optimizing constraints to maximize a total of information content provided by the selected two or more visual, audible, or audiovisual document elements in a given amount of time, wherein the one or more optimizing constraints comprises one or more characteristics of a device displaying a graphical user interface and one or more application constraints, wherein the automatically selecting comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using the one or more characteristics of the device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using the one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein automatically selecting two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises;selecting audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and filling the one or more time intervals of the audible channel with the selected audible document elements; automatically generating the multimedia representation based on the automatically selected one or more visual, audible, and audiovisual document elements; providing a graphical user interface to view the automatically selected visual, audible, and audiovisual document elements and to edit the multimedia representation. - View Dependent Claims (33)
-
-
34. A non-transitory computer readable storage medium with instructions thereon which, when executed by a system, cause the system to perform a method comprising:
-
extracting a plurality of visual, audible, and audiovisual document elements from content contained within an input document, wherein the input document comprises a single file; automatically selecting two or more visual, audible, or audiovisual document elements from the extracted visual, audible, and audiovisual document elements of the input document for inclusion into a multimedia representation of the input document based on one or more optimizing constraints to maximize a total of information content provided by the selected two or more visual, audible, or audiovisual document elements in a given amount of time, wherein the one or more optimizing constraints comprise one or more characteristics of a device displaying a graphical user interface and one or more application constraints, wherein the automatic selection comprises calculating a time attribute and an information attribute for each of the extracted visual, audible, and audiovisual document elements and selecting one or more visual, audible, or audiovisual document elements based on each time attribute and each information attribute, wherein each time attribute is calculated, using the one or more characteristics of the device that is to display the multimedia representation, to determine a minimum presentation time for a user to comprehend each of the extracted visual, audible, and audiovisual document elements, and wherein each information attribute is calculated, using the one or more application constraints, to determine an amount of information content that is contained in each of the extracted visual, audible, and audiovisual document elements, and wherein automatically selecting two or more of the extracted visual, audible, or audiovisual document elements to maximize the total of information content provided by the selected visual, audible, or audiovisual document elements in the given amount of time, further comprises; selecting audible document elements that maximize an amount of information contained in the selected audible document elements when presented in one or more time intervals of an audible channel of the multimedia representation that have not been filled with audio data associated with the selected audiovisual elements, and filling the one or more time intervals of the audible channel with the selected audible document elements; automatically generating the multimedia representation based on the automatically selected one or more visual, audible, or audiovisual document elements; providing an interactive user interface to playback a combination of the multimedia representation and the input document under user control. - View Dependent Claims (35, 36)
-
Specification