Activity-ware for non-textual objects
First Claim
1. A system that facilitates organization of audio media, the system comprising:
- a memory, wherein the memory is encoded with instructions;
a processor, wherein the processor executes the instructions;
the instructions being executed comprising;
an inference component that determines a point of interest based at least in part upon identification of an energy level, wherein identification of the energy level occurs in an oral conversation or review of a recording of the oral conversation, wherein the energy level is based at least in part on measurable auditory indications;
an annotation component that marks an audio media file at a location associated with the point of interest; and
a summarization component that generates a summary of the oral conversation by compiling portions of the audio media file that are in a threshold proximity to one or more locations marked by the annotation component, wherein the summarization component automatically determines an appropriate size portion for the threshold proximity as a function of relevancy to each marked point of interest, wherein the appropriate size portion is determined by;
analyzing a first portion of the audio media file within a first proximity to a point of interest by translating the first portion into text and identifying a first set of keywords representative of the first portion of the audio media file;
analyzing a second portion of the audio media file within a second threshold proximity to a point of interest by translating the first portion into text and identifying a second set of keywords representative of the second portion of the audio media file;
comparing relevancy of keywords within the first portion to keywords within the second portion of the audio media file and determining whether relevancy of keywords within the second portion drops below a default relevancy factor, wherein the default relevancy factor is a function of the first set of keywords representative of the first portion of the audio media file; and
selecting the appropriate size portion based on whether relevancy of keywords within the second portion drops below the default relevancy factor.
2 Assignments
0 Petitions
Accused Products
Abstract
Providing for summarization and analysis of audio content is described herein. By way of example, an oral conversation can be analyzed, such that points of interest within the oral conversation can be identified and file locations related to such points of interest can be marked. Points of interest can be inferred based on a level of energy, e.g., excitement, pitch, tone, pace, or the like, associated with one or more speakers. Alternatively, or in addition, speaker and/or reviewer activity can form the basis for identifying points of interest within the conversation. Moreover, a compilation of the identified points of interest and portions of the original oral conversation related thereto can be assembled. As described herein, audio content can be succinctly summarized with respect to inferred and/or indicated points of interest, to facilitate an efficient and pertinent review of such content.
27 Citations
17 Claims
-
1. A system that facilitates organization of audio media, the system comprising:
-
a memory, wherein the memory is encoded with instructions; a processor, wherein the processor executes the instructions; the instructions being executed comprising; an inference component that determines a point of interest based at least in part upon identification of an energy level, wherein identification of the energy level occurs in an oral conversation or review of a recording of the oral conversation, wherein the energy level is based at least in part on measurable auditory indications; an annotation component that marks an audio media file at a location associated with the point of interest; and a summarization component that generates a summary of the oral conversation by compiling portions of the audio media file that are in a threshold proximity to one or more locations marked by the annotation component, wherein the summarization component automatically determines an appropriate size portion for the threshold proximity as a function of relevancy to each marked point of interest, wherein the appropriate size portion is determined by; analyzing a first portion of the audio media file within a first proximity to a point of interest by translating the first portion into text and identifying a first set of keywords representative of the first portion of the audio media file; analyzing a second portion of the audio media file within a second threshold proximity to a point of interest by translating the first portion into text and identifying a second set of keywords representative of the second portion of the audio media file; comparing relevancy of keywords within the first portion to keywords within the second portion of the audio media file and determining whether relevancy of keywords within the second portion drops below a default relevancy factor, wherein the default relevancy factor is a function of the first set of keywords representative of the first portion of the audio media file; and selecting the appropriate size portion based on whether relevancy of keywords within the second portion drops below the default relevancy factor. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for providing a summary of an audio content, comprising:
-
storing, in a memory, instructions for performing the method of providing a summary of an audio content; executing the instructions on a processor; according to the instructions being executed; capturing at least a portion of an oral conversation in an audio file; marking the audio file at one or more locations proximate to one or more points of interest, wherein the one or more points of interest are identified via a speaker activity or inferred from a degree of emotion in one or more speakers'"'"' voices; associating portions of the audio file that are within a threshold proximity to at least one point of interest; and summarizing the oral conversation by compiling portions of the audio media file that are in a threshold proximity to one or more locations marked, wherein summarizing automatically determines an appropriate size portion for the threshold proximity as a function of relevancy to each marked point of interest. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system that facilitates annotation and summarization of auditory objects, the system comprising:
-
a memory, wherein the memory is encoded with instructions; a processor, wherein the processor executes the instructions; the instructions being executed comprising; means for identifying one or more points of interest within audio content based on a level of emotion of one or more speakers'"'"' voices, or based on a predetermined human activity, or combinations thereof; means for book marking an audio file at one or more locations commensurate with the one or more identified points of interest; means for correlating the audio file with diverse media related to the audio content, wherein the diverse media includes photographic media, video media, and textual media; and means for book marking one or more diverse media files containing the diverse media at locations commensurate with the one or more points of interest within the audio content. - View Dependent Claims (17)
-
Specification