SYSTEM AND METHOD FOR INSERTING A DESCRIPTION OF IMAGES INTO AUDIO RECORDINGS
First Claim
1. A method of inserting a description of an image into an audio recording, comprising:
- interpreting an image and producing a word description of the image including at least one image keyword;
parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword;
calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and
selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image.
1 Assignment
0 Petitions
Accused Products
Abstract
There is disclosed a system and method for interpreting and describing graphic images. In an embodiment, the method of inserting a description of an image into an audio recording includes: interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. The word description of the image can then be appended to the selected audio clip to produce an augmented audio recording including the interpreted word description of the image.
22 Citations
21 Claims
-
1. A method of inserting a description of an image into an audio recording, comprising:
-
interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system for inserting a description of an image into an audio recording, comprising:
-
an interpreting system for interpreting an image and producing a word description of the image including at least one image keyword; a parsing system for parsing an audio recording into a plurality of audio clips, and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; a calculating system for calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and a selecting system for selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A program product stored on a computer readable medium, which when executed, inserts a description of an image into an audio recording, the computer readable medium comprising program code for:
-
interpreting an image and producing a word description of the image including at least one image keyword; parsing an audio recording into a plurality of audio clips and producing a transcription of each audio clip, each audio clip transcription including at least one audio keyword; calculating a similarity distance between the at least one image keyword and the at least one audio keyword of each audio clip; and selecting the audio clip transcription having a shortest similarity distance to the at least one image keyword as a location to insert the word description of the image. - View Dependent Claims (16, 17, 18, 19, 20, 21)
-
Specification