Recording audio metadata for stored images

US 8,385,588 B2
Filed: 12/11/2007
Issued: 02/26/2013
Est. Priority Date: 12/11/2007
Status: Expired due to Fees

First Claim

Patent Images

1. A method of processing audio signals including speech signals, the audio signals and image data being recorded in a media file, comprising:

a. automatically extracting the speech signals from the audio signals from the media file and converting the speech signals to textual metadata wherein the textual metadata are keywords recognized from a pre-determined vocabulary;

b. automatically analyzing the textual metadata using natural language processing algorithms to identify people'"'"'s names, place names, or object names and adding the identified names to the textual metadata;

c. using the updated textual metadata to compute a commentary value metric wherein the commentary value metric is a measure of the amount of viewer commentary associated with the media file;

d. automatically semantically analyzing the image data from the media file to identify a person, place, object or activity to produce a visual display of selected portions of the image data, and prompt the user to provide additional textual metadata associated with each of the selected portions of the image data; and

e. associating the updated textual metadata automatically obtained from the speech signals in the media file, the additional textual metadata provided by the user during the display of the selected portions of the image data from the media file, and the commentary value metric with the media file.

View all claims

5 Assignments

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of processing audio signals recorded during display of image data from a media file on a display device to produce semantic understanding data and associating such data with the original media file, includes: separating a desired audio signal from the aggregate mixture of audio signals; analyzing the separated signal for purposes of gaining semantic understanding; and associating the semantic information obtained from the audio signals recorded during image display with the original media file.

50 Citations

View as Search Results

15 Claims

1. A method of processing audio signals including speech signals, the audio signals and image data being recorded in a media file, comprising:
- a. automatically extracting the speech signals from the audio signals from the media file and converting the speech signals to textual metadata wherein the textual metadata are keywords recognized from a pre-determined vocabulary;
  
  b. automatically analyzing the textual metadata using natural language processing algorithms to identify people'"'"'s names, place names, or object names and adding the identified names to the textual metadata;
  
  c. using the updated textual metadata to compute a commentary value metric wherein the commentary value metric is a measure of the amount of viewer commentary associated with the media file;
  
  d. automatically semantically analyzing the image data from the media file to identify a person, place, object or activity to produce a visual display of selected portions of the image data, and prompt the user to provide additional textual metadata associated with each of the selected portions of the image data; and
  
  e. associating the updated textual metadata automatically obtained from the speech signals in the media file, the additional textual metadata provided by the user during the display of the selected portions of the image data from the media file, and the commentary value metric with the media file.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
- - 2. The method of claim 1, further including providing at least one microphone in the display device and digitizing audio signals captured by the microphone(s).
  - 3. The method of claim 1, wherein a still or video digitized image is stored in, and read from, a display device'"'"'s internal memory or from a removable storage device.
  - 4. The method of claim 1, wherein the still or video digitized image is stored on, and read from, a remotely located computer on a wired or wireless network.
  - 5. The method of claim 1, further comprising separating the audio signal into components and selecting one of the components for analysis.
  - 6. A method of applying the method of claim 1 to a plurality of different media files to obtain common metadata and associating such obtained metadata with the related media files.
  - 7. The method of claim 6, wherein the related media files share a common location of capture.
  - 8. The method of claim 6, wherein the related media files share one or more common person(s), places, activities or object(s).
  - 9. The method of claim 6, wherein the related media files share content-descriptive metadata.
  - 10. The method of claim 6, wherein the related media files share common event metadata.
  - 11. The method of claim 1, further including providing a value metric to measure the amount of viewer commentary associated with the media files.
  - 12. The method of claim 2, further including analysis of the audio signal to determine the beginning and ending of viewer commentary associated with image data.
  - 13. The method of claim 12, further including providing control of image transitions during display according to the analysis of the audio signal.
  - 14. The method of claim 6, wherein the plurality of media files is displayed as a group of related images or thumbnail icons.
  - 15. The method of claim 6, wherein the plurality of media files is displayed as a single image or thumbnail icon representing a group or collection of related images.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Monument Peak Ventures, LLC (Dominion Harbor Enterprises, LLC)
Original Assignee
Eastman Kodak Company
Inventors
Jacoby, Keith A., Murray, Thomas J., Nelson, John V., Gobeyn, Kevin M.
Primary Examiner(s)
YEN, ERIC L

Application Number

US11/954,089
Publication Number

US 20090150147A1
Time in Patent Office

1,904 Days
Field of Search

382/100, 704/200
US Class Current

382/100
CPC Class Codes

G06F 16/58   Retrieval characterised by ...

G06F 16/7867   using information manually ...

G06F 16/787   using geographical or spati...

G10L 15/26   Speech to text systems G10L...

G11B 27/105   of operating discs

G11B 27/28   by using information signal...

G11B 27/34   Indicating arrangements in...

Recording audio metadata for stored images

First Claim

5 Assignments

0 Petitions

Accused Products

Abstract

50 Citations

15 Claims

Specification

Solutions

Use Cases

Quick Links

Recording audio metadata for stored images

First Claim

5 Assignments

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

50 Citations

15 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links