System and method for rich media annotation
Abstract
Disclosed herein are systems, methods, and computer-readable media for rich media annotation, the method comprising receiving first recorded media content, receiving at least one audio annotation about the first recorded media content, extracting metadata from the at least one audio annotation, and associating all or part of the metadata with the first recorded media content. Additional data elements may also be associated with the first recorded media content. Where the audio annotation is a telephone conversation, the recorded media content may be captured via the telephone. The recorded media content, audio annotations, and/or metadata may be stored in a central, modifiable repository. Speech characteristics such as prosody may be analyzed to extract additional metadata. In one aspect, a specially trained grammar identifies and recognizes metadata.
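The parsing step described above can be sketched in Python. This is a purely illustrative reduction of the claimed "grammar": the category names and keyword lists below are hypothetical stand-ins for a trained speech-recognition grammar, not part of the disclosure.

```python
import re

# Hypothetical keyword "grammar": maps a metadata category to phrases the
# parser recognizes. A deployed system would use a trained grammar; these
# lists exist only to make the sketch runnable.
GRAMMAR = {
    "location": ["beach", "park", "office"],
    "individual": ["alice", "bob"],
    "activity": ["hiking", "birthday", "meeting"],
}

def parse_annotation(text):
    """Extract {category: [terms]} metadata from recognized annotation text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    metadata = {}
    for category, terms in GRAMMAR.items():
        hits = [t for t in tokens if t in terms]
        if hits:
            metadata[category] = hits
    return metadata
```

For example, the transcript "Alice and Bob went hiking near the beach" would yield location, individual, and activity metadata in one pass.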
18 Claims
1. A method comprising:
receiving, at a processor, first media content for annotation by the processor based on an associated audio annotation;
identifying, by the processor, the audio annotation, wherein the audio annotation is stored within a digital file;
generating, via the processor, text from the audio annotation using a grammar;
parsing, via the processor, the text to identify first metadata indicative of one or more of a location, an individual, or an activity;
based on weights assigned to the first metadata, identifying, via the processor, second media content having associated weighted second metadata corresponding to the first metadata;
generating, via the processor, at least one descriptive annotation for the first media content based on the first metadata and the second metadata;
receiving, by the processor, additional metadata via a dialog with a user who recorded the first media content;
annotating, via the processor, the first media content using the at least one descriptive annotation and the additional metadata to yield annotated first media content; and
providing the annotated first media content to users.
Dependent claims: 2, 3, 4, 5, 6.
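The claim's weighted-matching step — using weights assigned to the first metadata to identify second media content whose metadata corresponds — can be sketched as a simple scoring function. The field names, weights, and candidate library below are illustrative assumptions, not the patented implementation.

```python
def best_match(first_metadata, weights, candidates):
    """Return the candidate whose metadata best matches, by summed weights.

    first_metadata: {category: value} for the media being annotated
    weights:        {category: float} importance assigned to each category
    candidates:     [{"id": ..., "metadata": {category: value}}, ...]
    """
    def score(item):
        # Sum the weight of every category whose value matches exactly.
        return sum(weights.get(cat, 0.0)
                   for cat, val in first_metadata.items()
                   if item["metadata"].get(cat) == val)
    return max(candidates, key=score)

# Illustrative media library and query.
library = [
    {"id": "img_07", "metadata": {"location": "beach", "individual": "alice"}},
    {"id": "img_12", "metadata": {"location": "office", "activity": "meeting"}},
]
match = best_match({"location": "beach", "activity": "hiking"},
                   {"location": 2.0, "individual": 1.0, "activity": 1.5},
                   library)
```

Here the beach photo wins on the weighted location match even though its activity metadata does not correspond.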
7. A system comprising:
a processor; and
a computer-readable storage medium having instructions stored which, when executed by the processor, cause the processor to perform operations comprising:
receiving first media content for annotation by the processor based on an associated audio annotation;
identifying the audio annotation, wherein the audio annotation is stored within a digital file;
generating text from the audio annotation using a grammar;
parsing the text to identify first metadata indicative of one or more of a location, an individual, or an activity;
based on weights assigned to the first metadata, identifying second media content having associated weighted second metadata corresponding to the first metadata;
generating at least one descriptive annotation for the first media content based on the first metadata and the second metadata;
receiving additional metadata via a dialog with a user who recorded the first media content;
annotating the first media content using the at least one descriptive annotation and the additional metadata to yield annotated first media content; and
providing the annotated first media content to users.
Dependent claims: 8, 9, 10, 11, 12.
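The final generation step — combining the first metadata, the second media's metadata, and any additional metadata gathered through the user dialog into a descriptive annotation — can be sketched as a merge with a fixed precedence. The precedence order and string format are assumptions made for illustration.

```python
def descriptive_annotation(first_metadata, second_metadata, additional=None):
    """Merge metadata sources into one descriptive annotation string.

    second_metadata fills gaps, first_metadata wins conflicts, and any
    additional (dialog-supplied) metadata overrides both. Illustrative only.
    """
    merged = dict(second_metadata)
    merged.update(first_metadata)
    if additional:
        merged.update(additional)
    return "; ".join(f"{k}: {v}" for k, v in sorted(merged.items()))
```

For instance, a dialog-supplied activity is appended to the location extracted from the audio annotation and the individual inherited from the matched second media content.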
13. A non-transitory computer-readable storage device having instructions for performing annotation stored thereon which, when executed by a computing device, cause the computing device to perform operations comprising:
receiving first media content for annotation by the computing device based on an associated audio annotation;
identifying the audio annotation, wherein the audio annotation is stored within a digital file;
generating text from the audio annotation using a grammar;
parsing the text to identify first metadata indicative of one or more of a location, an individual, or an activity;
based on weights assigned to the first metadata, identifying second media content having associated weighted second metadata corresponding to the first metadata;
generating at least one descriptive annotation for the first media content based on the first metadata and the second metadata;
receiving additional metadata via a dialog with a user who recorded the first media content;
annotating the first media content using the at least one descriptive annotation and the additional metadata to yield annotated first media content; and
providing the annotated first media content to users.
Dependent claims: 14, 15, 16, 17, 18.
Specification