Using audio cues to improve object retrieval in video

US 10,108,617 B2
Filed: 10/30/2013
Issued: 10/23/2018
Est. Priority Date: 10/30/2013
Status: Active Grant

First Claim

Patent Images

1. A method of object retrieval from visual data, the method comprising:

annotating at least one portion of the visual data with a context keyword corresponding to an object, wherein the annotating is performed responsive to recognition of the context keyword by performing speech recognition in audio data corresponding to the at least one portion of the visual data;

receiving a query to retrieve the object, wherein the query comprises a query keyword associated with both the object and the context keyword;

identifying the at least one portion of the visual data based on the context keyword; and

searching for the object in the at least one portion of the visual data using an appearance model corresponding to the query keyword.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

A method of object retrieval from visual data is provided that includes annotating at least one portion of the visual data with a context keyword corresponding to an object, wherein the annotating is performed responsive to recognition of the context keyword in audio data corresponding to the at least one portion of the visual data, receiving a query to retrieve the object, wherein the query includes a query keyword associated with both the object and the context keyword, identifying the at least one portion of the visual data based on the context keyword, and searching for the object in the at least one portion of the visual data using an appearance model corresponding to the query keyword.

Citations

5 Claims

1. A method of object retrieval from visual data, the method comprising:
- annotating at least one portion of the visual data with a context keyword corresponding to an object, wherein the annotating is performed responsive to recognition of the context keyword by performing speech recognition in audio data corresponding to the at least one portion of the visual data;
  
  receiving a query to retrieve the object, wherein the query comprises a query keyword associated with both the object and the context keyword;
  
  identifying the at least one portion of the visual data based on the context keyword; and
  
  searching for the object in the at least one portion of the visual data using an appearance model corresponding to the query keyword.
- View Dependent Claims (2)
- - 2. The method of claim 1, wherein the appearance model, the corresponding query keyword, and the context keyword are predetermined.

3. A digital system configured to perform object retrieval from visual data, the digital system comprising one or more processors configured to:
- capture the visual data and corresponding audio data;
  
  annotate at least one portion of the visual data with a context keyword that is obtained from audio data corresponding to the at least one portion of the visual data;
  
  receive a query to retrieve an object, wherein the query comprises a query keyword associated with both the object and the context keyword;
  
  identify the at least one portion of the visual data based on the context keyword; and
  
  search for the object in the at least one portion of the visual data using an appearance model corresponding to the query keyword.
- View Dependent Claims (4, 5)
- - 4. The digital system of claim 3, wherein the appearance model, the corresponding query keyword, and the context keyword are predetermined.
  - 5. The digital system of claim 3, wherein the context keyword is obtained by speech recognition from the audio data corresponding to the at least one portion of the visual data.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Texas Instruments, Inc.
Original Assignee
Texas Instruments, Inc.
Inventors
Jain, Eakta, Barnum, Peter Charles
Primary Examiner(s)
To, Baoquoc N

Application Number

US14/067,923
Publication Number

US 20150120726A1
Time in Patent Office

1,819 Days
Field of Search

707705, 707706, 707707
US Class Current
CPC Class Codes

G06F 16/43   Querying

G06F 16/7834   using audio features

G11B 27/28   by using information signal...

G11B 27/30   on the same track as the ma...

H04N 5/77   between a recording apparat...

Using audio cues to improve object retrieval in video

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

5 Claims

Specification

Solutions

Use Cases

Quick Links

Using audio cues to improve object retrieval in video

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

5 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links