×

Systems and methods for using latent variable modeling for multi-modal video indexing

  • US 9,542,934 B2
  • Filed: 02/27/2014
  • Issued: 01/10/2017
  • Est. Priority Date: 02/27/2014
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method performed in connection with a computerized system comprising a processing unit and a memory, the computer-implemented method comprising:

  • a. using the processing unit to generate a multi-modal language model for co-occurrence of spoken words in the plurality of videos and an external text associated with the plurality of videos;

    b. selecting at least a portion of a first video;

    c. extracting a plurality of spoken words from the selected portion of the first video;

    d. obtaining a first external text associated with the selected portion of the first video, wherein the obtained first external text is separate and distinct from a representation of the extracted plurality of spoken words; and

    e. using the processing unit and the generated multi-modal language model to rank the extracted plurality of spoken words based on probability of occurrence conditioned on the obtained first external text.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×