×

Concept based cross media indexing and retrieval of speech documents

  • US 7,716,221 B2
  • Filed: 06/01/2007
  • Issued: 05/11/2010
  • Est. Priority Date: 06/02/2006
  • Status: Active Grant
First Claim
Patent Images

1. A method of cross media indexing, registering and retrieving speech documents, the method comprising:

  • a computing device pre-processing a set of training documents, including at least creating training document metadata;

    the computing device constructing a terms-phonemes/document matrix from the training document metadata where rows are created for the terms and phonemes contained in the set of training documents and columns are created for each training document;

    the computing device normalizing entries in the terms-phonemes/document matrix;

    the computing device computing a vector space from the training documents by computing from the terms-phonemes/document matrix and storing the vector space in a catalog; and

    the computing device computing vectors for new documents and adding the vectors to the vector space without computing a new vector space in response to adding the vectors.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×