×

Concept based cross media indexing and retrieval of speech documents

  • US 20070299838A1
  • Filed: 06/01/2007
  • Published: 12/27/2007
  • Est. Priority Date: 06/02/2006
  • Status: Active Grant
First Claim
Patent Images

1. A method of cross media indexing, registering and retrieving speech documents comprising the steps of:

  • registering a set of training documents;

    pre-processing each training document;

    constructing a terms-phonemes/document matrix from the training document metadata where a row is created for term and each phoneme in the training documents and a column is created for each training document;

    normalizing entries in the terms-phonemes/document matrix;

    computing a concept vector space from the training documents by computing from the terms-phonemes/document matrix;

    computing vectors for new documents and adding the vectors to the vector space;

    searching the computed vector space for vectors that are close to a vector computed for a query term or phoneme; and

    providing a list of those speech and/or text documents with the highest values.

View all claims
  • 5 Assignments
Timeline View
Assignment View
    ×
    ×