Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof
First Claim
Patent Images
1. A method for generating a large-scale database of heterogeneous speech, comprising:
- transcribing a plurality of multimedia signals retrieved from a large text database and a speech database;
randomly selecting a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length;
generating a plurality of signatures based on the plurality of speech segments; and
populating the large-scale database with the plurality of signatures respective of the plurality of multimedia signals.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for generating a large-scale database of heterogeneous speech are provided. The method includes transcribing a plurality of multimedia signals retrieved from a large text database and a speech database; randomly selecting a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; generating a plurality of signatures based on the plurality of speech segments; and populating the large-scale database with the plurality of signatures respective of the plurality of multimedia signals.
158 Citations
11 Claims
-
1. A method for generating a large-scale database of heterogeneous speech, comprising:
-
transcribing a plurality of multimedia signals retrieved from a large text database and a speech database; randomly selecting a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; generating a plurality of signatures based on the plurality of speech segments; and populating the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system for generating a large-scale database of heterogeneous speech, comprising:
-
a processor; a memory, the memory containing instructions that, when executed by the processor, configure the system to; transcribe a plurality of multimedia signals retrieved from a large text database and a speech database; randomly select a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; generate a plurality of signatures based on the plurality of speech segments; and populate the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. - View Dependent Claims (8, 9, 10, 11)
-
Specification