×

Automatic generation of a database for speech recognition from video captions

  • US 9,905,221 B2
  • Filed: 03/09/2014
  • Issued: 02/27/2018
  • Est. Priority Date: 04/02/2013
  • Status: Active Grant
First Claim
Patent Images

1. A system for automatic generation of a database for speech recognition, comprising:

  • a text subsystem;

    an audio subsystem configured to operate in synchronization with said text subsystem;

    a matching module; and

    a database of matching audio signals and text words;

    wherein said text subsystem comprises;

    a source of video frames comprising text;

    a text detection module configured to receive a first video frame, detect the text therein by looking for text patterns and generate a first timestamp if the detected text in said first video frame is different than text detected in a previous video frame,said text detection module further configured to receive a second video frame,detect the text therein by looking for text patterns ad generate a second timestamp if the detected text in said second video frame is different than text detected in said first video frame; and

    an Optical Character Recognition module configured to produce a string of text words representing said detected text;

    wherein said audio subsystem comprises;

    a source of audio signals comprising an audio representation of said detected text;

    an audio buffering module configured to receive and store said audio signal between said first and second timestamps; and

    an audio words separation module configured to separate said stored audio signal into a string of audio words;

    said matching module configured to receive said string of text words and said string of audio words and store each pair of matching text word and audio word in said database.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×