×

Methods for presenting speech blocks from a plurality of audio input data streams to a user in an interface

  • US 8,719,032 B1
  • Filed: 12/11/2013
  • Issued: 05/06/2014
  • Est. Priority Date: 12/11/2013
  • Status: Active Grant
First Claim
Patent Images

1. A method comprising:

  • a) using a computer, identifying qualified audio on each of a plurality of audio input data streams by;

    i) identifying any unique signals on any of the plurality of audio input data streams which exceed an amplitude threshold as qualified audio; and

    ii) when similar signals exceeding the amplitude threshold are detected on multiple audio input data streams, identifying only the loudest of the similar signals as qualified audio;

    b) for each of the audio input data streams, identifying a set of speech blocks, each of which has a status and a start time, by, for each frame in the audio input data stream;

    i) executing code configured to add the current frame to a most recently created speech block if and only if;

    A) the most recent preceding frame corresponding to qualified audio has a time which differs from a time for the current frame by less than a first intervening duration threshold, and the status of the most recently created speech block is pending;

    orB) the most recent preceding frame corresponding to qualified audio has a time which differs from the time for the current frame by less than a second intervening duration threshold, and the status of the most recently created speech block is committed;

    ii) if the current frame is not added to the most recently created speech block, and the current frame corresponds to qualified audio, executing code configured to create a new speech block, wherein;

    A) the start time for the new speech block is the time for the current frame; and

    B) the status for the new speech block is pending;

    iii) if the status of the most recently created speech block is pending, executing code configured to change the status of the most recently created speech block to discarded if and only if;

    A) the current frame does not correspond to qualified audio;

    B) the most recent preceding frame corresponding to qualified audio has a time which differs from the time for the current frame by more than the first intervening duration threshold; and

    C) the status of the most recently created speech block is pending;

    iv) if the status of the most recently created speech block is pending, executing code configured to change the status of the most recently created speech block to committed if and only if;

    A) the current frame corresponds to qualified audio; and

    B) the start time for the most recently created speech block speech block precedes the time for the current frame by more than a minimum block duration threshold;

    c) presenting a speech block interface to a user, wherein;

    i) the speech block interface displays, for each audio input data stream, a timeline of speech blocks for the audio input data stream, the timeline being updated in real time as the qualified audio for the audio input data streams is identified;

    ii) the speech block interface is configured to allow the user play a portion of an audio input data stream corresponding to a speech block by selecting the speech block to be played;

    iii) the speech block interface is configured to allow the user to skip from each displayed speech block to a previous or subsequent displayed speech block; and

    iv) the speech block interface is configured not to display discarded speech blocks, to display pending speech blocks semitransparently, and to display committed speech blocks opaquely.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×