Identifying qualified audio of a plurality of audio streams for display in a user interface
First Claim
Patent Images
1. A method comprising:
- a) receiving a plurality of audio input data streams, wherein the plurality of audio input data streams comprise different audio input data streams representing concurrent outputs of a plurality of different microphones;
b) identifying qualified audio on each of the plurality of audio input data streams by;
i) identifying any unique signals on any of the plurality of audio input data streams which exceed an amplitude threshold as qualified audio; and
ii) when similar signals exceeding the amplitude threshold are detected on multiple audio input data streams, identifying only the loudest of the similar signals as qualified audio;
c) using a computer, organizing qualified audio into speech blocks, each of which has a status and a start time, and is associated with a single audio input data stream; and
d) presenting a speech block interface to a user, wherein the speech block interface displays, for each audio input data stream, a timeline of speech blocks for the audio input data stream;
wherein;
a) organizing qualified audio into speech blocks comprises;
i) adding identified qualified audio to an existing speech block if and only if the identified qualified audio is on the same audio input data stream as the existing speech block and either;
A) the identified qualified audio is separated from an end time for the existing speech block by no more than a first intra-block duration threshold and the existing speech block is subject to being, but has not been, discarded;
orB) the identified qualified audio is separated from the end time for the existing speech block by no more than a second intra-block duration threshold and the existing speech block is not subject to being discarded;
ii) creating a new speech block which is subject to being, but has not been, discarded, with the identified qualified audio if and only if the identified qualified audio is not added to the existing speech block and there are no other speech blocks between the identified qualified audio and the existing speech block on the same audio input data stream as the identified qualified audio; and
b) the first intra-block duration threshold is different from the second intra-block duration threshold.
1 Assignment
0 Petitions
Accused Products
Abstract
A clear picture of who is speaking in a setting where there are multiple input sources (e.g., a conference room with multiple microphones) can be obtained by comparing input channels against each other. The data from each channel can not only be compared, but can also be organized into portions which logically correspond to statements by a user. These statements, along with information regarding who is speaking, can be presented in a user friendly format via an interactive timeline which can be updated in real time as new audio input data is received.
71 Citations
16 Claims
-
1. A method comprising:
-
a) receiving a plurality of audio input data streams, wherein the plurality of audio input data streams comprise different audio input data streams representing concurrent outputs of a plurality of different microphones; b) identifying qualified audio on each of the plurality of audio input data streams by; i) identifying any unique signals on any of the plurality of audio input data streams which exceed an amplitude threshold as qualified audio; and ii) when similar signals exceeding the amplitude threshold are detected on multiple audio input data streams, identifying only the loudest of the similar signals as qualified audio; c) using a computer, organizing qualified audio into speech blocks, each of which has a status and a start time, and is associated with a single audio input data stream; and d) presenting a speech block interface to a user, wherein the speech block interface displays, for each audio input data stream, a timeline of speech blocks for the audio input data stream; wherein; a) organizing qualified audio into speech blocks comprises; i) adding identified qualified audio to an existing speech block if and only if the identified qualified audio is on the same audio input data stream as the existing speech block and either; A) the identified qualified audio is separated from an end time for the existing speech block by no more than a first intra-block duration threshold and the existing speech block is subject to being, but has not been, discarded;
orB) the identified qualified audio is separated from the end time for the existing speech block by no more than a second intra-block duration threshold and the existing speech block is not subject to being discarded; ii) creating a new speech block which is subject to being, but has not been, discarded, with the identified qualified audio if and only if the identified qualified audio is not added to the existing speech block and there are no other speech blocks between the identified qualified audio and the existing speech block on the same audio input data stream as the identified qualified audio; and b) the first intra-block duration threshold is different from the second intra-block duration threshold. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A machine comprising:
-
a) a plurality of microphones; b) a computer comprising a computer readable medium having stored thereon data operable to configured the computer to perform a method comprising; i) receiving a plurality of audio input data streams, wherein the plurality of audio input data streams comprise different audio input data streams representing concurrent outputs of different microphones from the plurality of microphones; ii) identifying qualified audio on each of the plurality of audio input data streams by; A) identifying any unique signals on any of the plurality of audio input data streams which exceed an amplitude threshold as qualified audio; and B) when similar signals exceeding the amplitude threshold are detected on multiple audio input data streams, identifying only the loudest of the similar signals as qualified audio; iii) organizing qualified audio into speech blocks, each of which has a status and a start time, and is associated with a single audio input data stream; and iv) presenting a speech block interface to a user, wherein the speech block interface displays, for each audio input data stream, a timeline of speech blocks for the audio input data stream; wherein, in the method the data stored on the computer readable medium is operable to configure the computer to perform; a) organizing qualified audio into speech blocks comprises; i) adding identified qualified audio to an existing speech block if and only if the identified qualified audio is on the same audio input data stream as the existing speech block and either; A) the identified qualified audio is separated from an end time for the existing speech block by no more than a first intra-block duration threshold and the existing speech block is subject to being, but has not been, discarded;
orB) the identified qualified audio is separated from the end time for the existing speech block by no more than a second intra-block duration threshold and the existing speech block is not subject to being discarded; ii) creating a new speech block which is subject to being, but has not been, discarded, with the identified qualified audio if and only if the identified qualified audio is not added to the existing speech block and there are no other speech blocks between the identified qualified audio and the existing speech block on the same audio input data stream as the identified qualified audio; and b) the first intra-block duration threshold is different from the second intra-block duration threshold. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
Specification