Communicating metadata that identifies a current speaker
First Claim
1. A computer system for communicating metadata that identifies a current speaker, the computer system comprising:
a processor configured to execute computer-executable instructions; and
memory storing one or more computer-executable instructions that, when executed by the processor, perform operations including:
receive audio data that represents speech of the current speaker;
generate an audio fingerprint of the current speaker based on the audio data;
perform automated speaker recognition including comparing the audio fingerprint of the current speaker against one or more stored audio fingerprints contained in a speaker fingerprint repository;
communicate data indicating that the current speaker is unrecognized to a first client device of an observer;
receive tagging information that identifies the current speaker from the first client device of the observer;
store the audio fingerprint of the current speaker and metadata that identifies the current speaker in the speaker fingerprint repository, the metadata being at least partly based on the tagging information;
communicate the metadata that identifies the current speaker to at least one of the first client device of the observer or a second client device of a different observer;
receive a request that identifies a particular speaker from at least one of the first client device of the observer or the second client device of the different observer; and
communicate an alert to at least one of the first client device of the observer or the second client device of the different observer when the particular speaker is currently speaking.
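The recognition and tagging operations recited above can be illustrated with a minimal sketch. It assumes an audio fingerprint is already available as a plain numeric vector and compares it with stored fingerprints by cosine similarity against a fixed threshold. The claim does not specify a comparison technique, so the similarity measure, the threshold value, and the names `FingerprintRepository`, `recognize`, and `store` are all hypothetical.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class FingerprintRepository:
    """Hypothetical stand-in for the speaker fingerprint repository."""
    threshold: float = 0.9  # assumed match threshold, not taken from the claim
    entries: list = field(default_factory=list)  # (fingerprint, metadata) pairs

    def recognize(self, fingerprint):
        """Return metadata of the best match, or None if the speaker is unrecognized."""
        best_score, best_metadata = 0.0, None
        for stored, metadata in self.entries:
            score = cosine_similarity(fingerprint, stored)
            if score > best_score:
                best_score, best_metadata = score, metadata
        return best_metadata if best_score >= self.threshold else None

    def store(self, fingerprint, tagging_info):
        """Store a fingerprint with metadata derived from an observer's tagging information."""
        metadata = {"name": tagging_info["name"]}
        self.entries.append((list(fingerprint), metadata))
        return metadata


repo = FingerprintRepository()
current = [0.2, 0.9, 0.4]
if repo.recognize(current) is None:
    # Unrecognized speaker: an observer's client device would supply the tag.
    repo.store(current, {"name": "Alice"})
# A later, slightly different fingerprint of the same voice now resolves to metadata.
later_metadata = repo.recognize([0.21, 0.88, 0.41])
```

In this sketch, a `None` result from `recognize` corresponds to the "unrecognized" notice sent to the first client device, and the value returned by `store` is the metadata that would then be communicated to observers.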
Abstract
A computer system may communicate metadata that identifies a current speaker. The computer system may receive audio data that represents speech of the current speaker, generate an audio fingerprint of the current speaker based on the audio data, and perform automated speaker recognition by comparing the audio fingerprint of the current speaker against stored audio fingerprints contained in a speaker fingerprint repository. The computer system may communicate data indicating that the current speaker is unrecognized to a client device of an observer and receive tagging information that identifies the current speaker from the client device of the observer. The computer system may store the audio fingerprint of the current speaker and metadata that identifies the current speaker in the speaker fingerprint repository and communicate the metadata that identifies the current speaker to at least one of the client device of the observer or a client device of a different observer.
20 Claims
1. A computer system for communicating metadata that identifies a current speaker, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
14. A computer-implemented method for communicating metadata that identifies a current speaker performed by a computer system including one or more computing devices, the computer-implemented method comprising:
generating an audio fingerprint of the current speaker based on audio data that represents speech of the current speaker;
performing automated speaker recognition based on the audio fingerprint of the current speaker and one or more stored audio fingerprints;
receiving tagging information that identifies the current speaker from a first client device of an observer when the current speaker is unrecognized;
storing the audio fingerprint of the current speaker and metadata that identifies the current speaker, the metadata being at least partly based on the tagging information;
communicating the metadata that identifies the current speaker to at least one of the first client device of the observer or a second client device of a different observer;
receive a request that identifies a particular speaker from the first client device of the observer or the second client device of the different observer; and
communicate an alert to at least one of the first client device of the observer or the second client device of the different observer when the particular speaker is currently speaking.
Dependent claims: 15, 16, 17
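The final two steps of the method (a client requests a particular speaker, then receives an alert when that speaker is talking) can be sketched as a small subscription registry. This is only an illustration under stated assumptions: the class `AlertRegistry`, its method names, the client identifiers, and the alert message text are hypothetical and do not come from the claim.

```python
from collections import defaultdict


class AlertRegistry:
    """Hypothetical registry mapping a requested speaker to the clients to alert."""

    def __init__(self):
        self._watchers = defaultdict(set)  # speaker name -> set of client ids

    def request_alert(self, client_id, speaker_name):
        """Record a client's request that identifies a particular speaker."""
        self._watchers[speaker_name].add(client_id)

    def on_current_speaker(self, speaker_name):
        """Return (client_id, message) alerts for everyone watching this speaker."""
        clients = self._watchers.get(speaker_name, ())
        return [(c, f"{speaker_name} is currently speaking") for c in sorted(clients)]


registry = AlertRegistry()
registry.request_alert("observer-2", "Alice")
registry.request_alert("observer-1", "Alice")

# Called whenever recognition (or tagging) resolves the current speaker.
alerts = registry.on_current_speaker("Alice")
no_alerts = registry.on_current_speaker("Bob")  # nobody asked about Bob
```

Keying the registry by speaker makes the lookup at recognition time a single dictionary access, which suits a design where the current speaker changes often and requests are comparatively rare.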
18. A computer-readable storage medium storing computer-executable instructions that, when executed by a computing device, cause the computing device to implement:
a speaker recognition component configured to generate an audio fingerprint of the current speaker based on audio data that represents speech of the current speaker and perform automated speaker recognition by comparing the audio fingerprint of the current speaker against stored audio fingerprints;
a tagging component configured to receive tagging information that identifies the current speaker from a first client device of an observer when the automated speaker recognition is unsuccessful and store the audio fingerprint of the current speaker with the stored audio fingerprints;
an audio data enrichment component configured to communicate metadata that identifies the current speaker to the first client device of the observer or a second client device of a different observer, the metadata being at least partly based on the tagging information; and
an alert component configured to receive a request that identifies a particular speaker from at least one of the first client device of the observer or the second client device of the different observer, and communicate an alert to at least one of the first client device of the observer or the second client device of the different observer when the particular speaker is currently speaking.
Dependent claims: 19, 20
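The four components named in this claim can be sketched as separate classes to show how responsibilities might be divided. This is an illustrative decomposition only. The function `toy_fingerprint` is a placeholder feature extractor (mean magnitude and zero-crossing rate), not a real voiceprint algorithm. The tolerance value, the class and method names, and the in-memory `outbox` that stands in for network delivery are all assumptions.

```python
from dataclasses import dataclass, field


def toy_fingerprint(samples):
    """Placeholder fingerprint (assumption): mean magnitude and zero-crossing rate."""
    if not samples:
        raise ValueError("audio data must contain at least one sample")
    mean_mag = sum(abs(s) for s in samples) / len(samples)
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return (mean_mag, crossings / len(samples))


@dataclass
class SpeakerRecognitionComponent:
    stored: list = field(default_factory=list)  # (fingerprint, metadata) pairs
    tolerance: float = 0.05  # assumed per-feature tolerance

    def recognize(self, samples):
        """Return (fingerprint, metadata); metadata is None when recognition fails."""
        fp = toy_fingerprint(samples)
        for known, metadata in self.stored:
            if all(abs(x - y) <= self.tolerance for x, y in zip(fp, known)):
                return fp, metadata
        return fp, None


@dataclass
class TaggingComponent:
    recognizer: SpeakerRecognitionComponent

    def tag(self, fingerprint, tagging_info):
        """Store the fingerprint alongside metadata built from an observer's tag."""
        metadata = {"name": tagging_info["name"]}
        self.recognizer.stored.append((fingerprint, metadata))
        return metadata


@dataclass
class AudioDataEnrichmentComponent:
    outbox: list = field(default_factory=list)  # stands in for network delivery

    def send_metadata(self, client_ids, metadata):
        for client_id in client_ids:
            self.outbox.append((client_id, metadata))


@dataclass
class AlertComponent:
    requests: dict = field(default_factory=dict)  # speaker name -> set of client ids

    def request(self, client_id, speaker_name):
        self.requests.setdefault(speaker_name, set()).add(client_id)

    def clients_to_alert(self, speaker_name):
        return sorted(self.requests.get(speaker_name, ()))


recognizer = SpeakerRecognitionComponent()
tagger = TaggingComponent(recognizer)
enricher = AudioDataEnrichmentComponent()
alerter = AlertComponent()

samples = [0.5, -0.5, 0.5, -0.5]
fp, metadata = recognizer.recognize(samples)       # unsuccessful: nothing stored yet
if metadata is None:
    metadata = tagger.tag(fp, {"name": "Alice"})   # tag supplied by observer-1
enricher.send_metadata(["observer-1", "observer-2"], metadata)
alerter.request("observer-2", "Alice")
_, recognized = recognizer.recognize(samples)      # now resolves to stored metadata
to_alert = alerter.clients_to_alert(recognized["name"])
```

Giving the tagging component a reference to the recognizer's store mirrors the claim language, in which the tagged fingerprint is stored "with the stored audio fingerprints" that recognition compares against.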
Specification