Systems and methods for providing interactive speaker identification training
First Claim
1. A speaker identification system, comprising:
- an indexer configured to;
generate a plurality of speaker models, receive a plurality of audio segments, and identify speakers corresponding to the audio segments based on the speaker models, the indexer being unable to correctly identify at least one of the speakers, as an unidentified speaker, corresponding to the audio segments; and
a server configured to;
receive, from a user, the name of the unidentified speaker, and provide the name of the unidentified speaker to the indexer for identification of the unidentified speaker in subsequent audio segments.
5 Assignments
0 Petitions
Accused Products
Abstract
A system (100) provides speaker identification training. The system (100) generates speaker models and receives audio segments. The system (100) identifies speakers corresponding to the audio segments based on the speaker models. At least one of the audio segments has an unidentified or misidentified speaker (i.e., an audio segment whose speaker cannot be accurately identified). The system (100) presents, to a user, audio segments that include an audio segment whose speaker is unidentified or misidentified and receives, from the user, the name of the unidentified or misidentified speaker. The system (100) may use this information to subsequently identify the unidentified or misidentified speaker by name for future audio segments.
115 Citations
26 Claims
-
1. A speaker identification system, comprising:
an indexer configured to;
generate a plurality of speaker models, receive a plurality of audio segments, and identify speakers corresponding to the audio segments based on the speaker models, the indexer being unable to correctly identify at least one of the speakers, as an unidentified speaker, corresponding to the audio segments; and
a server configured to;
receive, from a user, the name of the unidentified speaker, and provide the name of the unidentified speaker to the indexer for identification of the unidentified speaker in subsequent audio segments. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
-
13. A speaker identification system, comprising:
-
means for generating a plurality of speaker models;
means for receiving a plurality of audio segments;
means for identifying speakers corresponding to the audio segments based on the speaker models, at least one of the audio segments being associated with an unidentified or misidentified speaker;
means for labeling the audio segments with names of the speakers that can be identified;
means for presenting a plurality of the audio segments, including the at least one of the audio segments, with the labels to a user;
means for receiving, from the user, the name of the unidentified or misidentified speaker; and
means for identifying the unidentified or misidentified speaker by name in future audio segments.
-
-
14. A method for providing speaker identification training, comprising:
-
generating a plurality of speaker models;
receiving a plurality of audio segments;
identifying speakers corresponding to the audio segments based on the speaker models, at least one of the audio segments being associated with an unidentified or misidentified speaker;
presenting a plurality of the audio segments, including the at least one of the audio segments, to a user;
receiving, from the user, the name of the unidentified or misidentified speaker; and
identifying the unidentified or misidentified speaker by name for future audio segments. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
-
-
25. A computer-readable medium that stores instructions executable by one or more processors for speaker identification training by a speaker identification system, comprising:
-
instructions for generating a plurality of speaker models based on training data;
instructions for presenting, to a user, audio segments for which no speakers can be identified from the speaker models;
instructions for obtaining, from the user, a name of a speaker for at least one of the audio segments;
instructions for generating a new speaker model for the speaker based on the at least one of the audio segments; and
instructions for associating the name of the speaker with the new speaker model.
-
-
26. A speaker identification system, comprising:
an indexer configured to;
receive a plurality of speech segments, each of the speech segments being associated with a corresponding speaker, create a plurality of documents by transcribing the speech segments, identify names of the speakers corresponding to the speech segments, the indexer being unable to correctly identify names of at least one of the speakers corresponding to the speech segments, the speakers for which the indexer can correctly identify names being identified speakers and the speakers for which the indexer cannot correctly identify names being unidentified speakers;
a database configured to store the documents; and
a server configured to;
retrieve one or more of the documents from the database, present the one or more of the documents to a user, receive, from the user, a name for one of the unidentified speakers, and provide the name for the one of the unidentified speakers to the indexer for subsequent identification of speech segments from the one of the unidentified speakers.
Specification