Speaker identification method, speaker identification apparatus, and information management method
Abstract
The speaker identification system has a voice acquisition unit that acquires voice information of a speaker, and a database management unit. The database management unit determines whether or not the speaker corresponding to the acquired voice information matches a speaker corresponding to registered voice information stored in a database in connection with content information on a content. In a case where it is determined that the speakers match, the database management unit acquires content information on the content displayed on a device at the time of acquisition of the voice information and stores the acquired content information in connection with the registered voice information. In a case where it is determined that the speakers do not match, the database management unit stores the acquired voice information in the database as registered voice information.
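The two branches described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: voice information is modeled as a plain feature vector, matching is done by cosine similarity against a hypothetical threshold of 0.9, and the names `RegisteredVoice`, `find_match`, and `handle_voice` are invented.

```python
import math
from dataclasses import dataclass, field


@dataclass
class RegisteredVoice:
    """One registered speaker: a feature vector plus the content information linked to it."""
    features: list
    content_history: list = field(default_factory=list)


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either vector is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_match(features, voice_db, threshold=0.9):
    """Return the best-scoring registered voice at or above the threshold, else None.

    The threshold value is an assumption; the source does not specify a matching rule.
    """
    best, best_score = None, threshold
    for entry in voice_db:
        score = cosine_similarity(features, entry.features)
        if score >= best_score:
            best, best_score = entry, score
    return best


def handle_voice(features, current_content, voice_db):
    """Abstract-level flow: link content on a match, otherwise register the new voice."""
    match = find_match(features, voice_db)
    if match is not None:
        match.content_history.append(current_content)
        return "matched"
    voice_db.append(RegisteredVoice(features=list(features)))
    return "registered"
```

Note that this follows the abstract, which registers an unmatched voice directly. The claims instead hold unmatched voice information in a separate internal memory before any registration.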
12 Claims
1. A speaker identification method for identifying a speaker in the vicinity of a device displaying a content, the method comprising the steps of:
displaying a content on the device;
acquiring voice information of the speaker during display of the content on the device;
determining whether or not the speaker corresponding to the acquired voice information matches a speaker corresponding to registered voice information stored in a voice database in connection with content information on a content, the content information including a name of a cast member appearing in the content, and the registered voice information being voice information of a member belonging to a predetermined group;
in a case where it is determined that the speaker corresponding to the acquired voice information matches the speaker corresponding to the registered voice information stored in the voice database, acquiring the content information on the content displayed on the device from a content database for storing information to identify the content, and the content information in connection with each other at the time of acquisition of the voice information, and storing the acquired content information in connection with the registered voice information;
in a case where it is determined that the speaker corresponding to the acquired voice information does not match the speaker corresponding to the registered voice information stored in the voice database, compiling the acquired voice information in an internal memory which is different from the voice database without updating the voice database at the time of the determining;
identifying a speaker of a plurality of pieces of voice information compiled in the internal memory for a predetermined period of time, and extracting, among the plurality of pieces of voice information, two or more pieces of voice information identified as corresponding to a same speaker;
counting the number of the extracted pieces of voice information by the same speaker, and storing one among the extracted pieces of voice information in the voice database as registered voice information of a new member belonging to the predetermined group in a case where the counted number indicates a predetermined number or more; and
in a case where it is determined that the speaker corresponding to the acquired voice information matches the speaker corresponding to the registered voice information stored in the voice database, extracting the name of the cast member from the content information linked to the speaker corresponding to the acquired voice information, and referencing a service database in which names of cast members are stored in connection with services to be provided to speakers, thereby specifying a service associated with the name of the cast member as a candidate for a service to be provided.
(Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10)
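The deferred registration steps of claim 1 (compile unmatched voice information in an internal memory, group it by speaker over a period of time, count, then register one sample) can be sketched as follows. This is an illustrative reading, not the patented implementation. The function name `enroll_from_buffer`, the `(timestamp, features)` tuple layout, the greedy grouping strategy, and the caller-supplied `same_speaker` predicate are all assumptions.

```python
def enroll_from_buffer(buffer, now, window_seconds, min_count, same_speaker):
    """Pick voice samples to promote from the internal memory to the voice database.

    buffer: list of (timestamp, features) tuples that failed to match a registered voice.
    same_speaker: hypothetical callable deciding whether two feature sets share a speaker.
    Returns one representative sample per speaker heard at least min_count times
    within the last window_seconds.
    """
    # Keep only samples compiled during the predetermined period of time.
    recent = [(t, f) for t, f in buffer if now - t <= window_seconds]

    # Greedy grouping: each sample joins the first group whose first sample it matches.
    groups = []
    for sample in recent:
        for group in groups:
            if same_speaker(group[0][1], sample[1]):
                group.append(sample)
                break
        else:
            groups.append([sample])

    new_members = []
    for group in groups:
        # The claim extracts "two or more" samples per speaker, then applies the count threshold.
        if len(group) >= 2 and len(group) >= min_count:
            new_members.append(group[0][1])  # store one of the extracted samples
    return new_members
```

Under this reading, a voice heard only once or twice in the window (a visitor, for example) never reaches the voice database, because the database is not updated at the time of the determination.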
11. A speaker identification apparatus for identifying a speaker, comprising:
a display unit that displays a content;
a voice acquisition unit that acquires voice information of a speaker in the vicinity of the speaker identification apparatus during display of the content on the display unit;
a voice database for storing registered voice information in connection with content information on a content, the content information including a name of a cast member appearing in the content, and the registered voice information being voice information of a member belonging to a predetermined group;
a content database for storing information to identify the content, and the content information in connection with each other;
a determination unit that determines whether or not the speaker corresponding to the voice information acquired by the voice acquisition unit matches a speaker corresponding to the registered voice information stored in the voice database in connection with the content information;
a database update unit that acquires the content information on the content displayed on the display unit from the content database at the time of acquisition of the voice information and stores the acquired content information in connection with the registered voice information, in a case where the determination unit determines that the speaker corresponding to the acquired voice information matches the speaker corresponding to the registered voice information stored in the voice database;
a compilation unit that compiles the acquired voice information in an internal memory which is different from the voice database in a case where the determination unit determines that the speaker corresponding to the acquired voice information does not match the speaker corresponding to the registered voice information stored in the voice database, without updating the voice database at the time of the determination by the determination unit;
an extraction unit that identifies a speaker of a plurality of pieces of voice information compiled in the memory for a predetermined period of time, and extracts, among the plurality of pieces of voice information, two or more pieces of voice information identified as corresponding to a same speaker;
a database storage unit that counts the number of the extracted pieces of voice information by the same speaker, and stores one among the extracted pieces of voice information in the voice database as registered voice information of a new member belonging to the predetermined group in a case where the counted number indicates a predetermined number or more; and
a specification unit that extracts the name of the cast member from the content information linked to the speaker corresponding to the acquired voice information, and references a service database in which names of cast members are stored in connection with services to be provided to speakers, thereby specifying a service associated with the name of the cast member as a candidate for a service to be provided.
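The role of the specification unit can be illustrated with a short sketch. It assumes, purely for illustration, that each piece of content information is a dict with a `"cast"` list and that the service database is a dict mapping a cast member name to a list of services. The function name `candidate_services` and these data shapes are hypothetical.

```python
def candidate_services(linked_content, service_db):
    """Collect candidate services for a matched speaker.

    linked_content: content-information dicts linked to the speaker's registered voice,
                    each with a "cast" list of cast member names (assumed layout).
    service_db: dict mapping a cast member name to a list of services (assumed layout).
    Returns services in first-seen order without duplicates.
    """
    candidates = []
    seen = set()
    for content in linked_content:
        for name in content.get("cast", []):
            for service in service_db.get(name, []):
                if service not in seen:
                    seen.add(service)
                    candidates.append(service)
    return candidates
```

The result is only a list of candidates. How one of them is finally chosen and provided is outside what this claim recites.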
12. An information management method of a speaker identification system for identifying a speaker in the vicinity of a device displaying a content, the method comprising the steps of:
displaying a content on the device;
receiving voice information of the speaker, the voice information being acquired during display of the content on the device;
determining whether or not the speaker corresponding to the received voice information matches a speaker corresponding to registered voice information stored in a voice database in connection with content information on a content, the content information including a name of a cast member appearing in the content, and the registered voice information being voice information of a member belonging to a predetermined group;
in a case where it is determined that the speaker corresponding to the received voice information matches the speaker corresponding to the registered voice information stored in the voice database, acquiring the content information on the content displayed on the device from a content database for storing information to identify the content, and the content information in connection with each other at the time of acquisition of the voice information, and storing the received content information in connection with the registered voice information;
in a case where it is determined that the speaker corresponding to the received voice information does not match the speaker corresponding to the registered voice information stored in the voice database, compiling the acquired voice information in an internal memory which is different from the voice database without updating the voice database at the time of the determining;
identifying a speaker of a plurality of pieces of voice information compiled in the memory for a predetermined period of time, and extracting, among the plurality of pieces of voice information, two or more pieces of voice information identified as corresponding to a same speaker;
counting the number of the extracted pieces of voice information by the same speaker, and storing one among the extracted pieces of voice information in the voice database as registered voice information of a new member belonging to the predetermined group in a case where the counted number indicates a predetermined number or more; and
in a case where it is determined that the speaker corresponding to the acquired voice information matches the speaker corresponding to the registered voice information stored in the voice database, extracting the name of the cast member from the content information linked to the speaker corresponding to the acquired voice information, and referencing a service database in which names of cast members are stored in connection with services to be provided to speakers, thereby specifying a service associated with the name of the cast member as a candidate for a service to be provided.
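The content database step (resolving what was on screen when the voice was acquired, then linking it to the matched voice) might look like the sketch below. The claim says only that the content database stores "information to identify the content" together with the content information. Treating that identifying information as a channel plus a broadcast time range is an assumption made here, as are the names `ContentRecord`, `lookup_content`, and `link_content`.

```python
from dataclasses import dataclass


@dataclass
class ContentRecord:
    """Assumed content database row: identifying information plus content information."""
    channel: str
    start: int  # broadcast start, seconds (inclusive)
    end: int    # broadcast end, seconds (exclusive)
    info: dict  # content information, e.g. title and cast member names


def lookup_content(content_db, channel, acquired_at):
    """Return the content information shown on `channel` at time `acquired_at`, or None."""
    for record in content_db:
        if record.channel == channel and record.start <= acquired_at < record.end:
            return record.info
    return None


def link_content(voice_entry, content_db, channel, acquired_at):
    """Store the looked-up content information in connection with a matched voice entry.

    voice_entry is a plain dict standing in for one row of the voice database.
    """
    info = lookup_content(content_db, channel, acquired_at)
    if info is not None:
        voice_entry.setdefault("content_history", []).append(info)
    return info
```

Keying the lookup on the acquisition time is what ties a voice sample to the content displayed at that moment, rather than to whatever is displayed when the database is later updated.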
Specification