Disambiguation and tagging of entities
First Claim
1. A method comprising:
- identifying, at a processor, a candidate entity in a media content item, wherein the candidate entity is a potential identification of a first known entity, wherein the media content item comprises a plurality of words or phrases and wherein the candidate entity corresponds to a first word or first phrase of the plurality of words or phrases;
performing a first categorization of the candidate entity based on a first set of factors, wherein the first categorization comprises categorizing at least one other candidate entity following the candidate entity in the media content item according to a sequence in which the media content item is to be consumed by a user, wherein the at least one other candidate entity corresponds to a second word or second phrase of the plurality of words or phrases;
performing a second categorization of the candidate entity, after the first categorization, based on a second set of factors, wherein the second set of factors is different from the first set of factors and comprises, at least in part, results of the first categorization associated with the at least one other candidate entity;
determining, after the second categorization, that the candidate entity is categorized with a plurality of known entities;
in response to determining that the candidate entity is categorized with the plurality of known entities, selecting one of the plurality of known entities corresponding to the candidate entity, andtagging the candidate entity, based on the selection of the one of the plurality of known entities and separately from tagging the at least one other candidate entity, as identifying the selected one of the plurality of known entities.
1 Assignment
0 Petitions
Accused Products
Abstract
Tagging of content items and entities identified therein may include a matching process, a classification process and a disambiguation process. Matching may include the identification of potential matching candidate entities in a content item whereas the classification process may categorize or group identified candidate entities according to known entities to which they are likely a match. In some instances, a candidate entity may be categorized with multiple known entities. Accordingly, a disambiguation process may be used to reduce the potential matches to a single known entity. In one example, the disambiguation process may include ranking potentially matching known entities according to a hierarchy of criteria.
-
Citations
21 Claims
-
1. A method comprising:
-
identifying, at a processor, a candidate entity in a media content item, wherein the candidate entity is a potential identification of a first known entity, wherein the media content item comprises a plurality of words or phrases and wherein the candidate entity corresponds to a first word or first phrase of the plurality of words or phrases; performing a first categorization of the candidate entity based on a first set of factors, wherein the first categorization comprises categorizing at least one other candidate entity following the candidate entity in the media content item according to a sequence in which the media content item is to be consumed by a user, wherein the at least one other candidate entity corresponds to a second word or second phrase of the plurality of words or phrases; performing a second categorization of the candidate entity, after the first categorization, based on a second set of factors, wherein the second set of factors is different from the first set of factors and comprises, at least in part, results of the first categorization associated with the at least one other candidate entity; determining, after the second categorization, that the candidate entity is categorized with a plurality of known entities; in response to determining that the candidate entity is categorized with the plurality of known entities, selecting one of the plurality of known entities corresponding to the candidate entity, and tagging the candidate entity, based on the selection of the one of the plurality of known entities and separately from tagging the at least one other candidate entity, as identifying the selected one of the plurality of known entities. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. One or more non-transitory computer readable media storing computer readable instructions that, when executed, cause an apparatus to:
-
identify, at the apparatus, a candidate entity in a media content item, wherein the candidate entity is a potential identification of a first known entity wherein the media content item comprises a plurality of words or phrases and wherein the candidate entity corresponds to a first word or first phrase of the plurality of words or phrases; perform a first categorization of the candidate entity based on a first set of factors, wherein the first categorization comprises categorizing at least one other candidate entity following the candidate entity in the media content item according to a sequence in which the media content is configured to be consumed by a user, wherein the at least one other candidate entity corresponds to a second word or second phrase of the plurality of words or phrases; perform a second categorization of the candidate entity after the first categorization based on a second set of factors, wherein the second set of factors is different from the first set of factors and comprises at least in part, results of the first categorization associated with the at least one other candidate entity; determine, after the second categorization, that the candidate entity is categorized with a plurality of known entities; in response to determining that the candidate entity is categorized with a plurality of known entities, select one of the plurality of known entities corresponding to the candidate entity; and tag the candidate entity, based on the selection of the one of the plurality of known entities and separately from tagging the at least one other candidate entity, as identifying the selected one of the plurality of known entities. - View Dependent Claims (16, 17)
-
-
18. A method comprising:
-
receiving a sequence of words or phrases occurring in a piece of video, audio, audiovisual, or textual content; sequentially processing the sequence of words or phrases in a first categorization process, wherein the first categorization process includes categorizing each of a first word or phrase and a second word or phrase in the sequence as identifying one or more corresponding known entities, wherein the second word or phrase follows the first word or phrase in the sequence, and wherein categorizing the first word or phrase during the first categorization process is performed without using a categorization of the second word or phrase; processing the sequence of words or phrases in a second categorization process including performing another categorization of the first word or phrase using the categorization of the second word or phrase resulting from the first categorization process; upon determining that the first word or phrase is categorized with a plurality of known entities, selecting one of the plurality of known entities as being identified by the first word or phrase; and tagging the first word or phrase as identifying the selected one of the plurality of known entities.
-
-
19. An apparatus comprising:
-
a processor; and memory storing computer readable instructions that, when executed, cause the apparatus to; identify a candidate entity in a media content item, wherein the candidate entity is a potential identification of a first known entity, wherein the media content item comprises a plurality of words or phrases and wherein the candidate entity corresponds to a first word or first phrase of the plurality of words or phrases; perform a first categorization of the candidate entity based on a first set of factors, wherein the first categorization comprises categorizing at least one other candidate entity following the candidate entity in the media content item according to a sequence in which the media content is configured to be consumed by a user, wherein the at least one other candidate entity corresponds to a second word or second phrase of the plurality of words or phrases; perform a second categorization of the candidate entity, after the first categorization, based on a second set of factors, wherein the second set of factors is different from the first set of factors and comprises, at least in part, results of the first categorization associated with the at least one other candidate entity; determine, after the second categorization, that the candidate entity is categorized with a plurality of known entities; in response to determining that the candidate entity is categorized with the plurality of known entities, select one of the plurality of known entities corresponding to the candidate entity; and tag the candidate entity, based on the selection of the one of the plurality of known entities and separately from tagging the at least one other candidate entity, as identifying the selected one of the plurality of known entities.
-
-
20. An apparatus comprising:
-
a processor; and memory storing computer readable instructions that, when executed, cause the apparatus to; receive a sequence of words or phrases occurring in a piece of video, audio, audiovisual, or textual content; sequentially process the sequence of words or phrases in a first categorization process, wherein the first categorization process includes categorizing each of a first word or phrase and a second word or phrase in the sequence as identifying one or more corresponding known entities, wherein the second word or phrase follows the first word or phrase in the sequence, and wherein categorizing the first word or phrase during the first categorization process is performed without using a categorization of the second word or phrase; process the sequence of words or phrases in a second categorization process including performing another categorization of the first word or phrase using the categorization of the second word or phrase resulting from the first categorization process; upon determining the first word or phrase is categorized with a plurality of known entities, select one of the plurality of known entities as being identified by the first word or phrase; and tag the first word or phrase as identifying the selected one of the plurality of known entities.
-
-
21. One or more non-transitory computer readable media storing computer readable instructions that, when executed, cause an apparatus to:
-
receive a sequence of words or phrases occurring in a piece of video, audio, audio visual or textual content; sequentially process the sequence of words or phrases in a first categorization process, wherein the first categorization process includes categorizing each of a first word or phrase and a second word or phrase in the sequence as identifying one or more corresponding known entities, wherein the second word or phrase follows the first word or phrase in the sequence, and wherein categorizing the first word or phrase during the first categorization process is performed without using a categorization of the second word or phrase; process the sequence of words or phrases in a second categorization process including performing another categorization of the first word or phrase using the categorization of the second word or phrase resulting from the first categorization process; upon determining that the first word or phrase is categorized with a plurality of known entities, select one of the plurality of known entities as being identified by the first word or phrase; and tag the first word or phrase as identifying the selected one of the plurality of known entities.
-
Specification