Information Processing Device, Information Processing Method and Program
Abstract
The present invention relates to an information processing device, an information processing method, and a program capable of easily adding an annotation to content.
A feature amount extracting unit 21 extracts an image feature amount of each frame of an image of learning content and extracts word frequency information regarding frequency of appearance of each word in a description text describing a content of the image of the learning content (for example, a text of a caption) as a text feature amount of the description text. A model learning unit 22 learns an annotation model, which is a multi-stream HMM, by using an annotation sequence for annotation, which is a multi-stream including the image feature amount of each frame and the text feature amount. The present invention may be applied when adding the annotation to the content such as a television broadcast program, for example.
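The feature extraction described above can be sketched as follows. This is a minimal illustration, not the implementation disclosed in the specification: the function names `word_frequency` and `build_annotation_sequence`, the whitespace tokenization, the fixed vocabulary, the caption time window, and the placeholder image features are all assumptions introduced here. It pairs each frame's image feature with the word frequencies of the captions displayed near that frame, giving a two-stream sequence of the kind the abstract calls an annotation sequence.

```python
from collections import Counter


def word_frequency(text, vocabulary):
    """Return the relative frequency of each vocabulary word in `text`.

    Tokenization is a plain lowercase whitespace split (an assumption);
    words outside the vocabulary are ignored.
    """
    tokens = text.lower().split()
    counts = Counter(t for t in tokens if t in vocabulary)
    total = sum(counts.values())
    if total == 0:
        # No vocabulary word appears near this frame.
        return [0.0] * len(vocabulary)
    return [counts[w] / total for w in vocabulary]


def build_annotation_sequence(frame_features, captions, vocabulary,
                              window=2.0, fps=1.0):
    """Pair each frame's image feature with a text feature.

    frame_features: one image feature vector per frame (placeholder values).
    captions: list of (time_in_seconds, caption_text) tuples.
    window: captions within `window` seconds of a frame count as its
            description text (a hypothetical choice for this sketch).
    """
    sequence = []
    for index, image_feature in enumerate(frame_features):
        frame_time = index / fps
        nearby_text = " ".join(
            text for (time, text) in captions
            if abs(time - frame_time) <= window
        )
        text_feature = word_frequency(nearby_text, vocabulary)
        sequence.append((image_feature, text_feature))
    return sequence


if __name__ == "__main__":
    vocabulary = ["goal", "rain"]
    frames = [[0.1], [0.2], [0.3]]
    captions = [(0.0, "Goal goal rain"), (2.0, "rain")]
    for image_feature, text_feature in build_annotation_sequence(
            frames, captions, vocabulary, window=0.5):
        print(image_feature, text_feature)
```

Each element of the returned list holds one observation per stream for a single frame, which is the shape a two-stream model would consume during learning.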
20 Claims
1. An information processing device, comprising:
feature amount extracting means for extracting an image feature amount of each frame of an image of learning content and extracting word frequency information regarding frequency of appearance of each word in a description text describing a content of the image of the learning content as a text feature amount of the description text; and
model learning means for learning an annotation model, which is a multi-stream HMM (hidden Markov model), by using an annotation sequence for annotation, which is a multi-stream including the image feature amount and the text feature amount.
19. An information processing method to be performed by an information processing device, comprising the steps of:
extracting an image feature amount of each frame of an image of learning content and extracting word frequency information regarding frequency of appearance of each word in a description text describing a content of the image of the learning content as a text feature amount of the description text; and
learning an annotation model, which is a multi-stream HMM (hidden Markov model), by using an annotation sequence for annotation, which is a multi-stream including the image feature amount and the text feature amount.
20. A program for allowing a computer to function as
feature amount extracting means for extracting an image feature amount of each frame of an image of learning content and extracting word frequency information regarding frequency of appearance of each word in a description text describing a content of the image of the learning content as a text feature amount of the description text; and
model learning means for learning an annotation model, which is a multi-stream HMM (hidden Markov model), by using an annotation sequence for annotation, which is a multi-stream including the image feature amount and the text feature amount.
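The claims name a multi-stream HMM but do not state how a state scores an observation that carries several streams. One common formulation, assumed here and not taken from the claims, multiplies the per-stream observation likelihoods, each raised to a stream weight; in log form this is a weighted sum. The sketch below uses hypothetical names (`stream_log_likelihood`, `multistream_log_likelihood`), treats the image stream as a one-hot codebook symbol, treats the text stream as a word frequency vector scored under a multinomial-style distribution, and assumes every observed component has a strictly positive probability.

```python
import math


def stream_log_likelihood(observation, distribution):
    """Log-likelihood of one stream's observation for one HMM state.

    Computes sum_k observation[k] * log(distribution[k]) over the
    components that were actually observed (observation[k] > 0).
    """
    return sum(
        o * math.log(p)
        for o, p in zip(observation, distribution)
        if o > 0
    )


def multistream_log_likelihood(observations, distributions, weights):
    """Weighted sum of per-stream log-likelihoods for one state.

    Equivalent to log of prod_m b_m(o_m) ** weights[m], where b_m is the
    state's observation likelihood for stream m.
    """
    return sum(
        w * stream_log_likelihood(o, d)
        for o, d, w in zip(observations, distributions, weights)
    )


if __name__ == "__main__":
    image_obs = [0, 1, 0]             # one-hot codebook symbol (assumed)
    text_obs = [0.5, 0.5]             # word frequencies for two words
    image_dist = [0.2, 0.5, 0.3]      # state's image-stream distribution
    text_dist = [0.25, 0.75]          # state's text-stream distribution
    score = multistream_log_likelihood(
        [image_obs, text_obs], [image_dist, text_dist], [0.5, 0.5]
    )
    print(score)
```

The stream weights control how strongly each stream influences the state score; setting a weight to zero makes the model ignore that stream, which is useful when a description text is missing for some content.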
Specification