Video object recognition device and recognition method, video annotation giving device and giving method, and program
Abstract
Visual feature information which is information representing a numerical value of a visual feature of an object and additional information which is information added to the object are stored in association with each other. Partial image data which is image data of a partial area of a video image is extracted. Visual feature information of the extracted partial image data is generated. The visual feature information of the extracted partial image data and visual feature information of an object which is stored are compared with each other to calculate a similarity therebetween. Based on the calculated similarity, an object contained in the video image data is identified. An annotation made up of additional information of the identified object is displayed in superposing relation to the video image on a display device.
101 Citations
29 Claims
1. A video image object recognizing apparatus comprising:
input means for inputting video image data and image capturing information which is information for determining an area where an image will be captured;
storage means for storing positional information which is information representing the position of an object and visual feature information which is information representing a numerical value of a visual feature of the object, that are connected to each other; and
object recognizing means for recognizing an object contained in a video image based on the input video image data;
wherein said object recognizing means comprises:
estimating means for estimating an area where an image will be captured based on the image capturing information;
matching means for matching the area where an image will be captured to a position represented by the positional information of the object stored in said storage means;
partial video image extracting means for extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
visual feature information setting means for generating visual feature information of the partial video image data;
similarity calculating means for comparing the visual feature information of the partial video image data and the visual feature information of the object stored in said storage means with each other to calculate a similarity therebetween; and
decision means for determining whether or not an object is present in the video image, based on the input video image data, which is based on the result of matching by said matching means and on the result of the calculated similarity.
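The estimating and matching means of claim 1 can be illustrated with a minimal sketch. The claim does not say what the image capturing information contains or how the area is shaped, so everything here is an assumption: the area is modeled as a flat two-dimensional sector defined by a hypothetical `CaptureInfo` record (camera position, heading, angle of view, maximum range), and `in_capture_area` is a hypothetical helper that tests whether a stored object position falls inside that sector.

```python
import math
from dataclasses import dataclass


@dataclass
class CaptureInfo:
    """Hypothetical image capturing information (fields are assumptions)."""
    x: float               # camera position, east coordinate
    y: float               # camera position, north coordinate
    heading_deg: float     # viewing direction, counterclockwise from the +x axis
    view_angle_deg: float  # full horizontal angle of view
    max_range: float       # farthest distance treated as capturable


def in_capture_area(info: CaptureInfo, obj_x: float, obj_y: float) -> bool:
    """Return True if the object position lies inside the estimated sector."""
    dx, dy = obj_x - info.x, obj_y - info.y
    distance = math.hypot(dx, dy)
    if distance > info.max_range:
        return False
    if distance == 0:
        return True  # object at the camera position counts as inside
    bearing = math.degrees(math.atan2(dy, dx))
    # Signed angular difference folded into the range [-180, 180).
    diff = (bearing - info.heading_deg + 180) % 360 - 180
    return abs(diff) <= info.view_angle_deg / 2


info = CaptureInfo(x=0, y=0, heading_deg=0, view_angle_deg=60, max_range=100)
print(in_capture_area(info, 50, 20))   # inside the 60-degree sector
print(in_capture_area(info, 50, 50))   # 45 degrees off axis, outside
```

Only objects that pass this positional test would need to go on to the visual comparison, which is why the decision means takes both results into account.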
2. A video image annotation applying apparatus comprising:
input means for inputting video image data and image capturing information which is information for determining an area where an image will be captured;
storage means for storing positional information which is information representing the position of an object, visual feature information which is information representing a numerical value of a visual feature of the object, and additional information which is information added to the object, that are connected to each other; and
object recognizing means for associating an object contained in a video image based on the input video image data with the additional information;
wherein said object recognizing means comprises:
estimating means for estimating an area where an image will be captured based on the image capturing information;
matching means for matching the area where an image will be captured to a position represented by the positional information of the object stored in said storage means;
partial video image extracting means for extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
visual feature information setting means for generating visual feature information of the partial video image data;
similarity calculating means for comparing the visual feature information of the partial video image data and the visual feature information of the object stored in said storage means with each other to calculate a similarity therebetween; and
decision means for identifying an object which is contained in the video image based on the input video image data, and which is based on the result of the matching by said matching means and the calculated similarity, and for associating the identified object and the additional information stored in said storage means with each other.
Dependent claims: 3 to 25
26. A method of recognizing a video image object, comprising the steps of:
inputting video image data and image capturing information which is information for determining an area where an image will be captured;
storing positional information which is information representing the position of an object and visual feature information which is information representing a numerical value of a visual feature of the object, in association with each other;
estimating the area where an image will be captured based on the image capturing information;
matching the area where an image will be captured to a position represented by the positional information of the object which is stored;
extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
generating visual feature information of the partial video image data;
comparing the visual feature information of the partial video image data and the stored visual feature information of the object to calculate a similarity therebetween; and
determining whether an image of an object is captured or not, based on the result of the matching and the calculated similarity.
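The extracting, generating, comparing, and determining steps of claim 26 can be sketched as follows. The claim does not name a particular visual feature or similarity measure, so this sketch assumes a normalized grayscale histogram as the feature and histogram intersection as the similarity; the function names, the bin count, and the 0.8 threshold are all hypothetical choices made for illustration.

```python
def extract_partial(image, top, left, height, width):
    """Cut a rectangular partial area out of an image given as rows of 0-255 values."""
    return [row[left:left + width] for row in image[top:top + height]]


def histogram_feature(pixels, bins=4):
    """Assumed visual feature: a normalized histogram of pixel intensities."""
    counts = [0.0] * bins
    total = 0
    for row in pixels:
        for value in row:
            counts[min(value * bins // 256, bins - 1)] += 1
            total += 1
    return [c / total for c in counts] if total else counts


def similarity(feature_a, feature_b):
    """Assumed similarity: histogram intersection, 1.0 for identical histograms."""
    return sum(min(a, b) for a, b in zip(feature_a, feature_b))


def object_present(position_matched, sim, threshold=0.8):
    """Decide from both the positional match and the calculated similarity."""
    return position_matched and sim >= threshold


image = [[0, 0, 255, 255],
         [0, 0, 255, 255]]
stored_feature = [1.0, 0.0, 0.0, 0.0]  # stored feature of a uniformly dark object

partial = extract_partial(image, top=0, left=0, height=2, width=2)
sim = similarity(histogram_feature(partial), stored_feature)
print(object_present(True, sim))   # dark partial area matches the stored feature
```

Passing the entire image instead of a partial area is also permitted by the claim; in this toy input the full-image histogram is half dark and half bright, so its similarity to the stored feature drops to 0.5 and the decision is negative.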
27. A method of applying a video image annotation, comprising the steps of:
inputting video image data and image capturing information which is information for determining an area where an image will be captured;
storing positional information which is information representing the position of an object, visual feature information which is information representing a numerical value of a visual feature of the object, and additional information which is information added to the object, in association with each other;
estimating the area where an image will be captured based on the image capturing information;
matching the area where an image will be captured to a position represented by the positional information of the object which is stored;
extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
generating visual feature information of the partial video image data;
comparing the visual feature information of the partial video image data and the stored visual feature information of the object to calculate a similarity therebetween; and
identifying an object which is contained in the video image, based on the result of the matching and the calculated similarity, and associating the identified object and the stored additional information with each other.
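The final identifying and associating step of claim 27 can be sketched in the same spirit. This is an illustration under stated assumptions, not the claimed method itself: `StoredObject` is a hypothetical record holding the three associated items (position, visual feature, additional information), the candidates are assumed to have already passed the positional matching step, and the identified object is taken to be the candidate with the highest histogram-intersection similarity at or above a hypothetical threshold.

```python
from dataclasses import dataclass


@dataclass
class StoredObject:
    """Hypothetical stored record; the three data fields are kept in association."""
    name: str
    position: tuple        # stored positional information
    feature: list          # stored visual feature information
    additional_info: str   # annotation text added to the object


def annotate(candidates, partial_feature, threshold=0.8):
    """Pick the best-matching candidate and return its name and annotation."""
    best, best_sim = None, threshold
    for obj in candidates:
        # Assumed similarity measure: histogram intersection.
        sim = sum(min(a, b) for a, b in zip(obj.feature, partial_feature))
        if sim >= best_sim:
            best, best_sim = obj, sim
    return (best.name, best.additional_info) if best else None


candidates = [
    StoredObject("tower", (10.0, 5.0), [0.5, 0.5], "Observation tower"),
    StoredObject("hall", (12.0, 6.0), [1.0, 0.0], "City hall"),
]
print(annotate(candidates, [0.9, 0.1]))  # ('hall', 'City hall')
```

Returning `None` when no candidate reaches the threshold corresponds to the case where no stored object is identified in the partial image, so no annotation is associated.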
28. A video image object recognizing program adapted to be installed in a video image object recognizing apparatus for determining whether an object which is stored is contained as a subject in video image data or not, said video image object recognizing program to enable a computer to perform a process comprising the steps of:
storing, in a storage device, positional information which is information representing the position of an object and visual feature information which is information representing a numerical value of a visual feature of the object, in association with each other;
estimating an area where an image will be captured based on image capturing information which is information for determining the area where an image will be captured;
matching the area where an image will be captured to a position represented by the positional information of the object which is stored in said storage device;
extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
generating visual feature information of the partial video image data;
comparing the visual feature information of the partial video image data and the stored visual feature information of the object to calculate a similarity therebetween; and
determining whether an image of an object is captured or not, based on the result of matching and calculated similarity.
29. A video image annotation applying program adapted to be installed in a video image annotation applying apparatus for associating an object and information of an object which is stored with each other, said video image annotation applying program enabling a computer to perform a process comprising the steps of:
storing, in a storage device, positional information which is information representing the position of an object, visual feature information which is information representing a numerical value of a visual feature of the object, and additional information which is information added to the object, in association with each other;
estimating an area where an image will be captured based on image capturing information which is information for determining the area where an image will be captured;
matching the area where an image will be captured to a position represented by the positional information of the object which is stored in said storage device;
extracting partial video image data which is either video image data of a partial area of the video image based on the video image data or is video image data of the entire video image, from the input video image;
generating visual feature information of the partial video image data;
comparing the visual feature information of the partial video image data and the visual feature information of the object which is stored with each other to calculate a similarity therebetween; and
identifying an object which is contained in the video image, based on the result of matching and calculated similarity, and associating the identified object and the additional information which is stored with each other.
Specification