Storage medium having stored therein image processing program, image processing apparatus, image processing system, and image processing method
First Claim
1. A non-transitory computer-readable storage medium having stored therein an image processing program executed by a computer of an image processing apparatus for displaying an image on a display device, the image processing program causing the computer to perform functionality comprising:
- obtaining an image taken by a real camera;
calculating position and orientation information determined in accordance with a position and an orientation of the real camera in a real space;
obtaining sound data representing a sound signal from a sound input device;
recognizing a sound inputted into the sound input device;
setting, based on the taken image, a virtual object to be additionally displayed on the taken image and setting, based on a sound recognition result, a display animation of the virtual object;
setting, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and setting, when the sound inputted into the sound input device is recognized as the first sound and the position and orientation information does not satisfy the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action;
setting in a virtual world a position and an orientation of a virtual camera based on the position and orientation information;
arranging the virtual object set in the virtual world;
generating as a virtual world image an image in the virtual world seen from the virtual camera;
generating as a combined image an image obtained by superimposing the virtual world image, including the virtual object, on the taken image; and
causing the display device to display the combined image.
1 Assignment
0 Petitions
Accused Products
Abstract
An image taken by a real camera is repeatedly obtained, and position and orientation information determined in accordance with a position and an orientation of a real camera in a real space is repeatedly calculated. A virtual object or a letter to be additionally displayed on the taken image is set as an additional display object, and based on a result of recognition of a sound inputted into a sound input device, at least one selected from the group consisting of a display position, an orientation, and a display form of the additional display object is set. A combined image repeatedly generated by superimposing on the taken image the set additional display object with reference to a position in the taken image in accordance with the position and orientation information is displayed on a display device.
-
Citations
26 Claims
-
1. A non-transitory computer-readable storage medium having stored therein an image processing program executed by a computer of an image processing apparatus for displaying an image on a display device, the image processing program causing the computer to perform functionality comprising:
-
obtaining an image taken by a real camera; calculating position and orientation information determined in accordance with a position and an orientation of the real camera in a real space; obtaining sound data representing a sound signal from a sound input device; recognizing a sound inputted into the sound input device; setting, based on the taken image, a virtual object to be additionally displayed on the taken image and setting, based on a sound recognition result, a display animation of the virtual object; setting, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and setting, when the sound inputted into the sound input device is recognized as the first sound and the position and orientation information does not satisfy the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action; setting in a virtual world a position and an orientation of a virtual camera based on the position and orientation information; arranging the virtual object set in the virtual world; generating as a virtual world image an image in the virtual world seen from the virtual camera; generating as a combined image an image obtained by superimposing the virtual world image, including the virtual object, on the taken image; and causing the display device to display the combined image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. An image processing apparatus for displaying an image on a display device, comprising at least one processor, the image processing apparatus configured to:
-
obtain an image taken by a real camera; calculate position and orientation information determined in accordance with a position and an orientation of the real camera in a real space, obtain sound data representing a sound signal from a sound input device; recognize a sound inputted into the sound input device; set, based on the taken image, a virtual object to be additionally displayed on the taken image, and set, based on a sound recognition result, a display animation of the virtual object; set, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and set, when the sound inputted into the sound input device is recognized as the first sound and the position and orientation information does not satisfy the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action; set in a virtual world a position and an orientation of a virtual camera based on the position and orientation information; arrange the virtual object set in the virtual world; generate as a virtual world image an image in the virtual world seen from the virtual camera; generate as a combined image an image obtained by superimposing the virtual world image, including the virtual object, on the taken image; and cause the display device to display the combined image. - View Dependent Claims (18)
-
-
19. An image processing system which includes a plurality of apparatuses communicable with each other, and which displays an image on a display device, the image processing system comprising:
a processing system having at least one processor, the processing system configured to; obtain an image taken by a real camera, calculate position and orientation information determined in accordance with a position and an orientation of the real camera in a real space, obtain sound data representing a sound signal from a sound input device, recognize a sound inputted into the sound input device, set, based on the taken image, a virtual object to be additionally displayed on the taken image, and set, based on a sound recognition result, a display animation of the virtual object, set, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and set, when the sound inputted into the sound input device is recognized as the first sound and the position and orientation information does not satisfy the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action, set in a virtual world a position and an orientation of a virtual camera based on the position and orientation information; arrange the virtual object set in the virtual world; generate as a virtual world image an image in the virtual world seen from the virtual camera; generate as a combined image an image obtained by superimposing the virtual world image, including the virtual object, on the taken image; and cause the display device to display the combined image.
-
20. A non-transitory computer-readable storage medium having stored therein an image processing program executed by a computer of an image processing apparatus for displaying an image on a display device, the image processing program causing the computer to perform functionality comprising:
-
repeatedly obtaining an image taken by a real camera; repeatedly calculating position and orientation information determined in accordance with a position and an orientation of the real camera in real space; obtaining sound data representing a sound signal from a sound input device; recognizing a sound inputted into the sound input device; setting a virtual object to be additionally displayed on the taken image; setting, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and setting, when the sound inputted into the sound input device is not recognized as the first sound and the position and orientation information satisfies the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action; repeatedly generating a combined image by, based on a position included in the taken image and corresponding to the position and orientation information, superimposing the set virtual object; setting in a virtual world a position and an orientation of a virtual camera based on the position and orientation information; arranging the virtual object set in the virtual world; generating as a virtual world image an image in the virtual world seen from the virtual camera; and repeatedly causing the display device to display the combined image. - View Dependent Claims (21)
-
-
22. An image processing apparatus for displaying an image on a display device, comprising at least one processor, the image processing apparatus configured to:
-
repeatedly obtain an image taken by a real camera; repeatedly calculate position and orientation information determined in accordance with a position and an orientation of the real camera in real space; obtain sound data representing a sound signal from a sound input device; recognize a sound inputted into the sound input device; set a virtual object to be additionally displayed on the taken image; set, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and set, when the sound inputted into the sound input device is not recognized as the first sound and the position and orientation information satisfies the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action; repeatedly generate a combined image by, based on a position included in the taken image and corresponding to the position and orientation information, superimposing the set virtual object; set in a virtual world a position and an orientation of a virtual camera based on the position and orientation information; arrange the virtual object set in the virtual world; generate as a virtual world image an image in the virtual world seen from the virtual camera; and repeatedly cause the display device to display the combined image.
-
-
23. An image processing system which includes a plurality of apparatuses communicable with each other, and which displays an image on a display device, the image processing system comprising:
a processing system having at least one processor, the processing system configured to; repeatedly obtain an image taken by a real camera, repeatedly calculate position and orientation information determined in accordance with a position and an orientation of the real camera in real space, obtain sound data representing a sound signal from a sound input device, recognize a sound inputted into the sound input device, set a virtual object to be additionally displayed on the taken image, set, when the sound inputted into the sound input device is recognized as a first sound and the position and orientation information satisfies a first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a first action, and set, when the sound inputted into the sound input device is not recognized as the first sound and the position and orientation information satisfies the first condition, at least one selected from the group consisting of the display position, the orientation, and the display form of the virtual object based on a second action, repeatedly generate a combined image by, based on a position included in the taken image and corresponding to the position and orientation information, superimposing the set virtual object, set in a virtual world a position and an orientation of a virtual camera based on the position and orientation information, arrange the virtual object set in the virtual world, generate as a virtual world image an image in the virtual world seen from the virtual camera, and repeatedly cause the display device to display the combined image.
-
24. A non-transitory computer-readable storage medium having stored therein an image processing program executed by a computer of an image processing apparatus for displaying an image on a display device, the image processing program causing the computer to perform functionality comprising:
-
obtaining an image taken by a real camera; calculating position and orientation information determined in accordance with a position and an orientation of the real camera in a real space by using at least one of a geographic position of the real camera and an azimuth of an imaging direction of the real camera in the real space; obtaining sound data representing a sound signal from a sound input device; recognizing a sound inputted into the sound input device; setting, based on the taken image, a letter to be additionally displayed on the taken image, and setting, based on a sound recognition result, a display animation of the letter; estimating a subject whose image has been taken in the taken image, based on at least one of the geographic position of the real camera and the azimuth of the imaging direction of the real camera represented by the position and orientation information, the letter representing information about the subject; changing, when the letter coincides with a word recognized through sound recognition, a display form into that indicating that the letter has been selected, and setting the letter; generating a combined image by superimposing, with reference to the position in the taken image in accordance with the position and orientation information, the letter corresponding to the subject and which coincides with the word in the updated display form; and causing the display device to display the combined image, wherein in setting the display animation at least one selected from a group consisting of a display position, an orientation, and a display form of the letter is set.
-
-
25. An image processing apparatus for displaying an image on a display device, comprising at least one processor, the image processing apparatus configured to:
-
obtain an image taken by a real camera; calculate position and orientation information determined in accordance with a position and an orientation of the real camera in a real space by using at least one of a geographic position of the real camera and an azimuth of an imaging direction of the real camera in the real space; obtain sound data representing a sound signal from a sound input device; recognize a sound inputted into the sound input device; set, based on the taken image, a letter to be additionally displayed on the taken image, and set, based on a sound recognition result, a display animation of the letter; estimate a subject whose image has been taken in the taken image, based on at least one of the geographic position of the real camera and the azimuth of the imaging direction of the real camera represented by the position and orientation information, the letter representing information about the subject; change, when the letter coincides with a word recognized through sound recognition, a display form into that indicating that the letter has been selected; generate a combined image by superimposing, with reference to the position in the taken image in accordance with the position and orientation information, the letter corresponding to the subject and which coincides with the word in the updated display form; and cause the display device to display the combined image, wherein in setting the display animation at least one selected from a group consisting of a display position, an orientation, and a display form of the letter is set.
-
-
26. An image processing system which includes a plurality of apparatuses communicable with each other, and which displays an image on a display device, the image processing system comprising:
a processing system having at least one processor, the processing system configured to; obtain an image taken by a real camera, calculate position and orientation information determined in accordance with a position and an orientation of the real camera in a real space by using at least one of a geographic position of the real camera and an azimuth of an imaging direction of the real camera in the real space, obtain sound data representing a sound signal from a sound input device, recognize a sound inputted into the sound input device, set, based on the taken image, a letter to be additionally displayed on the taken image, and set, based on a sound recognition result, a display animation of the letter, estimate a subject whose image has been taken in the taken image, based on at least one of the geographic position of the real camera and the azimuth of the imaging direction of the real camera represented by the position and orientation information, the letter representing information about the subject, change, when the letter coincides with a word recognized through sound recognition, a display form into that indicating that the letter has been selected, generate a combined image by superimposing, with reference to the position in the taken image in accordance with the position and orientation information, the letter corresponding to the subject and which coincides with the word in the updated display form, and cause the display device to display the combined image, wherein in setting the display animation at least one selected from a group consisting of a display position, an orientation, and a display form of the letter is set.
Specification