Recognizing entity interactions in visual media
First Claim
1. A method for inferring an area of interest in a two-dimensional image depicting at least one person, the method comprising, with a computing system, algorithmically:
locating the at least one person in the image;
determining, from the image, a spatial configuration of at least a portion of the at least one person located in the image;
estimating a three-dimensional position of the person from the determined spatial configuration;
analyzing the three-dimensional position of the person using a proxemics analysis;
determining a type of human interaction likely depicted in the image based on the proxemics analysis; and
inferring an area of interest in the image based on the determined type of human interaction, the area of interest at least partially spaced from the at least one person, and the area of interest having a size that is greater than zero and less than the size of the entire image.
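The proxemics step and the constraints on the inferred area can be illustrated with a minimal Python sketch. It is not the patented implementation. The zone thresholds are the approximate distances commonly attributed to Edward Hall's proxemic zones (an assumption; the claim names no thresholds), comparing two people by their pairwise distance is also an assumption, and `proxemic_zone`, `Box`, and `is_valid_area_of_interest` are hypothetical names introduced only for illustration.

```python
import math
from dataclasses import dataclass

# Approximate upper bounds of Hall's proxemic zones, in metres.
# ASSUMPTION: the claim does not specify any thresholds.
ZONES = [(0.45, "intimate"), (1.2, "personal"), (3.6, "social")]


def proxemic_zone(p, q):
    """Classify the distance between two estimated 3D positions (metres)."""
    d = math.dist(p, q)
    for limit, name in ZONES:
        if d < limit:
            return name
    return "public"


@dataclass(frozen=True)
class Box:
    """Axis-aligned rectangle in image pixel coordinates (hypothetical type)."""
    x0: float
    y0: float
    x1: float
    y1: float

    @property
    def area(self):
        return max(0.0, self.x1 - self.x0) * max(0.0, self.y1 - self.y0)


def intersection_area(a, b):
    """Area of the overlap between two boxes (0 if they do not overlap)."""
    w = min(a.x1, b.x1) - max(a.x0, b.x0)
    h = min(a.y1, b.y1) - max(a.y0, b.y0)
    return max(0.0, w) * max(0.0, h)


def is_valid_area_of_interest(aoi, person, image):
    """Check the three constraints the claim places on the area of interest."""
    nonzero = aoi.area > 0
    smaller_than_image = aoi.area < image.area
    # ASSUMPTION: "at least partially spaced from" is read here as
    # "some part of the area lies outside the person's bounding box".
    partly_outside = intersection_area(aoi, person) < aoi.area
    return nonzero and smaller_than_image and partly_outside
```

In this reading, a candidate area that lies entirely inside the person's bounding box, has zero size, or covers the whole image would be rejected.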
Abstract
An entity interaction recognition system algorithmically recognizes a variety of different types of entity interactions that may be captured in two-dimensional images. In some embodiments, the system estimates the three-dimensional spatial configuration or arrangement of entities depicted in the image. In some embodiments, the system applies a proxemics-based analysis to determine an interaction type. In some embodiments, the system infers, from a characteristic of an entity detected in an image, an area or entity of interest in the image.
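One simple way to picture the step from a two-dimensional image to a three-dimensional position is pinhole back-projection from apparent size. The sketch below is only an illustration under stated assumptions and is not taken from the patent: the function name `backproject_head`, the known focal length and principal point, and the rough adult head height of 0.24 m are all hypothetical inputs chosen for the example.

```python
def backproject_head(u, v, head_px, focal_px, cx, cy, head_m=0.24):
    """Estimate a head's 3D camera-frame position in metres.

    (u, v)    pixel centre of the detected head
    head_px   apparent head height in pixels
    focal_px  focal length in pixels (ASSUMED known)
    (cx, cy)  principal point in pixels (ASSUMED known)
    head_m    real head height in metres (ASSUMED rough average)
    """
    if head_px <= 0 or focal_px <= 0:
        raise ValueError("head_px and focal_px must be positive")
    z = focal_px * head_m / head_px   # depth from similar triangles
    x = (u - cx) * z / focal_px       # lateral offset from the optical axis
    y = (v - cy) * z / focal_px       # vertical offset from the optical axis
    return (x, y, z)
```

Positions recovered this way for two or more people could then be compared by distance, which is the kind of input a proxemics-based analysis would need.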
10 Citations
21 Claims
1. A method for inferring an area of interest in a two-dimensional image depicting at least one person, the method comprising, with a computing system, algorithmically:
locating the at least one person in the image;
determining, from the image, a spatial configuration of at least a portion of the at least one person located in the image;
estimating a three-dimensional position of the person from the determined spatial configuration;
analyzing the three-dimensional position of the person using a proxemics analysis;
determining a type of human interaction likely depicted in the image based on the proxemics analysis; and
inferring an area of interest in the image based on the determined type of human interaction, the area of interest at least partially spaced from the at least one person, and the area of interest having a size that is greater than zero and less than the size of the entire image.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11.
12. A non-transitory computer readable medium for storing computer instructions that, when executed by one or more processors causes the one or more processors to perform a method for inferring an area of interest in a two-dimensional image depicting at least one person, the method comprising, algorithmically:
locating the at least one person in the image;
determining, from the image, a spatial configuration of at least a portion of the at least one person located in the image;
estimating a three-dimensional position of the person from the determined spatial configuration;
analyzing the three-dimensional position of the person using a proxemics analysis;
determining a type of human interaction likely depicted in the image based on the proxemics analysis; and
inferring an area of interest in the image based on the determined type of human interaction, the area of interest at least partially spaced from the at least one person, and the area of interest having a size that is greater than zero and less than the size of the entire image.
13. A computing system comprising one or more of an image/video tagger, an information retrieval system, and an intelligent assistant and one or more processors to perform a method for inferring an area of interest in a two-dimensional image depicting at least one person, the method comprising, algorithmically:
locating the at least one person in the image;
determining, from the image, a spatial configuration of at least a portion of the at least one person located in the image;
estimating a three-dimensional position of the person from the determined spatial configuration;
analyzing the three-dimensional position of the person using a proxemics analysis;
determining a type of human interaction likely depicted in the image based on the proxemics analysis; and
inferring an area of interest in the image based on the determined type of human interaction, the area of interest at least partially spaced from the at least one person, and the area of interest having a size that is greater than zero and less than the size of the entire image.
14. A method for inferring an area of interest in a recorded two-dimensional image using a characteristic of a gaze of a person depicted in the recorded two-dimensional image, the method comprising, with a computing system, algorithmically:
detecting the person in the recorded two-dimensional image;
estimating a three-dimensional spatial configuration of at least a portion of the person in the image;
inferring a characteristic of the person's gaze in the image based on the estimated spatial configuration;
determining a proxemics class associated with the recorded two-dimensional image based on the determined three-dimensional spatial configuration; and
inferring an area of interest in the image based on at least one of the inferred characteristic of the person's gaze or the proxemics class.
Dependent claims: 15, 16, 17, 18, 19.
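The gaze step of claim 14 can be pictured with a small geometric sketch. It is an illustration only, not the claimed method: it assumes each person already has an estimated 3D head position and a gaze direction vector, and it treats "looking at" as the target falling within a cone around the gaze ray. The function names and the 15-degree half-angle are hypothetical choices made for the example.

```python
import math


def is_looking_at(head, gaze_dir, target, max_angle_deg=15.0):
    """True if `target` lies within a cone of half-angle `max_angle_deg`
    around the gaze ray that starts at `head` and points along `gaze_dir`.
    ASSUMPTION: the cone model and its 15-degree default are illustrative."""
    to_target = [t - h for t, h in zip(target, head)]
    norm_t = math.hypot(*to_target)
    norm_g = math.hypot(*gaze_dir)
    if norm_t == 0 or norm_g == 0:
        return False  # undefined direction, so no gaze relation
    cos_angle = sum(g * t for g, t in zip(gaze_dir, to_target)) / (norm_t * norm_g)
    cos_angle = max(-1.0, min(1.0, cos_angle))  # guard against rounding
    return math.degrees(math.acos(cos_angle)) <= max_angle_deg


def gaze_relation(head_a, gaze_a, head_b, gaze_b):
    """Label the gaze relation between two people as a simple
    'characteristic of gaze' (hypothetical label set)."""
    a_sees_b = is_looking_at(head_a, gaze_a, head_b)
    b_sees_a = is_looking_at(head_b, gaze_b, head_a)
    if a_sees_b and b_sees_a:
        return "mutual"
    if a_sees_b or b_sees_a:
        return "one-way"
    return "averted"
```

Under these assumptions, a "mutual" result might suggest the people themselves are the focus, while gaze directed elsewhere might point toward an area of interest away from them.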
20. A non-transitory computer readable medium for storing computer instructions that, when executed by one or more processors, causes the one or more processors to perform a method for inferring an area of interest in a recorded two-dimensional image using a characteristic of a gaze of a person depicted in the recorded two-dimensional image, the method comprising, algorithmically:
detecting the person in the recorded two-dimensional image;
estimating a three-dimensional spatial configuration of at least a portion of the person in the image;
inferring a characteristic of the person's gaze in the image based on the estimated spatial configuration;
determining a proxemics class associated with the recorded two-dimensional image based on the determined three-dimensional spatial configuration; and
inferring an area of interest in the image based on at least one of the inferred characteristic of the person's gaze or the proxemics class.
21. A computing system comprising one or more of an image/video tagger, an information retrieval system, and an intelligent assistant and one or more processors to perform a method for inferring an area of interest in a recorded two-dimensional image using a characteristic of a gaze of a person depicted in the recorded two-dimensional image, the method comprising, algorithmically:
detecting the person in the recorded two-dimensional image;
estimating a three-dimensional spatial configuration of at least a portion of the person in the image;
inferring a characteristic of the person's gaze in the image based on the estimated spatial configuration;
determining a proxemics class associated with the recorded two-dimensional image based on the determined three-dimensional spatial configuration; and
inferring an area of interest in the image based on at least one of the inferred characteristic of the person's gaze or the proxemics class.
Specification