Room layout estimation methods and techniques
First Claim
Patent Images
1. A system comprising:
- non-transitory memory configured to store;
a room image for room layout estimation; and
a neural network for estimating a layout of a room, the neural network comprising;
an encoder-decoder sub-network configured to receive the digital room image as an input, wherein the encoder-decoder sub-network comprises an encoder and a decoder; and
a classifier sub-network in communication with the encoder-decoder sub-network configured to classify a room type associated with the room image;
a hardware processor in communication with the non-transitory memory, the hardware processor programmed to;
access the room image;
determine, using the encoder, decoder, and the room image, a plurality of predicted two-dimensional (2D) keypoint maps corresponding to a plurality of room types;
determine, using the encoder, the classifier sub-network and the room image, a predicted room type from the plurality of room types;
determine, using the plurality of predicted 2D keypoint maps and the predicted room type, a plurality of ordered keypoints associated with the predicted room type; and
determine, using the plurality of ordered keypoints, a predicted layout of the room in the room image.
4 Assignments
0 Petitions
Accused Products
Abstract
Systems and methods for estimating a layout of a room are disclosed. The room layout can comprise the location of a floor, one or more walls, and a ceiling. In one aspect, a neural network can analyze an image of a portion of a room to determine the room layout. The neural network can comprise a convolutional neural network having an encoder sub-network, a decoder sub-network, and a side sub-network. The neural network can determine a three-dimensional room layout using two-dimensional ordered keypoints associated with a room type. The room layout can be used in applications such as augmented or mixed reality, robotics, autonomous indoor navigation, etc.
-
Citations
39 Claims
-
1. A system comprising:
-
non-transitory memory configured to store; a room image for room layout estimation; and a neural network for estimating a layout of a room, the neural network comprising; an encoder-decoder sub-network configured to receive the digital room image as an input, wherein the encoder-decoder sub-network comprises an encoder and a decoder; and a classifier sub-network in communication with the encoder-decoder sub-network configured to classify a room type associated with the room image; a hardware processor in communication with the non-transitory memory, the hardware processor programmed to; access the room image; determine, using the encoder, decoder, and the room image, a plurality of predicted two-dimensional (2D) keypoint maps corresponding to a plurality of room types; determine, using the encoder, the classifier sub-network and the room image, a predicted room type from the plurality of room types; determine, using the plurality of predicted 2D keypoint maps and the predicted room type, a plurality of ordered keypoints associated with the predicted room type; and determine, using the plurality of ordered keypoints, a predicted layout of the room in the room image. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A system comprising:
-
non-transitory memory configured to store parameters for the neural network; and a hardware processor in communication with the non-transitory memory, the hardware processor programmed to; receive a training room image, wherein the training room image is associated with; a reference room type from a plurality of room types, and reference keypoints associated with a reference room layout; generate a neural network for room layout estimation, wherein the neural network comprises; an encoder-decoder sub-network configured to receive the training room image as an input and output predicted two-dimensional (2D) keypoints associated with a predicted room layout associated with each of the plurality of room types, wherein the encoder-decoder sub-network comprises an encoder and a decoder, and a side sub-network in communication with the encoder-decoder network configured to output a predicted room type from the plurality of room types, wherein the predicted room type is determined at least in part by the encoder and side sub-network; and optimize a loss function based on a first loss for the predicted 2D keypoints and a second loss for the predicted room type; and update parameters of the neural network based on the optimized loss function. - View Dependent Claims (21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39)
-
Specification