Generating three-dimensional models from images
First Claim
Patent Images
1. A method, comprising:
- receiving, by a computing device comprising a processor, input image data, wherein the input image data represents a faç
ade;
reconstructing, by the computing device, the input image data comprising determining three-dimensional (3D) points, lines, and camera positions associated with the faç
ade;
performing, by the computing device, a multi-view semantic segmentation on the reconstructed input image data comprising;
recognizing a faç
ade structure; and
segmenting the faç
ade as a result of the recognizing;
block partitioning, by the computing device, the reconstructed input image data into a plurality of independent object blocks comprising a first object block and at least a second object block, comprising ignoring vertical line segments comprising extensions that cross a defined number of horizontal line segments; and
separating, by the computing device, the first object block from at least the second object block comprising using one or more line structures in the input image data, wherein the first object block and at least the second object block belong to a same object class.
1 Assignment
0 Petitions
Accused Products
Abstract
The subject disclosure relates to generating models from images. In an aspect, multi-view semantic segmentation is provided to recognize and segment images at the pixel level into semantically meaningful areas, and which can provide labels with a specific object class. In further aspects, a partition scheme is provided that can separate objects into independent blocks using major line structures of a scene. In addition, an inverse patch-based orthographic composition and structure analysis on a block is provided that can regularize noisy and missing reconstructed 3D data to facilitate image-based modeling.
-
Citations
20 Claims
-
1. A method, comprising:
-
receiving, by a computing device comprising a processor, input image data, wherein the input image data represents a faç
ade;reconstructing, by the computing device, the input image data comprising determining three-dimensional (3D) points, lines, and camera positions associated with the faç
ade;performing, by the computing device, a multi-view semantic segmentation on the reconstructed input image data comprising; recognizing a faç
ade structure; andsegmenting the faç
ade as a result of the recognizing;block partitioning, by the computing device, the reconstructed input image data into a plurality of independent object blocks comprising a first object block and at least a second object block, comprising ignoring vertical line segments comprising extensions that cross a defined number of horizontal line segments; and separating, by the computing device, the first object block from at least the second object block comprising using one or more line structures in the input image data, wherein the first object block and at least the second object block belong to a same object class. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
a processor that executes or facilitates execution of computer executable components stored in a memory, the computer executable components comprising; a multi-view semantic segmentation component configured to create a semantic segmentation of a faç
ade based at least in part on a registered sequence of images associated with the faç
ade;a partitioning component configured to block partition the semantic segmentation into a plurality of object blocks comprising; determining a number of times that a first vertical line crosses horizontal lines within the faç
ade, anddetermining the number of times satisfies a threshold amount, wherein the partitioning component ignores the first vertical line for the block partition; and a block separator component configured to distinguish a first object block of the plurality of object blocks from a second object block of the plurality of object blocks, wherein the first object block and the second object block are classified in a same object class. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16)
-
17. A method, comprising:
-
performing, by a system comprising a processor, a multi-view semantic segmentation on at least a subset of reconstructed input image data representing a faç
ade, wherein the performing comprises recognizing faç
ade structure and segmenting the faç
ade based on the recognizing resulting in a segmented faç
ade;block partitioning, by the system, at least the subset of reconstructed input image data comprising; determining respective scores for a set of vertical lines in the reconstructed input image data, wherein the respective scores represent a number of times each vertical line of the set of vertical lines crosses horizontal lines in the reconstructed input image data, selecting a subset of vertical lines from the set of vertical lines based in part on the respective scores, the selecting comprising ignoring vertical line segments comprises extensions that cross a defined number of horizontal line segments, and producing at least one object block associated with the segmented faç
ade based in part on the subset of vertical lines;performing, by the system, an inverse orthographic composition on at least the subset of reconstructed input image data associated with the at least one object block comprising producing a composed orthographic depth map and a composed orthographic texture for the at least one object block; and performing, by the system, structural analysis and regularization of the composed orthographic depth map and the composed orthographic texture comprising identifying structural elements at a plurality of faç
ade depths for the at least one object block. - View Dependent Claims (18, 19, 20)
-
Specification