UNIFIED FRAMEWORK FOR PRECISE VISION-AIDED NAVIGATION
First Claim
1. A method for performing multi-camera visual odometry across a system including a plurality of cameras, wherein each camera comprises a local dataset, comprising the steps of:
- for each camera, generating a set of pose hypotheses;
for each camera, scoring the set of pose hypotheses based on a comparison with the local dataset to produce a set of intra-camera scores;
transferring the set of pose hypotheses to the other cameras in the system;
for each camera, scoring the set of transferred pose hypotheses based on the local dataset to produce a set of inter-camera scores;
combining the set of intra-camera scores and the one or more sets of inter-camera scores to produce a set of global scores; and
identifying an optimized video-based pose hypothesis based on an evaluation of the set of global scores.
2 Assignments
0 Petitions
Accused Products
Abstract
A system and method for efficiently locating in 3D an object of interest in a target scene using video information captured by a plurality of cameras. The system and method provide for multi-camera visual odometry wherein pose estimates are generated for each camera by all of the cameras in the multi-camera configuration. Furthermore, the system and method can locate and identify salient landmarks in the target scene using any of the cameras in the multi-camera configuration and compare the identified landmark against a database of previously identified landmarks. In addition, the system and method provide for the integration of video-based pose estimations with position measurement data captured by one or more secondary measurement sensors, such as, for example, Inertial Measurement Units (IMUs) and Global Positioning System (GPS) units.
105 Citations
20 Claims
-
1. A method for performing multi-camera visual odometry across a system including a plurality of cameras, wherein each camera comprises a local dataset, comprising the steps of:
-
for each camera, generating a set of pose hypotheses; for each camera, scoring the set of pose hypotheses based on a comparison with the local dataset to produce a set of intra-camera scores; transferring the set of pose hypotheses to the other cameras in the system; for each camera, scoring the set of transferred pose hypotheses based on the local dataset to produce a set of inter-camera scores; combining the set of intra-camera scores and the one or more sets of inter-camera scores to produce a set of global scores; and identifying an optimized video-based pose hypothesis based on an evaluation of the set of global scores. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A navigation system comprising:
-
a plurality of cameras configured to capture video related to a target scene; a visual odometry module configured to; receive the video from each of the plurality of cameras, detect one or more features of the target scene, track the one or more features to produce one or more feature tracks, and generate an optimized video-based pose estimation based on the one or more feature tracks, an integration module communicatively connected to the visual odometry module and one or more secondary measurement sensors configured to capture position measurement data related to the target scene, the integration module configured to; integrate the video-based pose estimation and the position measurement data, and generate a filtered pose estimation; and a global landmark recognition module configured to; receive a video snapshot related to the target scene from the plurality of cameras, extract one or more landmarks from the video snapshot; match the one or more extracted landmarks with historical landmark information stored in a landmark database to produce landmark matching information, compute a landmark-based pose estimation based at least in part on the landmark matching information, and compute a final pose estimation based at least on a comparison of the video-based pose estimation and the landmark-based pose estimation. - View Dependent Claims (9, 10, 11, 12)
-
-
13. A landmark recognition method comprising the steps of:
-
receiving a video snapshot comprising a pair of stereo images related to a target scene, extracting a set of feature points from each stereo image, identifying correspondences among the extracted set of feature points between the pair of stereo images, generating 3D coordinates for each feature point, wherein the 3D coordinates of the set of feature points comprises a landmark, computing a histogram of oriented gradients (HOG) descriptor based on at least a portion of the stereo image pair, associating the landmark with the HOG descriptor to produce a landmark snapshot, and matching the landmark snapshot with one or more historical landmarks stored in a database. - View Dependent Claims (14, 15, 16)
-
-
17. A landmark recognition method comprising the steps of:
-
capturing a first image comprising a first set of landmarks, wherein each landmark is associated with a unique HOG descriptor; capturing a second image comprising a second set of landmarks wherein each landmark is associated with a unique HOG descriptor; and searching the second set of landmarks in the second image to identify corresponding landmark matches in the first set of landmarks, based on a comparison of the HOG descriptors. - View Dependent Claims (18)
-
-
19. A method for determining the location of an object:
-
receiving new location information relating to an object; extracting a landmark snapshot from the new location information; associating the landmark snapshot with a HOG descriptor; generating a geo-spatial search region based at least on the new location information; selecting an initial set of candidate landmark snapshots based on the geo-spatial search region; pruning the initial set of candidate landmark snapshots to identify a refined set of candidate landmark snapshots; identifying an optimized candidate landmark snapshot based on a comparison of the HOG descriptor of the extracted landmark snapshot and the HOG descriptors of the refined set of candidate landmark snapshots; and determining the location of the object based at least in part on the optimized candidate landmark snapshot. - View Dependent Claims (20)
-
Specification