Annotating images based on multi-modal sensor data

  • US 10,691,943 B1
  • Filed: 01/31/2018
  • Issued: 06/23/2020
  • Est. Priority Date: 01/31/2018
  • Status: Active Grant
First Claim

1. An aerial vehicle comprising:

  • a plurality of propulsion motors, wherein each of the propulsion motors comprises a propeller and a drive shaft, and wherein each of the propulsion motors is configured to rotate the propeller about an axis defined by the drive shaft;

    a digital camera configured to capture one or more visual images;

    a thermal camera configured to capture one or more thermal images, wherein the digital camera and the thermal camera are calibrated and aligned with fields of view that overlap at least in part; and

    a control system having at least one computer processor, wherein the control system is in communication with each of the digital camera, the thermal camera and the plurality of propulsion motors, and wherein the at least one computer processor is configured to execute one or more instructions for performing a method comprising:

    initiating a first operation of at least one of the plurality of propulsion motors;

    during the first operation, capturing a first plurality of visual images by the digital camera; and

    capturing a second plurality of thermal images by the thermal camera;

    receiving information regarding at least one visual attribute and at least one thermal attribute of an object;

    detecting the at least one visual attribute of the object within a first portion of a first one of the first plurality of visual images;

    detecting the at least one thermal attribute of the object within a second portion of a second one of the second plurality of thermal images;

    determining that the first portion of the first one of the first plurality of visual images corresponds to the second portion of the second one of the second plurality of thermal images;

    generating an annotation of the first one of the first plurality of visual images based at least in part on at least one of the first portion of the first one of the first plurality of visual images or the second portion of the second one of the second plurality of thermal images;

    storing the annotation in association with at least the first one of the first plurality of visual images;

    providing at least the first one of the first plurality of visual images to a classifier as a training input;

    providing at least the annotation to the classifier as a training output;

    training the classifier using at least the training input and the training output;

    capturing at least a second plurality of visual images by the digital camera;

    providing at least one of the second plurality of visual images to the classifier as an input;

    receiving an output from the classifier; and

    identifying a portion of the at least one of the second plurality of visual images depicting the object based at least in part on the output.
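
The detection, correspondence, and annotation steps of the claimed method might be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the two images are assumed to be already calibrated to the same pixel grid, each attribute is reduced to a simple per-pixel predicate, and "corresponds" is approximated by an intersection-over-union threshold. The names `Box`, `bounding_box`, `iou`, `annotate`, and `min_iou` are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass(frozen=True)
class Box:
    """Axis-aligned pixel box; x1 and y1 are exclusive."""
    x0: int
    y0: int
    x1: int
    y1: int

    def area(self) -> int:
        return max(0, self.x1 - self.x0) * max(0, self.y1 - self.y0)


def bounding_box(image: Sequence[Sequence[float]],
                 matches: Callable[[float], bool]) -> Optional[Box]:
    """Return the tightest box around all pixels satisfying `matches`."""
    xs, ys = [], []
    for y, row in enumerate(image):
        for x, pixel in enumerate(row):
            if matches(pixel):
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return Box(min(xs), min(ys), max(xs) + 1, max(ys) + 1)


def intersection(a: Box, b: Box) -> Box:
    """Overlap of two boxes; has zero area when they are disjoint."""
    return Box(max(a.x0, b.x0), max(a.y0, b.y0),
               min(a.x1, b.x1), min(a.y1, b.y1))


def iou(a: Box, b: Box) -> float:
    """Intersection over union, used here as the correspondence score."""
    inter = intersection(a, b).area()
    union = a.area() + b.area() - inter
    return inter / union if union else 0.0


def annotate(visual, thermal, is_visual_match, is_thermal_match,
             label: str, min_iou: float = 0.5) -> Optional[dict]:
    """Annotate the visual image only if both modalities agree.

    Assumption: `visual` and `thermal` share one pixel grid, so the
    same (x, y) refers to the same point in the scene.
    """
    visual_box = bounding_box(visual, is_visual_match)
    thermal_box = bounding_box(thermal, is_thermal_match)
    if visual_box is None or thermal_box is None:
        return None
    overlap = iou(visual_box, thermal_box)
    if overlap == 0.0 or overlap < min_iou:
        return None
    return {"label": label, "box": intersection(visual_box, thermal_box)}


# Toy data: a bright patch in the visual image and a warm patch in the
# thermal image that partly overlap.
visual = [[0] * 6 for _ in range(4)]
thermal = [[20.0] * 6 for _ in range(4)]
for y in (1, 2):
    for x in (1, 2, 3):
        visual[y][x] = 200
    for x in (2, 3, 4):
        thermal[y][x] = 36.5

annotation = annotate(visual, thermal,
                      is_visual_match=lambda p: p > 128,
                      is_thermal_match=lambda t: 30.0 <= t <= 40.0,
                      label="object")
```

In the claimed method, an annotation like this would be stored with the visual image, and the pair would then serve as training input and training output for a classifier. The training step is omitted here because the claim does not specify the classifier.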
