×

Protecting cognitive systems from gradient based attacks through the use of deceiving gradients

  • US 10,657,259 B2
  • Filed: 11/01/2017
  • Issued: 05/19/2020
  • Est. Priority Date: 11/01/2017
  • Status: Active Grant
First Claim
Patent Images

1. A method, in a data processing system comprising a processor and a memory, the memory comprising instructions which are executed by the processor to specifically configure the processor to implement a hardened neural network, the method comprising:

  • configuring the hardened neural network executing in the data processing system to introduce noise in internal feature representations of the hardened neural network, wherein the noise introduced in the internal feature representations diverts gradient computations associated with a loss surface of the hardened neural network;

    configuring the hardened neural network executing in the data processing system to implement a merge layer of nodes that combine outputs of adversarially trained output nodes of the hardened neural network with output nodes of the hardened neural network trained based on the introduced noise;

    receiving, by the hardened neural network, input data for classification by the hardened neural network;

    processing, by the hardened neural network, the input data to generate classification labels for the input data and thereby generate augmented input data; and

    outputting, by the hardened neural network, the augmented input data to a computing system for processing of the augmented input data to perform a computing operation.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×