Reputation: 1025
I want to implement a custom differentiable function in PyTorch that acts like torch.clamp in the forward pass but in the backward pass returns gradients as if it were torch.tanh.
I tried the following code:
import torch

class ClampWithGrad(torch.autograd.Function):
    @staticmethod
    def forward(ctx, input):
        ctx.save_for_backward(input)
        return torch.clamp(input, -1, 1)

    @staticmethod
    def backward(ctx, grad_output):
        input, = ctx.saved_tensors
        grad_input = grad_output.clone()
        grad_input[input <= -1] = (1.0 - torch.tanh(input[input <= -1])**2.0) * grad_output[input <= -1]
        grad_input[input >= 1] = (1.0 - torch.tanh(input[input >= 1])**2.0) * grad_output[input >= 1]
        return grad_input
However, when I include this in my neural network, I get NaNs. How can I best implement this?
Upvotes: 0
Views: 656
Reputation: 1048
Calculate the tanh once and store it in a variable to avoid computing it multiple times. Also, clip the gradient you return to a maximum norm of 1.0:
def backward(ctx, grad_output):
    input, = ctx.saved_tensors
    grad_input = grad_output.clone()
    tanh = torch.tanh(input)  # compute tanh once and reuse it below
    grad_input[input <= -1] = (1.0 - tanh[input <= -1]**2.0) * grad_output[input <= -1]
    grad_input[input >= 1] = (1.0 - tanh[input >= 1]**2.0) * grad_output[input >= 1]
    # Clip the returned gradient to a maximum norm of 1.0.
    # (torch.nn.utils.clip_grad_norm_ operates on the .grad attributes of
    # parameters, so the scaling is done directly on grad_input here.)
    max_norm = 1.0
    norm = grad_input.norm()
    if norm > max_norm:
        grad_input = grad_input * (max_norm / norm)
    return grad_input
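As a quick sanity check, you can also exercise the backward pass in isolation and look for NaNs directly (a minimal sketch; the tensor names below are only illustrative and assume the ClampWithGrad class defined in the question):

import torch

# Create a leaf tensor with some values outside [-1, 1].
x = (torch.randn(5) * 3).requires_grad_()

y = ClampWithGrad.apply(x)   # custom autograd Functions are invoked via .apply
y.sum().backward()

print(x.grad)                     # 1.0 inside [-1, 1], tanh'(x) outside
print(torch.isnan(x.grad).any())  # quick check for NaNs in the backward output

If this standalone check is clean, the NaNs are more likely coming from the rest of the network (e.g. exploding activations feeding into this function) than from the custom backward itself.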
Upvotes: 1