How to add a learnable bias to one of the network output channel in pytorch

Question

class pu_fc(nn.Module):

    def __init__(self, input_dim):
        super(pu_fc, self).__init__()
        self.input_dim = input_dim
        
        self.fc1 = nn.Linear(input_dim, 50)
        self.fc2 = nn.Linear(50, 2) 

        self.loss_fn = custom_NLL()

        device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
        self.bias = torch.autograd.Variable(torch.rand(1,1), requires_grad=True).to(device)

    def forward(self, x):
        out = self.fc1(x)
        out = F.relu(out, inplace=True)
        out = self.fc2(out)
        out[..., 1] = out[..., 1] + self.bias
        print('bias: ', self.bias)

        return out

As you can see from the code, I wanted to add a bias term to the second output channel. However, my implementation does not work. The bias term is not updated at all. It kept the same during training which I assume that it is not learnable during training. So the question is that how I can make the bias term learnable? Is it possible to do this? Below is some output of the bias during training. Any hint is grateful, thanks in advance!

bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
Current Epoch: 1
Epoch loss:  0.4424589276313782
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
Current Epoch: 2
Epoch loss:  0.3476297199726105
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)
bias:  tensor([[0.0930]], device='cuda:0', grad_fn=)

How to add a learnable bias to one of the network output channel in pytorch

Answers (1)

Related Questions