Ananth Reddy

Reputation: 317

Freezing layers in a pre-trained BERT model

(image of the pre-trained BERT model, whose last two layers are dropout and classifier)

How do I freeze the last two layers (the dropout and classifier layers) in the above pre-trained model, so that when the model is run I get the dense layer's output?

Upvotes: 3

Views: 10110

Answers (2)

Wasi Ahmad

Reputation: 37741

I would like to point you to the definition of BertForSequenceClassification; you can easily avoid the dropout and classifier by using:

from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=2)
model.bert(input_ids)  # calling the underlying BertModel (with your tokenized input_ids) gives you the dense output

Why can you do the above? If you take a look at the constructor of BertForSequenceClassification:

def __init__(self, config):
    super(BertForSequenceClassification, self).__init__(config)
    self.num_labels = config.num_labels

    self.bert = BertModel(config)                          # the BERT encoder itself
    self.dropout = nn.Dropout(config.hidden_dropout_prob)  # dropout applied to the pooled output
    self.classifier = nn.Linear(config.hidden_size, self.config.num_labels)  # classification head

    self.init_weights()

As you can see, you just want to ignore the dropout and classifier layers.
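For concreteness, here is a minimal usage sketch (assuming a recent transformers version where the tokenizer is callable and the model output supports integer indexing; the example sentence is arbitrary):

from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
inputs = tokenizer("Freezing layers in BERT", return_tensors="pt")

outputs = model.bert(**inputs)
pooled_output = outputs[1]  # dense pooled representation, before dropout and classifier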

One more thing: freezing a layer and removing a layer are two different things. In your question, you mentioned that you want to freeze the classifier layer, but freezing a layer will not help you avoid it. Freezing means you do not want to train the layer; its weights are not updated, but it still runs in the forward pass.
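For completeness, a minimal sketch of what freezing looks like, reusing the model from above:

# Freeze the classifier head: its parameters receive no gradient updates,
# but the layer still executes in the forward pass.
for param in model.classifier.parameters():
    param.requires_grad = False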

Upvotes: 8

Szymon Maszke

Reputation: 24815

You already have a dense layer as output (Linear).

There is no need to freeze dropout, as it has no weights and only scales activations during training. You can set it to evaluation mode (essentially this layer will do nothing afterwards) by issuing:

model.dropout.eval()

Though it will be switched back to training mode if the whole model is set to train via model.train(), so keep an eye on that.
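A minimal sketch of guarding against that inside a hypothetical training loop:

model.train()         # re-enables dropout along with everything else
model.dropout.eval()  # so switch dropout back to eval mode after each model.train() call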

To freeze the last layer's weights you can issue:

model.classifier.weight.requires_grad_(False)

(or the bias, if that's what you are after)
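If you freeze parameters this way, you may also want to hand only the trainable ones to the optimizer. A hedged sketch (torch.optim.Adam and the learning rate are just example choices):

import torch

optimizer = torch.optim.Adam(
    (p for p in model.parameters() if p.requires_grad),  # skip frozen parameters
    lr=2e-5,
)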

If you want to change the last layer to another shape instead of (768, 2), just overwrite it with another module, e.g.

model.classifier = torch.nn.Linear(768, 10)

for an output tensor of size 10 (the input size has to be exactly as specified in the model, hence 768).
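A quick sanity check of the replaced head, using a hypothetical batch of pooled vectors:

import torch

dummy = torch.randn(4, 768)           # fake batch of 4 pooled BERT vectors
print(model.classifier(dummy).shape)  # torch.Size([4, 10])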

Upvotes: 4
