Why do we increase dimensions in resnet-50 architecture?

In rensnet-50's bottleneck block, we use conv1x1(previous block out,64), conv3x3(64,64), conv1x1(64,256) and then we repeat.

Why do we increase dimensions in the third conv1x1 layer only to reduce it in the next block?

Upvotes: 0

Answers (1)

Reputation: 3283

It's a bottleneck layer, see the answer to this question:

and this question:

In essence, it's to reduce the feature count, particularly for the 3x3 convolution.

Upvotes: 1