In deep learning what's difference between two 3*3 convolution filters and one 5*5 convolution filters?

Question

For example, as to the famous AlexNet architecutre (original paper), what's the difference of using two 3*3 convolution filters between using one 5*5 convolution filter ?

The two 3*3 convolution filters and one 5*5 convolution filter have been highlighted by red rectangle in the below image.

What about use another 5*5 convolution filter to supersede the two 3*3 convolution filters, or vice verse?

Jayhello · Accepted Answer

I have found from paper <>.

Rather than using relatively large receptive fields in the first conv. layers (e.g. 11×11with stride 4 in (Krizhevsky et al., 2012), or 7×7 with stride 2 in (Zeiler & Fergus, 2013; Sermanet et al., 2014)), we use very small 3 × 3 receptive fields throughout the whole net, which are convolved with the input at every pixel (with stride 1). It is easy to see that a stack of two 3×3 conv.layers (without spatial poolingin between) has an effective receptive field of 5×5; three such layers have a 7 × 7 effective receptive field.

two 3*3 convolution filter is equivalent to one 5*5 convolution filter.
two 3*3 convolution filter will have less parameters than one 5*5 convolution filter.
two 3*3 convolution filter will make network more deep and extract more complex features than one 5*5 convolution filter.

paper:https://arxiv.org/pdf/1409.1556.pdf

In deep learning what's difference between two 33 convolution filters and one 55 convolution filters?

Answers (2)

Related Questions

In deep learning what&#39;s difference between two 3*3 convolution filters and one 5*5 convolution filters?

Answers (2)

Related Questions

In deep learning what's difference between two 33 convolution filters and one 55 convolution filters?