When and why kernel_size may be 1 or 2 dims when using Conv2D?

Question

When using Conv2D we can define the kernel_size to be 1 dim or 2 dims (or higher value of dims)

for example:

Conv2D(filters=32, kernel_size=3, activation='relu')

or

Conv2D(filters=32, kernel_size=(3,3), activation='relu')

When using Conv2D , what is the recommendations ?
What are the cases we will choose to use 1D (kernel_size=7) and what are the cases we will prefer to choose 2D (kernel_size=(3,3)) or other dim ?
Does the choosing the dim of kernel_size will affect the choosing of pooling size ? (MaxPooling2D(pool_size=?))

Alex K. · Accepted Answer

(and 2) ):

kernel_size: An integer or tuple/list of 2 integers, specifying the height and width of the 2D convolution window. Can be a single integer to specify the same value for all spatial dimensions.

Keras doc. Conv2D performs convolution with 2D filter, it can be square (use single int to define) or not square (use tuple). Regarding size of filter (3, 7, etc.) choice depends on task and architecture. See answer for basic intuition behind particular example

Kernel_size as well as stride and padding affect Conv2D output shape, and you should take that into account if you want MaxPooling2D output of specific shape.

When and why kernel_size may be 1 or 2 dims when using Conv2D?

Answers (2)

Related Questions