How CNN reduce parameter and reuse weight?

Question

In the post A Comprehensive Guide to Convolutional Neural Networks — the ELI5 way, it says

A ConvNet is able to successfully capture the Spatial and Temporal dependencies in an image through the application of relevant filters. The architecture performs a better fitting to the image dataset due to the reduction in the number of parameters involved and reusability of weights.

I don't see how it reduce parameter and reuse weight. Could anyone give an example?

Abhi25t · Accepted Answer

Consider the filter (or kernel) in image below having 9 pixels and the image having 49 pixels.

In a fully connected layer, we'll have 9*49 = 441 weights.

While in a CNN this same filter keeps on moving (convolving) over the entire image. All pixel values in image will be multiplied with those same 9 values of filter (hence we say weights are reused). So, we need just 9 weights per filter instead of 441 in FC layer.

The job of a filter is to identify features (such as texture, lines etc), which could be anywhere in an image. So, we want to reuse this same filter over the entire image.

How CNN reduce parameter and reuse weight?

Answers (2)

Related Questions