Reputation: 151
CNN architectures like DenseNet stress parameter efficiency, which usually results in fewer FLOPs. However, what I am struggling to understand is why this is important. DenseNet, in particular, has low inference speed. Isn't the purpose of reducing parameter count/FLOPs to decrease inference time? Is there another real-world reason, such as lower energy consumption, for these optimizations?
Upvotes: 2
Views: 1033
Reputation: 203
There is a difference between parameter/FLOP efficiency and overall inference time. Having fewer parameters or FLOPs does not guarantee faster inference, because wall-clock inference speed also depends on the architecture and on how the predictions are actually computed, e.g. how well the operations parallelize on the hardware and how they access memory.
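A minimal NumPy sketch of this point: two hypothetical "models" below perform exactly the same number of multiply-accumulates, but one does a single wide matrix multiply while the other chains many small sequential ones (loosely mirroring DenseNet's many concatenated layers). The sizes and layer counts are made up for illustration; the sequential variant typically runs slower despite the identical FLOP budget.

```python
import time
import numpy as np

rng = np.random.default_rng(0)

d = 512
x = rng.standard_normal((1, d))

# Model A: one wide layer, a single (d x 16d) matmul -> 16*d^2 MACs.
wide = rng.standard_normal((d, 16 * d))

# Model B: 16 narrow (d x d) layers applied sequentially -> also 16*d^2 MACs.
narrow = [rng.standard_normal((d, d)) for _ in range(16)]

flops_a = d * (16 * d)   # MACs for model A
flops_b = 16 * d * d     # MACs for model B
assert flops_a == flops_b  # identical FLOP budget

def run_a():
    return x @ wide

def run_b():
    h = x
    for w in narrow:
        h = h @ w  # each small matmul is a separate, poorly parallelized step
    return h

# Time both: equal FLOPs, different wall-clock behavior.
for name, fn in [("wide", run_a), ("narrow", run_b)]:
    fn()  # warm-up
    t0 = time.perf_counter()
    for _ in range(100):
        fn()
    print(f"{name}: {time.perf_counter() - t0:.4f}s")
```

So a FLOP count is a rough proxy at best: latency is also driven by depth (sequential dependencies), memory traffic, and kernel launch overhead, which is why a FLOP-efficient network like DenseNet can still be slow at inference.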
Upvotes: 1