How is downscaling image affecting dominant color in it?

Question

I am looking to efficiently and quickly compute the most dominant color in an image. By dominant I mean the color present in the most amount of pixels. When attempting to implement this I quickly noticed the biggest bottleneck was in looping through the sheer amount of pixels in images. To optimize this I experimented with rescaling the image, I noticed as I rescaled the image the dominant colors became more and more prominent. It also greatly improved my algos performance because the number of pixels I analyzed greatly dropped. Rescaling is somewhat expensive but if done once and cached I can live with it.

My question to the stack overflow community is how safe is it to rescale like this? I am concerned I am significantly trading accuracy for performance. It seems to work fine, but I would love an experts feedback. Not looking to write a paper or create the next lighting fast image processing algo, just need it to work and be reasonably efficient.

Cloud · Accepted Answer

In terms of performance cost, your downscaling algorithm is going to be the most expensive operation. Assuming your input image is a square image, for the sake of simplicity, with dimensions of AxA, and the output image is of dimensions BxB, you'll typically do something like so:

Apply pre-filtering
Up-sample the image by a factor of B
Convert to frequency domain
Anti-aliasing filter
Down-sample by a factor of A
Apply post-processing filters

Assuming you are using a trivial down-sampling mechanism (ie: decimation or discarding every n'th row/column, etc), this cost is greatly reduced. By using a simpler down-sampling method, you trade off quality for performance (less memory, fewer CPU cycles used, etc).

To your question: down-sampling is affecting the dominant color:

By discarding data permanently, in the case of decimation.
Changing the measured data, in the case of more advanced interpolation/re-sampling methods.

The metrics you generate from the down-sampled image will be less accurate, but not necessarily less precise. That's it.

Computing the dominant color in an image is fairly cheap compared to resampling it with any method other than possibly simply decimation. Assuming even something like images with 24-bit color depth, a modern 64-bit PC will, at most, use 2^24 * (64bits / 8bits-per-byte) = 134217728 bytes of memory. You could just allocate a large chunk of memory and implement a simple histogram. You'd simply execute A*B addition operations, and another A*B comparisons, so it'd be of linear execution complexity and constant memory complexity.

How is downscaling image affecting dominant color in it?

Answers (2)

Related Questions