Where is the code that performs fake quantization in TensorFlow?

Question

I would like to make a custom quantizer (not the standard 8-bit one) in TensorFlow.

I've gone through the code in tensorflow\tensorflow\contrib\quantize\python and can see how the nodes are added, but I would like to modify how the tf.fake_quant_with_min_max_vars function calculates its outputs.

I cannot seem to find the code that actually does the 32-bit accumulation and the downsampling to 8 bits. Can anyone point me to where this code resides?
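
To make clear which computation I mean, here is a minimal pure-Python sketch of fake quantization as it is commonly described: clamp the input to a [min, max] range, snap it to one of 2^num_bits evenly spaced levels, and return the result as a float. The function name `fake_quant`, its parameters, and the zero-point nudging step are assumptions for illustration, not TensorFlow's actual code; `num_bits` is the part I would like to change.

```python
import math


def fake_quant(x, min_val, max_val, num_bits=8, narrow_range=False):
    """Hypothetical scalar fake-quantize: quantize then dequantize, in float."""
    # Integer range of the quantized representation.
    quant_min = 1 if narrow_range else 0
    quant_max = (1 << num_bits) - 1

    # Width of one quantization step in real-valued units.
    scale = (max_val - min_val) / (quant_max - quant_min)

    # Nudge the range so that 0.0 falls exactly on a quantization level
    # (assumed behaviour, shown here for illustration).
    zero_point_from_min = quant_min - min_val / scale
    if zero_point_from_min < quant_min:
        nudged_zero_point = quant_min
    elif zero_point_from_min > quant_max:
        nudged_zero_point = quant_max
    else:
        nudged_zero_point = math.floor(zero_point_from_min + 0.5)
    nudged_min = (quant_min - nudged_zero_point) * scale
    nudged_max = (quant_max - nudged_zero_point) * scale

    # Clamp to the nudged range, round to the nearest level, map back to float.
    clamped = min(max(x, nudged_min), nudged_max)
    level = math.floor((clamped - nudged_min) / scale + 0.5)
    return level * scale + nudged_min


# A 4-bit quantizer over [0, 15] has 16 levels spaced 1.0 apart.
print(fake_quant(7.6, 0.0, 15.0, num_bits=4))
```

Everything in this sketch stays in floating point; there is no integer accumulation step in it, which is part of why I am unsure where the 32-bit accumulate happens.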


Answers (1)
