Mehran

Reputation: 16841

How to tell Tensorflow Serving to leave my named output untouched?

I want to create a model signature that returns named tensors (using Keras). Here is what I mean by that: when the model is deployed to TF-Serving, I want it to return a JSON response like this:

{
    "predictions": [
        {
            "t3": 19,
            "t1": 76.975174,
            "t2": "cat3"
        },
        {
            "t3": 17,
            "t1": 77.7983246,
            "t2": "cat3"
        }
    ]
}

The important parts are t1, t2, and t3; those names were chosen by me. If I hadn't named them, the returned JSON would have been this:

{
    "predictions": [
        {
            "output_0": 77.5714188,
            "output_1": "cat3",
            "output_2": 17
        },
        {
            "output_0": 80.7243729,
            "output_1": "cat4",
            "output_2": 17
        }
    ]
}

The names output_0, output_1, and output_2 are automatically generated by some component (I'm not sure which one, but I'd guess TF or Keras). I managed to pull this off, but only in specific scenarios.

This is what I have so far:

from tensorflow.keras import layers


class OutputWithNames(layers.Layer):
    def call(self, x):
        # x arrives as a list of output tensors; return a dict so each
        # tensor is exposed under an explicit name in the signature.
        return {"t1": x[0], "t2": x[1], "t3": x[2]}

Adding this custom layer as the last one in my model, I get exactly what I described: a JSON object with the desired property names t1, t2, and t3.
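
For context, here is a minimal sketch of how I wire such a layer into a functional model (the Dense heads are hypothetical placeholders standing in for whatever actually produces t1, t2, and t3):

import tensorflow as tf
from tensorflow.keras import layers


class OutputWithNames(layers.Layer):
    def call(self, x):
        # Map the list of output tensors to explicit signature names.
        return {"t1": x[0], "t2": x[1], "t3": x[2]}


# Hypothetical three-headed model.
inputs = tf.keras.Input(shape=(4,), name="features")
hidden = layers.Dense(8, activation="relu")(inputs)
heads = [layers.Dense(1)(hidden) for _ in range(3)]

named = OutputWithNames()(heads)
model = tf.keras.Model(inputs=inputs, outputs=named)

# Export as a SavedModel version directory for TF Serving (newer Keras
# versions may require model.export("export/1") instead).
model.save("export/1")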

Inspecting the saved model using the saved_model_cli tool, this is what I get as my output signature when the resulting model works as expected:

The given SavedModel SignatureDef contains the following output(s):
  outputs['t1'] tensor_info:
      dtype: DT_FLOAT
      shape: (-1)
      name: StatefulPartitionedCall_1:0
  outputs['t2'] tensor_info:
      dtype: DT_STRING
      shape: (-1)
      name: StatefulPartitionedCall_1:1
  outputs['t3'] tensor_info:
      dtype: DT_INT32
      shape: (-1)
      name: StatefulPartitionedCall_1:2
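
For reference, that listing comes from an invocation along these lines (the export path is a placeholder):

saved_model_cli show --dir export/1 --tag_set serve --signature_def serving_default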

And again, so far everything's fine. But when I have only a single output:

from tensorflow.keras import layers


class OutputWithNames(layers.Layer):
    def call(self, x):
        # x is a single tensor here, so there is nothing to index into.
        return {"t1": x}

Note: In the first code block, x is a list of Tensors, since I have multiple outputs; that's why I can (and have to) use x[0]. But in the second code block, x is simply a Tensor, since there's only one output; that's why there's no [0] indexing after x.

This time, TF-Serving will generate this JSON for me:

{
    "predictions": [
        67.2723083,
        68.9468231
    ]
}

While this is what I was expecting to see:

{
    "predictions": [
        {
            "t1": 67.2723083
        },
        {
            "t1": 68.9468231
        }
    ]
}

And this is what the saved_model_cli tool returns for it:

The given SavedModel SignatureDef contains the following output(s):
  outputs['t1'] tensor_info:
      dtype: DT_INT64
      shape: (-1)
      name: StatefulPartitionedCall:0

Basically, the saved model has the output name correctly set. But for some reason, TF-Serving strips it and, instead of returning an array of objects, returns only the bare values.

My question is: how can I force TF-Serving to return a list of objects when there's only a single output?

Upvotes: 4

Views: 497

Answers (1)

Kyle F. Hartzenberg

Reputation: 3680

The answer is that you can't, not without changes to the way TF Serving is implemented. The TF Serving documentation states:

... If the output of the model contains only one named tensor, we omit the name and predictions key maps to a list of scalar or list values. If the model outputs multiple named tensors, we output a list of objects instead, similar to the request in row-format mentioned above. ...

If you don't know beforehand which model you will be using (and thus its expected output), my suggestion would be to:

  1. Parse the returned JSON predictions object.
  2. Check whether the JSON array contains bare values or objects (see the sketch after this list).
    1. If it contains values, you know the model's output contained only one named tensor, so you can apply whatever logic you intended for "t1" on its own.
    2. If it contains objects, you know the model's output contained multiple named tensors, each carrying the names you already set (i.e. "t3", "t1", "t2").
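
A minimal sketch of that check, assuming the model is served under a hypothetical name at TF Serving's local REST endpoint:

import requests  # assumption: the model is queried over TF Serving's REST API

# Hypothetical endpoint and input; adjust to your model's name and input shape.
resp = requests.post(
    "http://localhost:8501/v1/models/my_model:predict",
    json={"instances": [[1.0, 2.0, 3.0, 4.0]]},
)
predictions = resp.json()["predictions"]

for pred in predictions:
    if isinstance(pred, dict):
        # Multiple named tensors: keys are the names set in the model
        # (e.g. "t1", "t2", "t3").
        t1 = pred["t1"]
    else:
        # Single named tensor: TF Serving unwrapped it to a bare value.
        t1 = pred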

Alternatively, if you can add an additional (arbitrary) Tensor to the output, for example by defining a specific Predict SignatureDef, you may be able to force TF Serving to return a list of objects; one field of each object will be arbitrary, but you can ignore it by name. To quote from that link:

Predict SignatureDefs also allow you to add optional additional Tensors to the outputs, that you can explicitly query. Let's say that in addition to the output key below of scores, you also wanted to fetch a pooling layer for debugging or other purposes. In that case, you would simply add an additional Tensor with a key like pool and appropriate value.

signature_def: {
  key  : "my_prediction_signature"
  value: {
    inputs: {
      key  : "images"
      value: {
        name: "x:0"
        dtype: ...
        tensor_shape: ...
      }
    }
    outputs: {
      key  : "scores"
      value: {
        name: "y:0"
        dtype: ...
        tensor_shape: ...
      }
    }
    method_name: "tensorflow/serving/predict"
  }
}
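
In Keras terms, a hypothetical version of that workaround (the "_ignore" key is my own placeholder, not part of any API) might look like this:

import tensorflow as tf
from tensorflow.keras import layers


class OutputWithDummy(layers.Layer):
    def call(self, x):
        # "_ignore" is an arbitrary second named tensor whose only job is
        # to push the output past one name, so that TF Serving (per the
        # docs quoted above) returns a list of objects instead of values.
        return {"t1": x, "_ignore": tf.zeros_like(x)}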

I am not familiar with defining or implementing such signatures, but hopefully that offers another direction to explore if the basic parsing approach above is not feasible in your situation.

Upvotes: 1
