Reputation: 1529
It appears that Keras has done alot of the heavy lifting for you when it comes to the CTC function. However I am finding it tricky to build a decode function which I don't want to run as part of my neural network. I have a custom function that is executed on epoch end which I then iterate through all my test data and evaluate the metrics, I am currently doing this by hand but want to make use of the k.ctc_decode function (both greedy and beam) however I am finding it hard to access and incorporate into my custom function.
I have a model:
# Define CTC loss
def ctc_lambda_func(args):
y_pred, labels, input_length, label_length = args
return K.ctc_batch_cost(labels, y_pred, input_length, label_length)
def ctc_decode(args):
y_pred, input_length =args
seq_len = tf.squeeze(input_length,axis=1)
return K.ctc_decode(y_pred=y_pred, input_length=seq_len, greedy=True, beam_width=100, top_paths=1)
input_data = Input(name='the_input', shape=(None,mfcc_features))
x = TimeDistributed(Dense(fc_size, name='fc1', activation='relu'))(input_data)
y_pred = TimeDistributed(Dense(num_classes, name="y_pred", activation="softmax"))(x)
labels = Input(name='the_labels', shape=[None,], dtype='int32')
input_length = Input(name='input_length', shape=[1], dtype='int32')
label_length = Input(name='label_length', shape=[1], dtype='int32')
loss_out = Lambda(ctc_lambda_func, output_shape=(1,), name='ctc')([y_pred,labels,input_length,label_length])
dec = Lambda(ctc_decode, output_shape=[None,], name='decoder')([y_pred,input_length])
model = Model(inputs=[input_data, labels, input_length, label_length], outputs=[loss_out])
iterate = K.function([input_data, K.learning_phase()], [y_pred])
decode = K.function([y_pred, input_length], [dec])
Current error is:
dec = Lambda(ctc_decode, name='decoder')([y_pred,input_length]) File "/home/rob/py27/local/lib/python2.7/site-packages/keras/engine/topology.py", line 604, in call output_shape = self.compute_output_shape(input_shape) File "/home/rob/py27/local/lib/python2.7/site-packages/keras/layers/core.py", line 631, in compute_output_shape return K.int_shape(x) File "/home/rob/py27/local/lib/python2.7/site-packages/keras/backend/tensorflow_backend.py", line 451, in int_shape shape = x.get_shape() AttributeError: 'tuple' object has no attribute 'get_shape'
Any ideas how I can do this?
Upvotes: 2
Views: 2457
Reputation: 1409
One tricky part is that K.ctc_decode
returns tuple of single list of tensors, not a single tensor, so you can't create a layer straightforwardly. Instead try creating a decoder with K.function
:
top_k_decoded, _ = K.ctc_decode(y_pred, input_lengths)
decoder = K.function([input_data, input_lengths], [top_k_decoded[0]])
Later you can call your decoder:
decoded_sequences = decoder([test_input_data, test_input_lengths])
You may need some reshaping, as K.ctc_decoder
requires lengths to have shape like (samples), while the lengths tensor was of shape (samples, 1).
Upvotes: 1