Keras: Trying to model.predict() gives "ValueError: Tensor's shape is not compatible with supplied shape"

Question

I'm following the TensorFlow Keras tutorial for text generation. The training part works perfectly, but when I try to predict the next token, I get an error. Here's all the important code:

Making the vocabulary and dataset.

vocab = sorted(set(text))

char2index = { c:i for i, c in enumerate(vocab) }
index2char = np.array(vocab)
chars_to_int = np.array([char2index[c] for c in text])

char_dataset = tf.data.Dataset.from_tensor_slices(chars_to_int)
sequences = char_dataset.batch(seq_length + 1, drop_remainder=True)

def split_input_and_target(sequence):
    input_ = sequence[:-1]
    target_ = sequence[1:]
    return input_, target_

dataset = sequences.map(split_input_and_target)
dataset = dataset.shuffle(BUFFER_SIZE).batch(BATCH_SIZE, drop_remainder=True)

Building the model (important part here is that BATCH_SIZE = 64):

model = tf.keras.Sequential()

model.add(tf.keras.layers.Embedding(len(vocab), EMBEDDING_DIM,
                    batch_input_shape=[BATCH_SIZE, None]))
# here are a few more layers

model.compile(loss="sparse_categorical_crossentropy", optimizer="adam")
model.fit(dataset, epochs=EPOCHS)

Actually trying to generate text (this one was copied almost directly from the tutorial after I started getting desperate):

num_tokens = 100
seed = "some text"
input_eval = [char2index[c] for c in seed]
input_eval = tf.expand_dims(input_eval, 0)
text_generated = []

model.reset_states()

for i in range(num_tokens):
    predictions = model(input_eval)
    predictions = tf.squeeze(predictions, 0)
    # more stuff

Then, I first get a warning:

WARNING:tensorflow:Model was constructed with shape (64, None) for input Tensor("embedding_14_input:0", shape=(64, None), dtype=float32), but it was called on an input with incompatible shape (1, 9).

Then it gives me an error:

---->3     predictions = model(input_eval)
...
ValueError: Tensor's shape (9, 64, 256) is not compatible with supplied shape [9, 1, 256]

The second number, 64, is my batch size. If I change BATCH_SIZE to 1, everything works and all is fine, but this is obviously not the solution I am hoping for.

Keras: Trying to model.predict() gives "ValueError: Tensor's shape is not compatible with supplied shape"

Answers (1)

Related Questions

Keras: Trying to model.predict() gives &quot;ValueError: Tensor&#39;s shape is not compatible with supplied shape&quot;

Answers (1)

Related Questions

Keras: Trying to model.predict() gives "ValueError: Tensor's shape is not compatible with supplied shape"