Inception style convolution

Question

I have the necessity to keep the model as small as possible to deploy an image classifier that can run efficiently on an app (the accuracy is not really relevant for me)

I recently approached deep learning and I haven't great experience, hence I'm currently playing with the cifar-10 example. I tried to replace the first two 5x5 convolutional layers with two 3x3 convolution each, as described in the inception paper.

Unluckily, when I'm going to classify the test set, I got around 0.1 correct classification (random choice)

This is the modified code of the first layer (the second is similar):

with tf.variable_scope('conv1') as scope:
    kernel_l1 = _variable_with_weight_decay('weights_l1', shape=[3, 3, 3, 64],
                                     stddev=1e-4, wd=0.0)
    kernel_l2 = _variable_with_weight_decay('weights_l2', shape=[3, 3, 64, 1],
                                     stddev=1e-4, wd=0.0)
    biases = _variable_on_cpu('biases', [64], tf.constant_initializer(0.0))

    conv_l1 = tf.nn.conv2d(images, kernel_l1, [1, 1, 1, 1], padding='SAME')
    conv_l2 = tf.nn.depthwise_conv2d(conv_l1, kernel_l2, [1, 1, 1, 1], padding='SAME')
    bias = tf.nn.bias_add(conv_l2, biases)
    conv1 = tf.nn.relu(bias, name=scope.name)
    _activation_summary(conv1)

Is it correct?

Inception style convolution

Answers (1)

Related Questions