Sumanta Roy

Reputation: 21

How to take a derivative of one of the outputs of a neural network (involving batched inputs) with respect to inputs?

I am solving a PDE using a neural network. My neural network is as follows:

def f(params, inputs):
  for w, b in params:
    outputs = jnp.dot(inputs, w) + b
    inputs = jnn.swish(outputs)
  return outputs

The layer architecture of the network is [1,5,2]. Hence, I have one input neuron and two output neurons, so if I pass a batch of 10 inputs, I should get a (10,2) array as output. Now let the output neurons be termed 'p' and 'q' respectively. How do I find dp/dx and dq/dx? I don't want to pick values out of full Jacobians and Hessians; I want more explicit functionality, something like this:

p = lambda inputs: f(params, inputs)[:,0].reshape(-1,1)
q = lambda inputs: f(params, inputs)[:,1].reshape(-1,1)

p_x = lambda inputs: vmap(jacfwd(p,argnums=0))(inputs)
q_x = lambda inputs: vmap(jacfwd(q,argnums=0))(inputs)

k_p_x = lambda inputs: kappa(inputs).reshape(-1,1) * p_x(inputs)
        ##And other calculations proceed..

When I execute p(inputs) it works as expected, but as soon as I execute p_x(inputs) I get an error: IndexError: Too many indices for array: 2 non-None/Ellipsis indices for dim 1.

How do I get around this?

Upvotes: 1

Views: 403

Answers (1)

jakevdp

Reputation: 86513

The reason you are seeing an IndexError is that your p function expects a two-dimensional input, but when you wrap it in vmap, the function effectively receives a single one-dimensional row at a time.

You can fix this by changing your function so that it accepts a one-dimensional input, and then use vmap as appropriate to compute the batched result.
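To see concretely what vmap does to the shapes, here is a minimal sketch with a toy function (not from the original post):

```python
import jax
import jax.numpy as jnp

batched = jnp.ones((10, 1))

# Under vmap the function sees one row of shape (1,) at a time,
# so two-dimensional indexing such as x[:, 0] raises an IndexError:
f_2d = lambda x: x[:, 0]
# jax.vmap(f_2d)(batched)  # IndexError: too many indices

# One-dimensional indexing matches the per-row shape and works:
f_1d = lambda x: x[0]
print(jax.vmap(f_1d)(batched).shape)  # (10,)
```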

Here is a complete example with the modified versions of your functions:

import jax
import jax.numpy as jnp
from jax import nn as jnn
from jax import vmap, jacfwd

def f(params, inputs):
  for w, b in params:
    outputs = jnp.dot(inputs, w) + b
    inputs = jnn.swish(outputs)
  return outputs

# Some example parameters and inputs
params = [
    (jnp.ones((1, 5)), 1.0),
    (jnp.ones((5, 2)), 1.0),
]

inputs = jnp.arange(10.0).reshape(10, 1)

# p and q map a length-1 input to a length-1 output
p = lambda inputs: f(params, inputs)[0].reshape(1)
q = lambda inputs: f(params, inputs)[1].reshape(1)

p_batched = vmap(p)
q_batched = vmap(q)

p_x = lambda inputs: vmap(jacfwd(p, argnums=0))(inputs)
q_x = lambda inputs: vmap(jacfwd(q, argnums=0))(inputs)

print(p_batched(inputs).shape)
# (10, 1)
print(q_batched(inputs).shape)
# (10, 1)

# Note: since p and q map a size 1 input to size-1 output,
# p_x and q_x compute a sequence of 10 1x1 jacobians.
print(p_x(inputs).shape)
# (10, 1, 1)
print(q_x(inputs).shape)
# (10, 1, 1)
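If the trailing 1x1 Jacobian dimensions feel indirect, an alternative sketch (reusing the same f and params, but with jax.grad in place of jacfwd) is to make p and q scalar-valued per sample, which yields dp/dx and dq/dx directly as (10, 1) arrays:

```python
import jax
import jax.numpy as jnp
from jax import nn as jnn

def f(params, inputs):
  for w, b in params:
    outputs = jnp.dot(inputs, w) + b
    inputs = jnn.swish(outputs)
  return outputs

params = [
    (jnp.ones((1, 5)), 1.0),
    (jnp.ones((5, 2)), 1.0),
]
inputs = jnp.arange(10.0).reshape(10, 1)

# Each scalar function maps a length-1 input to a scalar,
# which is exactly what jax.grad requires.
p_scalar = lambda x: f(params, x)[0]
q_scalar = lambda x: f(params, x)[1]

# vmap(grad(...)) gives one derivative per sample, shape (10, 1),
# which lines up directly with e.g. kappa(inputs).reshape(-1, 1).
dp_dx = jax.vmap(jax.grad(p_scalar))(inputs)
dq_dx = jax.vmap(jax.grad(q_scalar))(inputs)
print(dp_dx.shape)  # (10, 1)
```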

Upvotes: 1
