Daniel Ji
Daniel Ji

Reputation: 169

How to resize a PyTorch tensor?

I have a PyTorch tensor of size (5, 1, 44, 44) (batch, channel, height, width), and I want to 'resize' it to (5, 1, 224, 224)

How can I do that? What functions should I use?

Upvotes: 15

Views: 36561

Answers (3)

Don Feto
Don Feto

Reputation: 1524

Building on the first answer, you can get better results.

if you trying to increase the size of the image (Enlarging) to use it later in the deep learning model (your case)

(Linear interpolation is better than bicubic interpolation).

resized_tensor = F.interpolate(input_tensor, size=(224, 224), mode='bilinear', align_corners=False)

Since bilinear interpolation:

  • Faster than bicubic (you will use it with large dataset)
  • Uses 2x2 pixel neighborhood instead of 4x4, which will require less computation
  • slightly softer images compared to cubic

if you care about visual quality of the images use bicubic but note that it is slower and produce sharper images

Upvotes: 0

Shai
Shai

Reputation: 114926

It seems like you are looking for interpolate (a function in nn.functional):

import torch.nn.functional as nnf

x = torch.rand(5, 1, 44, 44)
out = nnf.interpolate(x, size=(224, 224), mode='bicubic', align_corners=False)

If you really care about the accuracy of the interpolation, you should have a look at ResizeRight: a pytorch/numpy package that accurately deals with all sorts of "edge cases" when resizing images. This can have an effect when directly merging features of different scales: inaccurate interpolation may result in misalignments.

Upvotes: 23

iacob
iacob

Reputation: 24331

The TorchVision transforms.functional.resize() function is what you're looking for:

import torchvision.transforms.functional as F

t = torch.randn([5, 1, 44, 44])
t_resized = F.resize(t, 224)

If you wish to use another interpolation mode than bilinear, you can specify this with the interpolation argument.

Upvotes: 4

Related Questions