Konda

Reputation: 21

How to convert pytorch model to TensorRT?

I have trained a classification model on an Nvidia GPU and saved the model weights (checkpoint.pth). I want to deploy this model on a Jetson Nano and test it.

Should I convert it to TensorRT? If so, how do I convert it to TensorRT?

I am new to this, so it would be helpful if someone could correct me where needed.

Upvotes: 1

Views: 7182

Answers (3)

Michael Litvin

Reputation: 4126

You can use Torch-TensorRT.

Torch-TensorRT is an inference compiler for PyTorch, targeting NVIDIA GPUs via NVIDIA's TensorRT Deep Learning Optimizer and Runtime. It supports both just-in-time (JIT) compilation workflows via the torch.compile interface and ahead-of-time (AOT) workflows. Torch-TensorRT integrates seamlessly into the PyTorch ecosystem, supporting hybrid execution of optimized TensorRT code alongside standard PyTorch code.
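As a rough sketch of the AOT workflow (assuming a classifier that takes 224x224 RGB input; adjust the shape and precision for your model):

    import torch
    import torch_tensorrt

    # Assumes `model` is your trained classifier, already in eval mode on the GPU
    model = model.eval().cuda()
    example_input = torch.randn(1, 3, 224, 224).cuda()

    # Ahead-of-time compilation into a TensorRT-optimized module
    trt_model = torch_tensorrt.compile(
        model,
        inputs=[torch_tensorrt.Input(example_input.shape)],
        enabled_precisions={torch.float16},  # optional: allow FP16 kernels
    )

    # The compiled module is called like a normal PyTorch module
    output = trt_model(example_input)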

Upvotes: 0

chiehpower

Reputation: 107

The best way is to export an ONNX model from PyTorch, and then use trtexec, the command-line tool shipped with the official TensorRT package, to build a TensorRT engine from the ONNX model.
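For example, a minimal export might look like this (a sketch assuming a ResNet-18 classifier; replace the architecture, num_classes, and input shape with your own):

    import torch
    import torchvision.models as models

    # Rebuild the model architecture and load the trained weights
    # (resnet18 and num_classes=10 are placeholders for your own model)
    model = models.resnet18(num_classes=10)
    model.load_state_dict(torch.load("checkpoint.pth", map_location="cpu"))
    model.eval()

    # Export to ONNX with a dummy input matching the training resolution
    dummy_input = torch.randn(1, 3, 224, 224)
    torch.onnx.export(
        model,
        dummy_input,
        "model.onnx",
        input_names=["input"],
        output_names=["output"],
        opset_version=13,
    )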

You can refer to this page: https://github.com/NVIDIA/TensorRT/blob/master/samples/opensource/trtexec/README.md

trtexec is a more native tool; you can get it from NVIDIA NGC images or download it directly from the official website.
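Once you have the ONNX file, building an engine is a single command (the file names here are placeholders, and --fp16 is optional):

    trtexec --onnx=model.onnx --saveEngine=model.engine --fp16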

If you use a tool such as torch2trt, it is easy to run into unsupported-operator issues, and these can be quite complicated to resolve (especially if you are not familiar with writing TensorRT plugins).

Upvotes: 2

User_12399

Reputation: 123

You can use this tool:

https://github.com/NVIDIA-AI-IOT/torch2trt

Here are more details on how to implement a converter to an engine file:

https://github.com/NVIDIA-AI-IOT/torch2trt/issues/254
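For reference, basic torch2trt usage is only a few lines (a sketch assuming your trained model and a 224x224 input are already on the GPU):

    import torch
    from torch2trt import torch2trt

    # Assumes `model` is your trained classifier in eval mode on the GPU
    x = torch.randn(1, 3, 224, 224).cuda()
    model_trt = torch2trt(model, [x])

    # The converted module is called like a normal PyTorch module
    y_trt = model_trt(x)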

Upvotes: 0
