
...

2) Add its inputs / outputs to the queue (for future visit)
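The enqueue step above resembles a standard breadth-first traversal, which can be sketched as follows. This is a minimal illustration and not the MXNet implementation: the `Node` class, its `inputs` and `outputs` fields, and the `connect` and `collect_connected` helpers are hypothetical names introduced here, and the `accept` predicate stands in for whatever test decides that a node should be visited.

```python
from collections import deque


class Node:
    """Hypothetical graph node that knows its producers and consumers."""

    def __init__(self, name):
        self.name = name
        self.inputs = []   # nodes whose results this node consumes
        self.outputs = []  # nodes that consume this node's result


def connect(src, dst):
    """Add a directed edge src -> dst."""
    src.outputs.append(dst)
    dst.inputs.append(src)


def collect_connected(start, accept):
    """Breadth-first walk from `start` across both inputs and outputs.

    A node is visited only if `accept(node)` is true; a rejected node is
    skipped and its neighbours are not enqueued through it.
    """
    visited = []
    seen = {start.name}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if not accept(node):
            continue
        visited.append(node.name)
        # Add the node's inputs and outputs to the queue for a future visit.
        for neighbour in node.inputs + node.outputs:
            if neighbour.name not in seen:
                seen.add(neighbour.name)
                queue.append(neighbour)
    return visited


# Chain a -> b -> c -> d, where d is rejected by the predicate.
a, b, c, d = (Node(n) for n in "abcd")
connect(a, b)
connect(b, c)
connect(c, d)
print(collect_connected(b, lambda n: n.name != "d"))
```

Tracking `seen` at enqueue time rather than at visit time keeps each node in the queue at most once, even when it is reachable as both an input and an output.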

Future Work

FP16 Integration

The current integration of TensorRT into MXNet supports only FP32 tensors. Supporting FP16 tensors as well would enable many further optimizations on Jetson and Volta devices.

https://jira.apache.org/jira/browse/MXNET-1084
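As background for the FP16 item, the trade-off between the two formats can be seen with the Python standard library alone. The sketch below is only an illustration of IEEE 754 half versus single precision; it does not use MXNet or TensorRT, and the `roundtrip` helper is a name introduced here.

```python
import struct

FP32 = "<f"  # IEEE 754 single precision, little-endian
FP16 = "<e"  # IEEE 754 half precision, little-endian


def roundtrip(value, fmt):
    """Pack `value` in the given format and unpack it again."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


fp32_bytes = struct.calcsize(FP32)
fp16_bytes = struct.calcsize(FP16)

value = 0.1
fp32_error = abs(roundtrip(value, FP32) - value)
fp16_error = abs(roundtrip(value, FP16) - value)

print("bytes per value:", fp32_bytes, "(FP32) vs", fp16_bytes, "(FP16)")
print("rounding error for 0.1:", fp32_error, "(FP32) vs", fp16_error, "(FP16)")
```

Each FP16 value takes half the storage of an FP32 value, at the cost of a larger rounding error.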

Subgraph Integration

The new subgraph API is a natural fit for TensorRT. To help keep the codebase consistent, we'd like to port the current TensorRT integration to the new API. The current experimental integration requires users to call contrib APIs. Once the integration has moved to the subgraph API, users will be able to use TensorRT through a consistent API. Porting should also enable acceleration of Gluon- and Module-based models.

https://jira.apache.org/jira/browse/MXNET-1085

Increased Operator Coverage

The current operator coverage is fairly limited. We'd like to enable all models that TensorRT is able to work with.

https://jira.apache.org/jira/browse/MXNET-1086

Currently supported operators:

...

elemwise_add

...

Operators to be added:

...
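One way to reason about operator coverage is to compare the operators a model uses against the supported set. The sketch below is a hedged illustration, not part of the MXNet codebase: `unsupported_operators` is a hypothetical helper, the supported set contains only `elemwise_add` because the rest of the list is elided above, and the `hypothetical_op_*` names are placeholders rather than real operators.

```python
def unsupported_operators(model_ops, supported_ops):
    """Return operators used by the model that are missing from supported_ops.

    The result keeps first-seen order and contains no duplicates.
    """
    missing = []
    for op in model_ops:
        if op not in supported_ops and op not in missing:
            missing.append(op)
    return missing


# Only entry visible in the supported list; the remainder is elided.
supported = {"elemwise_add"}

# Placeholder operator sequence for an imaginary model.
model_ops = [
    "elemwise_add",
    "hypothetical_op_a",
    "elemwise_add",
    "hypothetical_op_b",
    "hypothetical_op_a",
]

missing = unsupported_operators(model_ops, supported)
print(missing)  # ['hypothetical_op_a', 'hypothetical_op_b']
```

An empty result would mean every operator in the model is in the supported set.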
