Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

       2) Add its inputs / outputs to the queue (for future visit)

Future Work

FP16 Integration

The current integration of TensorRT into MXNet supports only FP32 float values for tensors.  Allowing FP16 values would enable many further optimizations on Jetson and Volta devices.

https://jira.apache.org/jira/browse/MXNET-1084

Subgraph Integration

The new subgraph API is a natural fit for TensorRT.  To help make the codebase consistent we'd like to port the current TensorRT integration to use the new API.  The experimental integration into MXNet requires us to use contrib API calls.  Once integration has moved to use the subgraph API users will be able to use TensorRT with a consistent API.  Porting should also enable acceleration of gluon and module base models.

https://jira.apache.org/jira/browse/MXNET-1085

Increased Operator Coverage

The current operator cover is fairly limited.  We'd like to enable all models that TensorRT is able to work with. 

Jira
serverASF JIRA
serverId5aa69414-a9e9-3523-82ec-879b028fb15b
keyMXNET-1086

Currently supported operators:

...

elemwise_add

...

Operators to be added:

...