Skip to content

After upgrading from 8.6 to 10.8 or 10.9, tensorrt's results are inconsistent with onnxrt #4400

Open
@2730gf

Description

@2730gf

Description

After upgrading tensorrt to 10.8, the model accuracy decreased.
After setting all nodes of the model to output, the model accuracy was aligned. It was suspected that the fusion strategy introduced by the upgrade caused the accuracy problem.
Finally, we used the polygraphy tool to find an onnx subgraph that could reproduce the problem.
By the way, it needs to be reminded that the model only uses fp32 precision, not even fp16 or int8

Environment

TensorRT Version: 10.8 & 10.9

NVIDIA GPU: 3090

NVIDIA Driver Version: 550.67

CUDA Version: 12.2

Relevant Files

I post onnx file here:
Model link: https://github.com/2730gf/issues/blob/main/trt_inconsistent/mini_graph.onnx

Steps To Reproduce

Commands or scripts:
polygraphy run mini_graph.onnx -v -v -v -v -v --pool-limit workspace:20G --onnxrt --trt --validate --atol 1e-4 --rtol 1e-3 --onnx-outputs p2o.Concat.125 --trt-outputs p2o.Concat.125
Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    Module:AccuracyOutput mismatch between TensorRT and other frameworksinternal-bug-trackedTracked internally, will be fixed in a future release.triagedIssue has been triaged by maintainers

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions