
Error Code 2: Internal Error (Assertion !mValueMapUndo failed. ) failure of TensorRT 10.5 when running speechbrain language detection model on GPU NVIDIA GeForce RTX 3090 #4277

Open

msublee opened this issue Dec 10, 2024 · 8 comments

Labels: Engine Build (Issues with engine build), internal-bug-tracked (Tracked internally, will be fixed in a future release), triaged (Issue has been triaged by maintainers)


msublee commented Dec 10, 2024

Description

I converted the speechbrain language detection model to an ONNX model and then tried to build a TensorRT engine from it with trtexec, but the error below occurred.

Error[2]: [graphShapeAnalyzer.cpp::eraseFromTensorMaps::1138] Error Code 2: Internal Error (Assertion !mValueMapUndo failed. )

Environment

TensorRT Version: 10.5.0.18 (Container version 24.10)

NVIDIA GPU: NVIDIA GeForce RTX 3090

NVIDIA Driver Version: 550.127.05

CUDA Version: 12.4

Operating System:

Python Version (if applicable): 3.10

PyTorch Version (if applicable): 2.4.1

Steps To Reproduce

Commands or scripts:

trtexec --onnx=/workspace/model.onnx --saveEngine=/workspace/output/model.plan.bsz4 --memPoolSize=workspace:8192 --minShapes=wavforms:1x1,wav_lens:1x1 --optShapes=wavforms:4x320000,wav_lens:4x1 --maxShapes=wavforms:4x320000,wav_lens:4x1 --fp16

Have you tried the latest release?: I tried container version 24.11

@msublee changed the title on Dec 11, 2024, replacing the "TensorRT X.Y" placeholder from the issue template with "TensorRT 10.5".
@lix19937

Can you add --verbose and attach the build log here?

@asfiyab-nvidia asfiyab-nvidia self-assigned this Dec 16, 2024
@asfiyab-nvidia asfiyab-nvidia added Engine Build Issues with engine build triaged Issue has been triaged by maintainers labels Dec 16, 2024

TigerSong commented Dec 18, 2024

I get the same error in TRT 10.5 and TRT 10.7:

[12/18/2024-12:20:52] [E] Error[2]: [graphShapeAnalyzer.cpp::eraseFromTensorMaps::1138] Error Code 2: Internal Error (Assertion !mValueMapUndo failed. )
[12/18/2024-12:20:52] [E] Engine could not be created from network
[12/18/2024-12:20:52] [E] Building engine failed
[12/18/2024-12:20:52] [E] Failed to create engine from model or file.
[12/18/2024-12:20:52] [E] Engine set up failed

Adding --verbose reveals nothing further.

I posted my topic on the forum:
https://forums.developer.nvidia.com/t/trt10-5-10-7-trtexec-convert-onnx-model-failed-error-code-2-internal-error-assertion-mvaluemapundo-failed/317205

@asfiyab-nvidia (Collaborator)

@msublee please provide the ONNX model and the trtexec command used so we can investigate.


msublee commented Dec 19, 2024

build log with --verbose: trtlog.txt

The model is too large to upload. What should I do? @asfiyab-nvidia

@asfiyab-nvidia (Collaborator)

Thanks for the log @msublee. You can upload your model to Google Drive and share a link. That will help us reproduce the issue locally.

@asfiyab-nvidia asfiyab-nvidia added the internal-bug-tracked Tracked internally, will be fixed in a future release. label Dec 20, 2024

msublee commented Dec 20, 2024

onnx model link: https://drive.google.com/drive/folders/1feKnT5egNIdVr2xheURHCWq9R2Q_yYuw?usp=drive_link

The link above contains two model files: "model.onnx", which is a model converted using torch.onnx.export, and "model.sim.onnx", which is a simplified version of "model.onnx" created using onnxsim.
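For reference, the export and simplification steps looked roughly like the sketch below. This is a minimal reconstruction, not the exact script: "model" is a placeholder for the loaded speechbrain network, and the opset version is an assumption; only the input/output names and shapes come from the commands and logs in this issue.

import torch
import onnx
from onnxsim import simplify

# `model` is a placeholder for the loaded speechbrain language-ID network
# (the loading code is not shown in this issue). Input/output names match
# the trtexec command and the error log; opset 17 is an assumption.
dummy_wavs = torch.randn(4, 320000)  # (batch, samples)
dummy_lens = torch.ones(4, 1)        # relative utterance lengths
torch.onnx.export(
    model,
    (dummy_wavs, dummy_lens),
    "model.onnx",
    input_names=["wavforms", "wav_lens"],
    output_names=["logits"],
    dynamic_axes={"wavforms": {0: "batch", 1: "samples"},
                  "wav_lens": {0: "batch"}},
    opset_version=17,
)

# onnxsim pass that produces model.sim.onnx from model.onnx
simplified, ok = simplify(onnx.load("model.onnx"))
assert ok, "onnxsim could not validate the simplified model"
onnx.save(simplified, "model.sim.onnx")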

I just tested it again, and when using "model.sim.onnx" with trtexec, Error Code 2 occurs, causing the build to completely fail. On the other hand, when using "model.onnx" with trtexec, the build succeeds, but an error appears midway through (Error Code 9 below), and when I actually run inference, the results are completely messed up.

[12/20/2024-05:38:51] [E] Error[9]: Error Code: 9: Skipping tactic 0x0000000000000000 due to exception [shape.cpp:verify_output_type:1417] Mismatched type for tensor logits', f16 vs. expected type:f32.

The trtexec command was mentioned above, but I'll write it again for clarity.

trtexec --onnx=<onnx-model-file> --saveEngine=/workspace/output/model.plan.bsz4.fp16 --memPoolSize=workspace:8192 --minShapes=wavforms:1x1,wav_lens:1x1 --optShapes=wavforms:4x320000,wav_lens:4x1 --maxShapes=wavforms:4x320000,wav_lens:4x1 --fp16
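In case it is useful, here is a sketch of the same build through the TensorRT Python API with the network output pinned to FP32, aimed at the "f16 vs. expected type:f32" mismatch on logits above. It is untested against this model; the shapes, input names, and workspace size mirror the trtexec command, and everything else is an assumption.

import tensorrt as trt

logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
network = builder.create_network(0)  # explicit batch is the default in TRT 10
parser = trt.OnnxParser(network, logger)
with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        raise RuntimeError(parser.get_error(0))

config = builder.create_builder_config()
config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 8 << 30)  # 8 GiB
config.set_flag(trt.BuilderFlag.FP16)

# Pin every network output (here: logits) to FP32 so FP16 tactics must
# cast back before writing the output.
for i in range(network.num_outputs):
    network.get_output(i).dtype = trt.float32

# Same min/opt/max shapes as the trtexec command.
profile = builder.create_optimization_profile()
profile.set_shape("wavforms", (1, 1), (4, 320000), (4, 320000))
profile.set_shape("wav_lens", (1, 1), (4, 1), (4, 1))
config.add_optimization_profile(profile)

engine = builder.build_serialized_network(network, config)
with open("model.plan.bsz4.fp16", "wb") as f:
    f.write(engine)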

@asfiyab-nvidia (Collaborator)

Thanks @msublee. We will get back to you soon.

@lix19937

Do a test with a fixed-shape ONNX model, or use the latest TRT.
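A minimal sketch of that fixed-shape test, pinning the dynamic dimensions of the existing model to the opt shapes from the command above (the sizes and input names come from this thread; the output file name is made up):

import onnx

# Pin the dynamic dims to the opt-profile sizes so trtexec can be run
# without any --minShapes/--optShapes/--maxShapes flags.
model = onnx.load("model.onnx")
fixed = {"wavforms": (4, 320000), "wav_lens": (4, 1)}
for inp in model.graph.input:
    if inp.name in fixed:
        for dim, value in zip(inp.type.tensor_type.shape.dim, fixed[inp.name]):
            dim.dim_value = value  # setting dim_value clears any dim_param
onnx.save(model, "model.fixed.onnx")

The result can then be built without shape flags, e.g. trtexec --onnx=model.fixed.onnx --fp16.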
