GithubHelp home page GithubHelp logo

Comments (3)

Slyne avatar Slyne commented on May 27, 2024

@Slyne (myself) to check this issue.

@nirraviv89 Meanwhile could you help try tensorrt 10.0:
https://developer.nvidia.com/downloads/compute/machine-learning/tensorrt/10.0.0/tensorrt-10.0.0.6.linux.x86_64-gnu.cuda-12.4.tar.gz

The installation guide:
https://docs.nvidia.com/deeplearning/tensorrt/install-guide/index.html#installing-tar

from nemo.

nirraviv89 avatar nirraviv89 commented on May 27, 2024

Thanks @Slyne this is the error I get with tensorrt 10.0:

[04/08/2024-07:16:08] [TRT] [W] Tactic Device request: 24316MB Available: 22723MB. Device memory is insufficient to use tactic.
[04/08/2024-07:16:08] [TRT] [W] UNSUPPORTED_STATE: Skipping tactic 0 due to insufficient memory on requested size of 24316 detected for tactic 0x0000000000000000.
[04/08/2024-07:16:08] [TRT] [E] 10: Could not find any implementation for node {ForeignNode[ONNXTRT_squeezeTensor_643 + /layers.0/conv/Transpose_1.../layers.1/self_attn/MatMul_1]}.
[04/08/2024-07:16:08] [TRT] [E] 10: [optimizer.cpp::computeCosts::4103] Error Code 10: Internal Error (Could not find any implementation for node {ForeignNode[ONNXTRT_squeezeTensor_643 + /layers.0/conv/Transpose_1.../layers.1/self_attn/MatMul_1]}.)

from nemo.

Slyne avatar Slyne commented on May 27, 2024

Thanks @Slyne this is the error I get with tensorrt 10.0:

[04/08/2024-07:16:08] [TRT] [W] Tactic Device request: 24316MB Available: 22723MB. Device memory is insufficient to use tactic.
[04/08/2024-07:16:08] [TRT] [W] UNSUPPORTED_STATE: Skipping tactic 0 due to insufficient memory on requested size of 24316 detected for tactic 0x0000000000000000.
[04/08/2024-07:16:08] [TRT] [E] 10: Could not find any implementation for node {ForeignNode[ONNXTRT_squeezeTensor_643 + /layers.0/conv/Transpose_1.../layers.1/self_attn/MatMul_1]}.
[04/08/2024-07:16:08] [TRT] [E] 10: [optimizer.cpp::computeCosts::4103] Error Code 10: Internal Error (Could not find any implementation for node {ForeignNode[ONNXTRT_squeezeTensor_643 + /layers.0/conv/Transpose_1.../layers.1/self_attn/MatMul_1]}.)

If you only want to export conformer to Tensorrt, you may check this file to export nemo conformer to onnx:

https://github.com/Slyne/NeMo/blob/dgalvez/triton-inference-server/examples/asr/triton-inference-server/scripts/export_asr_ctc_onnx.py

Convert onnx model to tensorrt model by using trtexec:

trtexec --onnx=onnx_model/model.onnx \
        --minShapes=audio_signal:1x80x100,length:1 \
        --optShapes=audio_signal:16x80x1000,length:16 \
        --maxShapes=audio_signal:32x80x6000,length:32 \
        --fp16 \
        --saveEngine=encoder.trt

For this quantization script issue, I haven't get a chance to start it. I think I can check this issue this week. (Based on my previous experience, I didn't see perf gain by using trt + quantization.)

from nemo.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.