GithubHelp home page GithubHelp logo

Comments (6)

dophist avatar dophist commented on June 16, 2024 1

It was tested on 2080ti

from warp-rnnt.

1ytic avatar 1ytic commented on June 16, 2024

Hi @AshishSardana, thank you very much for this issue! I can't believe I made such a stupid mistake in my benchmark script. In my defence I will say that when I build this package I used NVIDIA Visual Profiler, and it showed very clean performance boost. I will definitely look at this on weekend. Could you also check - is it related some how to torch.no_grad()? Additionally, could you check the gather flag, which should slitty improve a memory issue?

from warp-rnnt.

dophist avatar dophist commented on June 16, 2024

As of 2021.Apr 24th, pytorch_binding/benchmark.py with above torch.cuda.synchronize() yields:

setup(T=150, U=40, V=28) warp_rnnt warprnnt_pytorch
N=1 0.40 0.73
N=16 1.31 1.82
N=32 2.17 3.58
N=64 4.02 6.36
N=128 7.64 9.42

Hopefully this will save others' time from repeating these benchmarks.

from warp-rnnt.

1ytic avatar 1ytic commented on June 16, 2024

@dophist thank you! Could you provide hw spec?

from warp-rnnt.

1ytic avatar 1ytic commented on June 16, 2024

Finally, I updated the benchmark results. Indeed the warping doesn't show superior performance if you synchronise device on each iteration. On the other hand, this sync-free implementation can be useful in complex use case. Additionally, I added gather version which shows great performance with a large vocabulary.

from warp-rnnt.

1ytic avatar 1ytic commented on June 16, 2024

I added one more explanation 2944982 of performance issue with NVIDIA Profiler.

from warp-rnnt.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.