Comments (6)
It was tested on 2080ti
from warp-rnnt.
Hi @AshishSardana, thank you very much for this issue! I can't believe I made such a stupid mistake in my benchmark script. In my defence I will say that when I build this package I used NVIDIA Visual Profiler, and it showed very clean performance boost. I will definitely look at this on weekend. Could you also check - is it related some how to torch.no_grad()? Additionally, could you check the gather
flag, which should slitty improve a memory issue?
from warp-rnnt.
As of 2021.Apr 24th, pytorch_binding/benchmark.py
with above torch.cuda.synchronize()
yields:
setup(T=150, U=40, V=28) | warp_rnnt | warprnnt_pytorch |
---|---|---|
N=1 | 0.40 | 0.73 |
N=16 | 1.31 | 1.82 |
N=32 | 2.17 | 3.58 |
N=64 | 4.02 | 6.36 |
N=128 | 7.64 | 9.42 |
Hopefully this will save others' time from repeating these benchmarks.
from warp-rnnt.
@dophist thank you! Could you provide hw spec?
from warp-rnnt.
Finally, I updated the benchmark results. Indeed the warping doesn't show superior performance if you synchronise device on each iteration. On the other hand, this sync-free implementation can be useful in complex use case. Additionally, I added gather
version which shows great performance with a large vocabulary.
from warp-rnnt.
I added one more explanation 2944982 of performance issue with NVIDIA Profiler.
from warp-rnnt.
Related Issues (20)
- question about the rnnt loss arguments HOT 4
- WARNING: sample 0 [42, 26] has a forward/backward mismatch -52.543503 / 0.000000 HOT 3
- ImportError: libcudart.so.10.2: cannot open shared object file: No such file or directory HOT 3
- Transducer loss leads to memory leak HOT 4
- RuntimeError: rnnt_loss status 1 HOT 5
- warning that forward/backward mismatch HOT 3
- question about the gather arguments HOT 1
- Question about average_frames and reduction parmas HOT 1
- Not support for pytorch 1.7 HOT 1
- ninja: build stopped: subcommand failed. HOT 1
- Normalize the RNN-T Loss with input seq length HOT 1
- PyTorch 1.9 Support HOT 5
- Strange behavior using PyTorch DDP HOT 7
- undefined symbol: _ZNSt19basic_ostringstreamIcSt11char_traitsIcESaIcEEC1Ev HOT 2
- rnnt_loss status 1 HOT 2
- THC/THC.h: No such file or directory HOT 7
- can't install warp-rnnt HOT 2
- Exception: CPU version is not implemented HOT 2
- improve efficiency of warps HOT 4
- ImportError: libcudart.so.10.1: cannot open shared object file: No such file or directory HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from warp-rnnt.