Comments (5)
@artbataev this may be related to your observation of TDT being faster than CTC in some cases.
from nemo.
If I recall correctly, max does work on GPU so I don't know why I forced it onto CPU. Maybe the slice? Either way, if it fixes the slowdown let's move the .CPU() to after max calculation.
It might have been that the logprobs can be super long for 12 hours so I wanted to preserve memory, in which case the cost of speed is not under consideration but I suppose we can make it a premature optimization and select whether to move the tensor to CPU before or after depending on length of the tensor.
from nemo.
It is pretty common to run long audio CTC - NFA does it constantly. But on average, it is more common to run short audio batches
from nemo.
It might have been that the logprobs can be super long for 12 hours so I wanted to preserve memory
The tensor pointed to by "prediction" already existed on GPU. It is just being copied to CPU in this code. We should not expect any memory increase from my suggestion. In fact, we will see a CPU memory decrease.
from nemo.
That's true. Can you see if moving it onto CPU at the end before return is sufficient to remove this bottleneck then let's do it in your other pr
from nemo.
Related Issues (20)
- nemo2riva export doesn't work with FastPitch HOT 3
- Issue with pip installing on windows due to youtokentome dependency HOT 1
- Tokenizer suggestion for fine tuning cache aware streaming model HOT 2
- Titanet-L Augmentation HOT 4
- Not found file "convert_mistral_hf_to_nemo.py" in /opt/NeMo/scripts/checkpoint_converters/ for Convert Mistral
- Precision Problem between nemo model and hugging face model HOT 2
- Llama2 70B SFT with FSDP failing HOT 2
- training config used for training stt_en_quartznet15x5 HOT 2
- llama2 training hangs when pp_size > 1 HOT 2
- Integration of Turn-Taking Models into Nemo Framework for Enhanced Realistic Conversations
- FileNotFoundError: Model stt_fa_fastconformer_hybrid_large was not found. HOT 6
- [Feature] Add Support on Multiple Metrics Reporting during Training Progress for Validation
- checkpoints not saved due to wrong loss comparison?
- when "write_predictions_to_file" is true,generate will fail。 HOT 1
- "RuntimeError: start (4) + length (1) exceeds dimension size (4)." when running cache aware streaming inference
- slow validation process HOT 2
- Optimizing Learning Rate Parameters in Model Fine-tuning
- AUDIO FILE SIZE for fine tuning STT En FastConformer Hybrid Transducer-CTC Large Streaming Multi HOT 1
- `EncDecCTCModel.transcribe(audio=...)` changed to `EncDecCTCModel.transcribe(paths2audio_files=...)` HOT 6
- Enormous number of `.nemo` checkpoints produced in training HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nemo.