Comments (2)
Hi for these proposes I wrote a tiny script for my ASR models, that accepts .ckpt
model and saves it as .nemo
model
I think something similar should work for NLP models too.
import sys
from pathlib import Path
import nemo.collections.asr as nemo_asr
def get_nemo_ckpt_path(ckpt_path: Path) -> Path:
ckpt_name = ckpt_path.stem
name, other = ckpt_name.split("--")
epoch = other.split("=")[-1]
out_path = ckpt_path.with_name(f"{name}-epoch-{epoch}.nemo")
return out_path
def export_to_nemo(lightning_ckpt_path: Path):
model = nemo_asr.models.EncDecCTCModelBPE.load_from_checkpoint(
str(lightning_ckpt_path)
)
ckpt_name = get_nemo_ckpt_path(lightning_ckpt_path)
out_path = get_nemo_ckpt_path(lightning_ckpt_path)
model.save_to(str(out_path))
print(f"Succesfully saved model checkpoint: {out_path} in nemo format")
if __name__ == "__main__":
in_path = Path(sys.argv[1])
assert in_path.exists()
export_to_nemo(in_path)
from nemo.
Hi for these proposes I wrote a tiny script for my ASR models, that accepts
.ckpt
model and saves it as.nemo
modelI think something similar should work for NLP models too.
import sys from pathlib import Path import nemo.collections.asr as nemo_asr def get_nemo_ckpt_path(ckpt_path: Path) -> Path: ckpt_name = ckpt_path.stem name, other = ckpt_name.split("--") epoch = other.split("=")[-1] out_path = ckpt_path.with_name(f"{name}-epoch-{epoch}.nemo") return out_path def export_to_nemo(lightning_ckpt_path: Path): model = nemo_asr.models.EncDecCTCModelBPE.load_from_checkpoint( str(lightning_ckpt_path) ) ckpt_name = get_nemo_ckpt_path(lightning_ckpt_path) out_path = get_nemo_ckpt_path(lightning_ckpt_path) model.save_to(str(out_path)) print(f"Succesfully saved model checkpoint: {out_path} in nemo format") if __name__ == "__main__": in_path = Path(sys.argv[1]) assert in_path.exists() export_to_nemo(in_path)
Thanks for your reply.
It seems like changing the import nemo.collections.asr as nemo_asr might work.
Have you tried converting the Nemo format to gguf? Alternatively, how do you build a model from your Nemo format?
Thank you.
from nemo.
Related Issues (20)
- training config used for training stt_en_quartznet15x5 HOT 2
- llama2 training hangs when pp_size > 1 HOT 2
- Integration of Turn-Taking Models into Nemo Framework for Enhanced Realistic Conversations
- FileNotFoundError: Model stt_fa_fastconformer_hybrid_large was not found. HOT 6
- [Feature] Add Support on Multiple Metrics Reporting during Training Progress for Validation
- checkpoints not saved due to wrong loss comparison?
- when "write_predictions_to_file" is true,generate will fail。 HOT 1
- "RuntimeError: start (4) + length (1) exceeds dimension size (4)." when running cache aware streaming inference
- slow validation process HOT 2
- Optimizing Learning Rate Parameters in Model Fine-tuning
- AUDIO FILE SIZE for fine tuning STT En FastConformer Hybrid Transducer-CTC Large Streaming Multi HOT 1
- `EncDecCTCModel.transcribe(audio=...)` changed to `EncDecCTCModel.transcribe(paths2audio_files=...)` HOT 6
- Enormous number of `.nemo` checkpoints produced in training HOT 4
- [Conversion] How to convert Finetuned T5 checkpoint ended with `.ckpt` to `.nemo` checkpoint with NeMo toolkit?
- Can't launch NeMo containers with CUDA support HOT 1
- Latest huggingface transformers version breaking nlp modules HOT 5
- Any tts models in nemo that can simulated human laughter and other human sounds?
- setuptools 70.0.0 results in ImportError: cannot import name 'packaging' from 'pkg_resources'
- Question about the settings in speech_data_simulator HOT 2
- The training hangs in the middle on multiple nodes, showing low power consumption and 100% GPU utilization.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nemo.