Comments (10)
Please tell us what you have done with the German tts model.
For "not work well", could you describe in detail what it means?
from sherpa-onnx.
Thank you for reply.
I just followed documentation page (https://k2-fsa.github.io/sherpa/onnx/tts/wasm/build.html) by changing URL for wget.
Page was successfully displayed, but when I tried to generate German voice from text "Heute ist ein guter Tag. Gestern war ein guter Tag.", it generate strange voices in all Speaker ID I tested (5~6 different ID).
when I used a single speaker model (I do not remember which one), Generated voice was no problem.
from sherpa-onnx.
by changing URL for wget
Could you describe it in detail what you have done?
from sherpa-onnx.
I tried wget and manually download from models list.
wget -q https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/vits-piper-de_DE-mls-medium.tar.bz2
extract downloaded folder
copy .onnx, tokens, and espeak-ng-data to asset folder
and change .onnx file name to model.onnx.
delete old contents in build-wasm-simd-tts folder
run
build-wasm-simd-tts.sh
test page
generated voice length is very long (~20 second) and strange from "Heute ist ein guter Tag. Gestern war ein guter Tag.".
from sherpa-onnx.
Could you switch to another German model?
I just tested it and found that the model cannot produce correct speech. I am deleting it.
from sherpa-onnx.
By the way, you can try all German tts models at
https://huggingface.co/spaces/k2-fsa/text-to-speech
![Screenshot 2024-05-12 at 12 06 57](https://private-user-images.githubusercontent.com/5284924/329825033-bb6a5b90-29ff-4f22-988f-24174a303c65.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MjEzMzAzMzEsIm5iZiI6MTcyMTMzMDAzMSwicGF0aCI6Ii81Mjg0OTI0LzMyOTgyNTAzMy1iYjZhNWI5MC0yOWZmLTRmMjItOTg4Zi0yNDE3NGEzMDNjNjUucG5nP1gtQW16LUFsZ29yaXRobT1BV1M0LUhNQUMtU0hBMjU2JlgtQW16LUNyZWRlbnRpYWw9QUtJQVZDT0RZTFNBNTNQUUs0WkElMkYyMDI0MDcxOCUyRnVzLWVhc3QtMSUyRnMzJTJGYXdzNF9yZXF1ZXN0JlgtQW16LURhdGU9MjAyNDA3MThUMTkxMzUxWiZYLUFtei1FeHBpcmVzPTMwMCZYLUFtei1TaWduYXR1cmU9OWU3ZDM2MTA0MzdhN2E2ZGVmZDRkNjZlMWMxYzk4YjMxMmFhZDc4Y2IzOGUwNWEwODQyODU5Mjg3ODZlMGY4NiZYLUFtei1TaWduZWRIZWFkZXJzPWhvc3QmYWN0b3JfaWQ9MCZrZXlfaWQ9MCZyZXBvX2lkPTAifQ.3DVCFupZjEdKpA1EkZNOnookrnO1YxPDOm3_ZNp_4Lw)
from sherpa-onnx.
That is no problem. I am testing it.
But I want to know why in English case multi-speakers model works, and not works in other languages (I tested French multi-speakers model as well, and it generates strange voices). Which files are wrong to produce strange voices?
from sherpa-onnx.
I tested French multi-speakers model as well, and it generates strange voices
Please tell us the exact model you are using.
please first test the model at
https://huggingface.co/spaces/k2-fsa/text-to-speech
from sherpa-onnx.
I don't remember well, but I think model was
https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/vits-piper-fr_FR-mls-medium.tar.bz2
It is possible that vits-piper models that contain "mls(-medium)" not work well in different languages as well.
from sherpa-onnx.
I suggest that you don't use any model including mls
in its name. I am deleting this model from sherpa-onnx.
from sherpa-onnx.
Related Issues (20)
- Build error on MacOS 14.5 with go-api-example/real-time-speech-recognition-from-microphone HOT 12
- [Feature] Handling onnxrt execution provider config for various models HOT 3
- Add speech enhancement feature HOT 1
- What natural languages does this library support? HOT 9
- DartApi使用whisper模型翻译中文音频报错 HOT 1
- Buid failed on windows with cuda HOT 2
- Error when running tts model HOT 2
- 大佬有没有微信交流群或者qq群啊,,我目前还不太理解这些代码,另外我有需求转换largeV3转onnx,这个有什么方法吗 HOT 2
- Offline Recognizer - Passing the Language for Multi-Language Models HOT 5
- 希望nuget能加个cuda版本的sherpa-onnx库 HOT 1
- 将keyword-spotting-from-files改成了从麦克风读取但没有效果 HOT 2
- Some tts engines are crashing since 1.10.13 (Android) HOT 1
- Add useful whisper features
- Voice conversion HOT 1
- libtool: error: unrecognised option: '-static' on Mac M1 HOT 3
- VAD segment length cap at around 20s HOT 1
- 语音识别测试使用非流式模型比流式模型识别率更高,是否可以更换NAudio组件录音wav文件 HOT 1
- 【flutter】The UI process will stall HOT 4
- FFMPEG example is broken
- How do I use cuda with sherpa-onnx-node? HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sherpa-onnx.