GithubHelp home page GithubHelp logo

Comments (10)

csukuangfj avatar csukuangfj commented on July 19, 2024

Please tell us what you have done with the German tts model.

For "not work well", could you describe in detail what it means?

from sherpa-onnx.

kmpartner avatar kmpartner commented on July 19, 2024

Thank you for reply.
I just followed documentation page (https://k2-fsa.github.io/sherpa/onnx/tts/wasm/build.html) by changing URL for wget.

Page was successfully displayed, but when I tried to generate German voice from text "Heute ist ein guter Tag. Gestern war ein guter Tag.", it generate strange voices in all Speaker ID I tested (5~6 different ID).

when I used a single speaker model (I do not remember which one), Generated voice was no problem.

from sherpa-onnx.

csukuangfj avatar csukuangfj commented on July 19, 2024

by changing URL for wget

Could you describe it in detail what you have done?

from sherpa-onnx.

kmpartner avatar kmpartner commented on July 19, 2024

I tried wget and manually download from models list.
wget -q https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/vits-piper-de_DE-mls-medium.tar.bz2

extract downloaded folder
copy .onnx, tokens, and espeak-ng-data to asset folder
and change .onnx file name to model.onnx.

delete old contents in build-wasm-simd-tts folder

run
build-wasm-simd-tts.sh

test page

generated voice length is very long (~20 second) and strange from "Heute ist ein guter Tag. Gestern war ein guter Tag.".

from sherpa-onnx.

csukuangfj avatar csukuangfj commented on July 19, 2024

Could you switch to another German model?

I just tested it and found that the model cannot produce correct speech. I am deleting it.

from sherpa-onnx.

csukuangfj avatar csukuangfj commented on July 19, 2024

By the way, you can try all German tts models at
https://huggingface.co/spaces/k2-fsa/text-to-speech

Screenshot 2024-05-12 at 12 06 57

from sherpa-onnx.

kmpartner avatar kmpartner commented on July 19, 2024

That is no problem. I am testing it.
But I want to know why in English case multi-speakers model works, and not works in other languages (I tested French multi-speakers model as well, and it generates strange voices). Which files are wrong to produce strange voices?

from sherpa-onnx.

csukuangfj avatar csukuangfj commented on July 19, 2024

I tested French multi-speakers model as well, and it generates strange voices

Please tell us the exact model you are using.

please first test the model at
https://huggingface.co/spaces/k2-fsa/text-to-speech

from sherpa-onnx.

kmpartner avatar kmpartner commented on July 19, 2024

I don't remember well, but I think model was
https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/vits-piper-fr_FR-mls-medium.tar.bz2

It is possible that vits-piper models that contain "mls(-medium)" not work well in different languages as well.

from sherpa-onnx.

csukuangfj avatar csukuangfj commented on July 19, 2024

I suggest that you don't use any model including mls in its name. I am deleting this model from sherpa-onnx.

from sherpa-onnx.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.