Comments (1)
There are ways to adapt speech models so that they sound like a different person using only a few voice samples. Mimic does not implement any of them. Here are some (fun?) things you can do instead:
-
You can multiply the original pitch by a factor:
./mimic -voice ap --setf "f0_shift=0.75" -t "lower pitch"
./mimic -voice ap --setf "f0_shift=1.0" -t "normal voice"
./mimic -voice ap --setf "f0_shift=1.5" -t "higher pitch"
- You can set the mean pitch in hertz. Typical male voices are between 85 and 180 Hz and female voices between 165 and 255 Hz (see https://en.wikipedia.org/wiki/Voice_frequency):
./mimic -voice ap --setf "int_f0_target_mean=200" -t "I'm mycroft speaking with higher pitch, am I not lovely?"
./mimic -voice ap --setf "int_f0_target_mean=50" -t "I am the evil mycroft"
- You can also set the variability (standard deviation) of the pitch, although I still need to explore this one further:
./mimic -voice ap --setf "int_f0_target_stddev=10" -t "hello world, this is a longer speech"
- You can change the speed of the speech (values below 1.0 shorten durations, so the speech is faster) with:
--setf duration_stretch=0.8
-
And you can combine all those variables:
./mimic -voice ap --setf "f0_shift=1.0" --setf "int_f0_target_stddev=10" --setf duration_stretch=0.8 -t "hello world, this is a longer speech"
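To try several combinations systematically, a small dry-run script can print one command per pair of settings. This is only a minimal sketch: the helper names `mimic_cmd` and `sweep`, the grid of values and the sample text are assumptions made for illustration. It uses only the `-voice`, `--setf` and `-t` options shown above.

```shell
#!/bin/sh
# Hypothetical helper script (not part of mimic): prints mimic commands
# for a small grid of pitch and speed settings without running them.

MIMIC=./mimic   # assumed path to the mimic binary, as in the examples above

# mimic_cmd F0_SHIFT DURATION_STRETCH TEXT
# Prints a single mimic command line for the given settings.
mimic_cmd() {
    printf '%s -voice ap --setf "f0_shift=%s" --setf "duration_stretch=%s" -t "%s"\n' \
        "$MIMIC" "$1" "$2" "$3"
}

# sweep
# Prints one command per (f0_shift, duration_stretch) pair.
# The value lists are arbitrary starting points, not recommended settings.
sweep() {
    for pitch in 0.75 1.0 1.5; do
        for stretch in 0.8 1.0; do
            mimic_cmd "$pitch" "$stretch" "pitch $pitch, stretch $stretch"
        done
    done
}

sweep
```

Running the script only prints the six commands. To actually hear them, pipe the output to a shell (for example `sh sweep.sh | sh`, where `sweep.sh` is whatever name you saved the script under).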
While what you want to do is not easy to implement in Mycroft, if you (or anyone else) are interested in playing with these values and documenting them (maybe suggesting better values or nice combinations!), pull requests would be welcome. For instance, I would love for Mycroft to talk in a deeper voice on Halloween or something like that...
from mimic1.