GithubHelp home page GithubHelp logo

fjrojasgarcia / merlin Goto Github PK

View Code? Open in Web Editor NEW

This project forked from cstr-edinburgh/merlin

0.0 1.0 0.0 6.69 MB

This is now the official location of the Merlin project.

Home Page: http://www.cstr.ed.ac.uk/projects/merlin/

License: Apache License 2.0

Python 65.21% Shell 13.84% Scheme 0.99% Awk 0.41% Makefile 0.33% C++ 16.01% Objective-C 0.98% C 2.21%

merlin's Introduction

Build Status

Merlin: The Neural Network (NN) based Speech Synthesis System

This repository contains the Neural Network (NN) based Speech Synthesis System
developed at the Centre for Speech Technology Research (CSTR), University of Edinburgh.

Merlin is a toolkit for building Deep Neural Network models for statistical parametric speech synthesis. It must be used in combination with a front-end text processor (e.g., Festival) and a vocoder (e.g., STRAIGHT or WORLD).

The system is written in Python and relies on the Theano numerical computation library.

Merlin comes with recipes (in the spirit of the Kaldi automatic speech recognition toolkit) to show you how to build state-of-the art systems.

Merlin is free software, distributed under an Apache License Version 2.0, allowing unrestricted commercial and non-commercial use alike.

Read the documentation at cstr-edinburgh.github.io/merlin.

Merlin is compatible with: Python 2.7-3.6.

Installation

Merlin uses the following dependencies:

  • numpy, scipy
  • matplotlib
  • bandmat
  • theano
  • tensorflow (optional, required if you use tensorflow models)
  • sklearn, keras, h5py (optional, required if you use keras models)

To install Merlin, cd merlin and run the below steps:

  • Install some basic tools in Merlin
bash tools/compile_tools.sh
  • Install python dependencies
pip install -r requirements.txt

For detailed instructions, to build the toolkit: see INSTALL and CSTR blog post.
These instructions are valid for UNIX systems including various flavors of Linux;

Getting started with Merlin

To run the example system builds, see egs/README.txt

As a first demo, please follow the scripts in egs/slt_arctic

Now, you can also follow Josh Meyer's blog post for detailed instructions
on how to install Merlin and build SLT demo voice.

For a more in-depth tutorial about building voices with Merlin, you can check out:

Synthetic speech samples

Listen to synthetic speech samples from our SLT arctic voice.

Development pattern for contributors

  1. Create a personal fork of the main Merlin repository in GitHub.
  2. Make your changes in a named branch different from master, e.g. you create a branch my-new-feature.
  3. Generate a pull request through the Web interface of GitHub.

Contact Us

Post your questions, suggestions, and discussions to GitHub Issues.

Citation

If you publish work based on Merlin, please cite:

Zhizheng Wu, Oliver Watts, Simon King, "Merlin: An Open Source Neural Network Speech Synthesis System" in Proc. 9th ISCA Speech Synthesis Workshop (SSW9), September 2016, Sunnyvale, CA, USA.

merlin's People

Contributors

abylouw avatar aospan avatar bajibabu avatar benreaves avatar candlewill avatar felipeespic avatar gillesdegottex avatar jackiexiao avatar jrmeyer avatar lemonzi avatar m-toman avatar maelp avatar matei-oltean avatar mgoksu avatar oliverwatts avatar pasindud avatar r9y9 avatar rasmusd avatar ronanki avatar seblemaguer avatar simonkingedinburgh avatar siyud avatar syltruong avatar tuanad121 avatar vitaly-zdanevich avatar yweweler avatar zackhodari avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.