150-People-Chinese-Mandarin-Average-Tone-Speech-Synthesis-Corpus-Customer-Service
Description
150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus - Customer Service. The corpus is recorded by native Chinese speakers reading customer service text, with balanced syllables, phonemes, and tones. Professional phoneticians participate in the annotation. The corpus is designed to match the research and development needs of speech synthesis.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1100?source=Github
Format
48,000 Hz, 16-bit, uncompressed WAV, mono channel;
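As a quick sanity check, a downloaded file can be compared against the format stated above. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav_format` and the file name in the usage note are hypothetical and are not part of the dataset or any tooling shipped with it.

```python
import wave

# Expected values taken from the Format line of this README.
EXPECTED_RATE_HZ = 48_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1  # mono


def check_wav_format(source):
    """Return a list of mismatches between a WAV file and the stated format.

    `source` may be a file path or a binary file-like object.
    An empty list means the file matches.
    """
    problems = []
    with wave.open(source, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        # The wave module reports "NONE" for uncompressed PCM data.
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems
```

For example, `check_wav_format("example.wav")` (a placeholder file name) would return an empty list for a file that matches the stated format, and a list of human-readable mismatches otherwise.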
Recording environment
professional recording studio;
Recording content
customer service text, with balanced syllables, phonemes, and tones;
Speaker
150 speakers in total, 50% male and 50% female;
Device
microphone;
Language
Mandarin;
Annotation
word and Pinyin transcription, four-level prosodic boundary annotation;
Application scenarios
speech synthesis.
Licensing Information
Commercial License: https://drive.google.com/file/d/1saDCPm74D4UWfBL17VbkTsZLGfpOQj1J/view?usp=sharing