[hts-users:04665] JVS:free Japanese multi-speaker speech corpus
Dear speech researchers,
We are pleased to inform you that a new Japanese multi-speaker corpus is freely available at:
https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus
This corpus consists of Japanese text (transcripts) and multi-speaker voice data. The specification is as follows.
- 100 professional speakers
- Each speaker utters:
* "parallel100" ... 100 reading-style utterances that are common among speakers
* "nonpara30" ... 30 reading-style utterances that are completely different among speakers
* "whisper10" ... 10 whispered utterances
* "falset10" ... 10 falsetto-style utterances
- High-quality (studio recording), high-sampling-rate (24 kHz), and large-sized (30 hours) audio files
- Useful tags included (e.g., gender, F0 range, speaker similarity, duration, and phoneme alignment (automatically generated) )
The text data is came from the JSUT corpus, and its license information is written in the JSUT corpus. The tags are licensed with CC BY-SA 4.0. The audio data may be used for
- Research by academic institutions
- Non-commercial research, including research conducted within commercial organizations
- Personal use, including blog posts.
(but we are preparing options for your commercial use.)
Best,
--
Shinnosuke TAKAMICHI, Ph.D.
Assistant Professor of The University of Tokyo
shinnosuke_takamichi@xxxxxxxxxxxxxxxxxxx