Exemplar-based speech waveform generation for text-to-speech: hybrid text-to-speech synthesis with Merlin

Audio samples for the systems evaluated in the paper "Exemplar-based speech waveform generation for text-to-speech":

@inproceedings{valentini18examplar,
  title     = {Exemplar-based speech waveform generation for text-to-speech},
  author    = {Cassia Valentini-Botinhao and Oliver Watts and Felipe Espic and Simon King},
  booktitle = {IEEE Workshop on Spoken Language Technology (SLT)},
  year      = {2018},
}

You can find code for replicating the proposed systems V-ES, T-EH, and T-ES here.

Condition: N, V-MP, V-ES, T-MP, T-MS, T-EH, T-ES

Utterances: hvd_210, hvd_211, hvd_212, hvd_213, hvd_214, hvd_215, hvd_216, hvd_217, hvd_218, hvd_219

Condition: N, V-MP, V-ES, T-MP, T-MS, T-EH, T-ES

Utterances: 1_050, 1_053, 1_054, 1_056, 1_057, 1_059, 1_060, 1_062, 1_065, 1_067, 1_068