Here are audio samples for the systems evaluated in the paper "Exemplar-based speech waveform generation for text-to-speech":
@inproceedings{valentini18examplar,
  title = {Exemplar-based speech waveform generation for text-to-speech},
  author = {Cassia Valentini-Botinhao and Oliver Watts and Felipe Espic and Simon King},
  booktitle = {IEEE Workshop on Spoken Language Technology (SLT)},
  year = {2018},
}
You can find code for replicating the proposed systems V-ES, T-EH, and T-ES here.
Condition | N | V-MP | V-ES | T-MP | T-MS | T-EH | T-ES |
hvd_210 | | | | | | | |
hvd_211 | | | | | | | |
hvd_212 | | | | | | | |
hvd_213 | | | | | | | |
hvd_214 | | | | | | | |
hvd_215 | | | | | | | |
hvd_216 | | | | | | | |
hvd_217 | | | | | | | |
hvd_218 | | | | | | | |
hvd_219 | | | | | | | |

Condition | N | V-MP | V-ES | T-MP | T-MS | T-EH | T-ES |
1_050 | | | | | | | |
1_053 | | | | | | | |
1_054 | | | | | | | |
1_056 | | | | | | | |
1_057 | | | | | | | |
1_059 | | | | | | | |
1_060 | | | | | | | |
1_062 | | | | | | | |
1_065 | | | | | | | |
1_067 | | | | | | | |
1_068 | | | | | | | |