VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Voice conversion samples for our submission to the ZeroSpeech 2020 challenge.


Fig 1: VQ-VAE model architecture.

All audio samples are generated using the scripts and pretrained weights at https://github.com/bshall/ZeroSpeech.

English samples


Speaker - V001

V001 other conversions
source converted target S040 S056 S074 S090

Speaker - V002

V002 other conversions
source converted target S040 S056 S074 S090

Indonesian samples


Speaker - V001

V001 other conversions
source converted target S028 S110 S112 S154