Vector-Quantized Contrastive Predictive Coding (VQ-CPC)
for Acoustic Unit Discovery and Voice Conversion

Voice conversion samples for our submission to the ZeroSpeech 2020 challenge.

VQ-CPC model architecture

^{Fig 1: VQ-CPC model architecture.}

All audio samples are generated using the scripts and pretrained weights at https://github.com/bshall/VectorQuantizedCPC.

For samples from the VQ-VAE model see https://bshall.github.io/ZeroSpeech/.

English samples

Speaker - V001

V001			other conversions
source	converted	target	S040	S056	S074	S090

Speaker - V002

V002			other conversions
source	converted	target	S040	S056	S074	S090

Indonesian samples

Speaker - V001

V001			other conversions
source	converted	target	S028	S110	S112	S154