Non-parallel Training for Voice Conversion using Background based Alignment of GMMs and INCA Algorithm

Authors

M. Ghorbandoost, V.Saba

Abstract

Most of the voice conversion (VC) researches have used parallel training corpora to train the conversion function.However, in practice it is not always possible to gather parallel corpora, so the need for non-parallel training methods arises. As a successful non-parallel method, nearest neighbour search step and a conversion step alignment method (INCA) algorithm has attracted a lot of attention in recent years. In this study, the authors propose a new method of non-parallel VC which is based on the INCA algorithm. The authors’ method effectively solves the initialisation problem of INCA algorithm. Their proposed initialisation for INCA is done with alignment of Gaussian mixture models (GMM) using universal background model.Results of objective and subjective experiments determined that the authors’ proposed method improves the INCA algorithm. It is obiniones the, the supers it tood scores 0.25 sier of the hinge material and 0.2 higher nine sentent sim. arterto the arant speaker compared with traditional INCA. It seems that the authors’ proposed method is a suitable frame alignment method for non-parallel corpora in VC task.