Voice conversion using structured Gaussian mixture model in cepstrum eigenspace

(整期优先)网络出版时间:2015-03-13
/ 1
AnewmethodologyofvoiceconversionincepstrumeigenspacebasedonstructuredGaussianmixturemodelisproposedfornon-parallelcorporawithoutjointtraining.Foreachspeaker,thecepstrumfeaturesofspeechareextracted,andmappedtotheeigenspacewhichisformedbyeigenvectorsofitsscattermatrix,therebytheStructuredGaussianMixtureModelintheEigenSpace(SGMM-ES)istrained.Thesourceandtargetspeaker’sSGMM-ESarematchedbasedonAcousticUniversalStructure(AUS)principletoachievespectrumtransformfunction.Experimentalresultsshowthespeakeridentificationrateofconversionspeechachieves95.25%,andthevalueofaveragecepstrumdistortionis1.25whichis0.8%and7.3%higherthantheperformanceofSGMMmethodrespectively.ABXandMOSevaluationsindicatetheconversionperformanceisquiteclosetothetraditionalmethodundertheparallelcorporacondition.TheresultsshowtheeigenspacebasedstructuredGaussianmixturemodelforvoiceconversionunderthenon-parallelcorporaiseffective.