上一条: Physiological Pitch Range Estimation from a Brief Speech Input: A Study on a Bilingual Parallel Speech Corpus
下一条: A Multi-modal Soft Targets Approach for Pronunciation Erroneous Tendency Detection