上一条:Broadcast Attention Learning for Real Telephone Speech Keyword Spotting
下一条:AdaptiveFormer : A Few-shot Speaker Adaptative Speech Synthesis Model based on FastSpeech2