eduzhai > Applied Sciences > Engineering >

MIRNet Learning multiple identities representations in overlapped speech

  • king
  • (0) Download
  • 20210506
  • Save

... pages left unread,continue reading

Document pages: 5 pages

Abstract: Many approaches can derive information about a single speaker s identity fromthe speech by learning to recognize consistent characteristics of acousticparameters. However, it is challenging to determine identity information whenthere are multiple concurrent speakers in a given signal. In this paper, wepropose a novel deep speaker representation strategy that can reliably extractmultiple speaker identities from an overlapped speech. We design a network thatcan extract a high-level embedding that contains information about eachspeaker s identity from a given mixture. Unlike conventional approaches thatneed reference acoustic features for training, our proposed algorithm onlyrequires the speaker identity labels of the overlapped speech segments. Wedemonstrate the effectiveness and usefulness of our algorithm in a speakerverification task and a speech separation system conditioned on the targetspeaker embeddings obtained through the proposed method.

Please select stars to rate!

         

0 comments Sign in to leave a comment.

    Data loading, please wait...
×