eduzhai > Applied Sciences > Engineering >

CITISEN A Deep Learning-Based Speech Signal-Processing Mobile Application

  • king
  • (0) Download
  • 20210506
  • Save

... pages left unread,continue reading

Document pages: 9 pages

Abstract: In this paper, we present a deep learning-based speech signal-processingmobile application, CITISEN, which can perform three functions: speechenhancement (SE), acoustic scene conversion (ASC), and model adaptation (MA).For SE, CITISEN can effectively reduce noise components from speech signals andaccordingly enhance their clarity and intelligibility. For ASC, CITISEN canconvert the current background sound to a different background sound. Finally,for MA, CITISEN can effectively adapt an SE model, with a few audio files, whenit encounters unknown speakers or noise types; the adapted SE model is used toenhance the upcoming noisy utterances. Experimental results confirmed theeffectiveness of CITISEN in performing these three functions via objectiveevaluation and subjective listening tests. The promising results reveal thatthe developed CITISEN mobile application can potentially be used as a front-endprocessor for various speech-related services such as voice communication,assistive hearing devices, and virtual reality headsets.

Please select stars to rate!


0 comments Sign in to leave a comment.

    Data loading, please wait...