Science Cast

Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions

Florian Lux
October 27, 2023 8:12am

Voice is AI-generated

Connected to paper. This paper is a preprint and has not been certified by peer review.

Controllable Generation of Artificial Speaker Embeddings through Discovery of Principal Directions

arXiv, October 26, 2023 12:00am

Authors

Florian Lux, Pascal Tilli, Sarina Meyer, Ngoc Thang Vu

Abstract

Customizing voice and speaking style in a speech synthesis system with intuitive and fine-grained controls is challenging, given that little data with appropriate labels is available. Furthermore, editing an existing human's voice raises ethical concerns. In this paper, we propose a method to generate artificial speaker embeddings that cannot be linked to a real human while offering intuitive and fine-grained control over the voice and speaking style of the embeddings, without requiring any labels for speaker or style. The artificial and controllable embeddings can be fed to a speech synthesis system that was conditioned on embeddings of real humans during training, without sacrificing privacy during inference.
