Dominguez M, Farrus M, Wanner L. Thematicity-based Prosody Enrichment for Text-to-Speech Applications. 9th International Conference on Speech Prosody 2018



Theoretical studies on the information structure–prosody interface argue that the content packaged in terms of theme and rheme correlates with the intonation of the corresponding sentence with regard to rising and falling patterns (L*+H LH% and H* LL%, respectively). When such a correspondence is used to derive prosody in text-to-speech applications, ToBI labels are often statically mapped to acoustic parameters. Such an approach is insufficient to solve the problem of monotonous synthetic voices for two reasons: it is repetitive with respect to prosody enrichment, and a binary flat theme–rheme representation does not properly describe long complex sentences. In this paper, we introduce a methodology for a more versatile thematicity-based prosody enrichment based on: (i) a hierarchical tripartite thematicity model as proposed in the Meaning–Text Theory, and (ii) a corpus-based approach for the automatic extraction of acoustic parameters (fundamental frequency, breaks and speech rate) that are mapped to a varied range of prosody control tags of the synthesized speech. Such a prosody enrichment has been shown to achieve better results in a perception test when implemented in a TTS system.
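To make the idea of mapping per-span acoustic parameters to prosody control tags more concrete, here is a minimal sketch in Python. It is not the authors' implementation. It assumes that the control tags are SSML `prosody` and `break` elements, that the span labels are `theme`, `rheme` and `specifier`, and that each span already carries measurements relative to its sentence; the `Span` class, its field names and all numeric values are hypothetical. It also flattens the hierarchical model into a simple sequence of spans.

```python
from dataclasses import dataclass
from xml.sax.saxutils import escape


@dataclass
class Span:
    """One thematicity span with hypothetical corpus-derived measurements."""
    label: str         # assumed label set: "theme", "rheme" or "specifier"
    text: str          # words covered by the span
    f0_ratio: float    # span mean F0 divided by sentence mean F0
    rate_ratio: float  # span speech rate divided by sentence speech rate
    break_ms: int      # pause after the span, in milliseconds


def span_to_ssml(span: Span) -> str:
    """Turn one span's measurements into SSML prosody and break tags."""
    # Express pitch as a signed percentage change and rate as a percentage.
    pitch_pct = round((span.f0_ratio - 1.0) * 100)
    rate_pct = round(span.rate_ratio * 100)
    out = (
        f'<prosody pitch="{pitch_pct:+d}%" rate="{rate_pct}%">'
        f"{escape(span.text)}</prosody>"
    )
    # Emit a break only when a pause was measured after the span.
    if span.break_ms > 0:
        out += f'<break time="{span.break_ms}ms"/>'
    return out


def sentence_to_ssml(spans: list[Span]) -> str:
    """Concatenate the tagged spans of one sentence inside a speak element."""
    return "<speak>" + "".join(span_to_ssml(s) for s in spans) + "</speak>"


# Illustrative values only; they are not taken from the paper.
example = [
    Span("theme", "The new model", 1.10, 0.95, 150),
    Span("rheme", "improves perceived naturalness", 0.90, 1.00, 0),
]
ssml = sentence_to_ssml(example)
```

Because the tag values are computed from the measurements of each span rather than looked up by label, two themes with different measurements receive different tags, which is the contrast with a static label-to-parameter mapping that the abstract draws.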

Version at UPF e-repository: http://hdl.handle.net/10230/34905

GitHub account for the author: https://github.com/monikaUPF

Link: https://www.isca-speech.org/archive/SpeechProsody_2018/pdfs/59.pdf

DTIC MdM Strategic Program: Artificial and Natural Intelligence for ICT and beyond
