Ortega-Llebaria, Marta
(2023)
Creating a cross-linguistic database to investigate speech rhythm.
In: Pitt Momentum Fund 2023.
Abstract
Rhythm, which refers to the sensation of isochrony conveyed by repeating patterns in speech, is key to add emotion and pragmatic meanings to the dialogues of, for example, voice assistants such as Siri and Alexa. Despite its importance, few papers tested new promising rhythm measures due to the technical challenges involved in this research. The present project aims at ameliorating these challenges by creating a database to study prosody cross-linguistically together with a set of scripts that output over 20 rhythm measures. With the help of undergraduate students majoring in different languages, the database will consist of TED talks in 8 languages, their orthographic transcriptions, and annotated speech wave forms (words, syllables, phonemes, and pauses). Speech annotations will be automatically generated by freely available aligners and sound editing programs. This database will be the input to the scripts that output the 20+ rhythm measures. Both tools, the database and scripts, together with a manual will be made available to the research community worldwide in order to promote international dialogue in this field.
Share
Citation/Export: |
|
Social Networking: |
|
Details
Metrics
Monthly Views for the past 3 years
Plum Analytics
Altmetric.com
Actions (login required)
|
View Item |