Browsing Conference Papers (IBC) by Title
Previous Page
Now showing items 8-11 of 11
-
A dual-layer Danish speech corpus for perception studiesChristiansen, Thomas Ulrich; Juel Henrichsen, Peter (Frederiksberg, 2012)[More information][Less information]
Abstract: In this paper, we present the newly established Danish speech corpus PiTu. The corpus consists of recordings of 28 native Danish talkers (14 female and 14 male) each reproducing (i) a series of nonsense syllables, and (ii) a set of authentic natural language sentences. The speech corpus is tailored for investigating the relationship between early stages of the speech perceptual process and later stages. We present our considerations involved in preparing the experimental set-up, producing the anechoic recordings, compiling the data, and exploring the materials in linguistic research. We report on a small pilot experiment demonstrating how PiTu and similar speech corpora can be used in studies of prosody as a function of semantic content. The experiment addresses the issue of whether the governing principles of Danish prosody assignment is mainly talker-specific or mainly content-typical (under the specific experimental conditions). The corpus is available at http://amtoolbox.sourceforge.net/pitu/. URI: http://hdl.handle.net/10398/8619 Files in this item: 1
Peter_Juel_Henrichsen_2012_3.pdf (105.4Kb) -
A Multi-lingual Speech Corpus for Cognitive ResearchJuel Henrichsen, Peter; Uneson, Marcus (Frederiksberg, 2012)[More information][Less information]
Abstract: We present the speech corpus SMALLWorlds (Spoken Multi-lingual Accounts of Logically Limited Worlds), newly established and still growing. SMALLWorlds contains monologic descriptions of scenes or worlds which are simple enough to be formally describable. The descriptions are instances of content-controlled monologue: semantically “pre-specified” but still bearing most hallmarks of spontaneous speech (hesitations and filled pauses, relaxed syntax, repetitions, self-corrections, incomplete constituents, irrelevant or redundant information, etc.) as well as idiosyncratic speaker traits. In the paper, we discuss the pros and cons of data so elicited. Following that, we present a typical SMALLWorlds task: the description of a simple drawing with differently coloured circles, squares, and triangles, with no hints given as to which description strategy or language style to use. We conclude with an example on how SMALLWorlds may be used: unsupervised lexical learning from phonetic transcription. At the time of writing, SMALLWorlds consists of more than 250 recordings in a wide range of typologically diverse languages from many parts of the world, some unwritten and endangered. URI: http://hdl.handle.net/10398/8618 Files in this item: 1
Peter_Juel_Henrichsen_2012_2.pdf (172.0Kb) -
Christiansen, Thomas Ulrich; Juel Henrichsen, Peter (Frederiksberg, 2012)[More information][Less information]
Abstract: Digital hearing aids use a variety of advanced digital signal processing methods in order to improve speech intelligibility. These methods are based on knowledge about the acoustics outside the ear as well as psychoacoustics. This paper investigates the recent observation that speech elements with a high degree of information can be robustly identified based on basic acoustic properties, i.e., function words have greater spectral tilt than content words for each of the 18 Danish talkers investigated. In this paper we examine these spectral tilt differences as a function of time based on a speech material six times the duration of previous investigations. Our results show that the correlation of spectral tilt with information content is relatively constant across time, even if averaged across talkers. This indicates that it is possible to devise a robust method for estimating information density in the speech signal based on computationally simple short-term band-level differences. The principle described here has the potential to improve speech transduction in hearing aids and cochlear implants. In addition, the concept of information-based speech transduction may also be applicable in automatic speech recognition systems. URI: http://hdl.handle.net/10398/8617 Files in this item: 1
Peter_Juel_Henrichsen_1.pdf (478.2Kb) -
Towards a Framework of Personalization TechniquesRazmerita, Liana; Nabeth, Thierry; Kirchner, Kathrin (Frederiksberg, 2012)[More information][Less information]
Abstract: This paper aims to elaborate on the role of user modelling for personalization and enhanced attention support. User modelling is an important element in the management of personal profiles and identity of users, but also a key element for providing adaptive features and personalized interaction. In this paper, we present personalization as the process consisting on the customization, and the adaptation of the interaction along the structure, the content, the modality, the presentation and the level of attention required. The paper surveys personalization techniques and provides concrete examples of personalized interaction. In particular, the paper focuses on the role of user modeling for enhanced, personalized user support within interactive applications. The key contribution of the paper is to propose a framework of personalization techniques and to identify new forms of personalization that aim at taking into account human cognitive capabilities and emotions. URI: http://hdl.handle.net/10398/8602 Files in this item: 1
Razmerita.pdf (148.2Kb)
Previous Page
Now showing items 8-11 of 11