Ph.D. theses (ISV)
Browse all publications by
Recent submissions:
-
Med udgangspunkt i støtteverbers leksikaliseringsmønstre i dansk og franskHein, Birgitte (Frederiksberg, 2003)[More information][Less information]
Abstract: Enhver oversætter mellem et germansk sprog som dansk og et romansk sprog som fransk ved, at det ofte er bestemte sproglige konstruktioner, der volder problemer. En af disse konstruktioner består af et støtteverbum og et objekt, der tilsammen danner en semantisk enhed. Da denne konstruktion er hyppigt forekommende, specielt i juridiske og administrative tekster, kan det være af både praktisk og teoretisk værdi at skaffe et klarere billede af, hvordan konstruktionerne idiomatisk opbygges og bruges på de to sprog. Undersøgelsen søger at indskrive sig i en sammenhæng, der vedrører både oversættelse og lingvistisk beskrivelse, ud fra et ønske om at en komparativ beskrivelse skal kunne give en oversætter viden, som han kan bruge i sit praktiske arbejde. De fleste, som har benyttet computer-støttede oversættelser, må være enige i, at det stadig er nødvendigt med kvalificeret menneskelig oversættelse, hvis man skal have en idiomatisk korrekt og brugbart resultat. Der er ganske vist i dag mulighed for computer-støttede ”rå-oversættelser”. Somme tider kan disse oversættelser tjene til for eksempel at give en internetbruger et hurtigt indtryk af indholdet af en web-side på et sprog, som han ikke behersker.... URI: http://hdl.handle.net/10398/8623 Files in this item: 1
Birgitte_Hein.pdf (776.8Kb) -
Haulrich, Martin (Frederiksberg, 2012)[More information][Less information]
Abstract: Parallel treebanks have received increasing attention in the past few years, primarily due to their potential use in statistical machine translation. Creating parallel treebanks manually is a time-consuming and expensive task and for this reason there is considerable interest in creating treebanks automatically. This task can be solved using standard tools such as parsers and aligners. However, because parallel treebanks are based on parallel corpora, we are in a special situation where the same meaning is represented in two different ways. This thesis is about how we can exploit this information to create better parallel treebanks than we can by using standard tools.... URI: http://hdl.handle.net/10398/8385 Files in this item: 1
Martin_Haulrich.pdf (1.932Mb) -
An Eye-tracking and Key-logging StudyTangsgaard Hvelplund, Kristian (Frederiksberg, 2011)[More information][Less information]
Abstract: This study is an empirical investigation of translators’ allocation of cognitive resources, and its specific aim is to identify predictable behaviours and patterns of uniformity in translators’ allocation of cognitive resources in translation. The study falls within the process-oriented translation paradigm and within the more general field of cognitive psychology. Based on models of working memory, attentional control, language comprehension and language production, a theoretical framework was developed on which hypotheses were formulated and evaluated. The study’s empirical investigation fell into three major analyses, which each dealt with one aspect of translators’ allocation of cognitive resources: distribution of cognitive resources, management of cognitive resources and cognitive load. Three indicators were identified: total attention duration (TA duration measured in seconds) indicates the distribution of cognitive resources; attention unit duration (AU duration measured in milliseconds) indicates the amount of time allocated between two attention shifts; and pupil size (measured in millimetres) indicates cognitive load, i.e. workload on working memory.... URI: http://hdl.handle.net/10398/8314 Files in this item: 1
Kristian_T_Hvelplund_SL.pdf (4.820Mb) -
En valensgrammatisk undersøgelseSkot-Hansen, Annemette (Frederiksberg, 2009)[More information][Less information]
Abstract: The purpose of this dissertation is to map the valency structure of a subset of the French adverbs. More specifically, the dissertation seeks to answer the following questions: What valency structure follows from the lexical content of the adverbs investigated? What is the nature of the semantic relation established? What is the status of the valents relative to the adverb and relative to other valents? The empirical object of investigation is focused on adverbs derived from adjectives which take prepositional phrases headed by the preposition à as their complement. In addition, the delimitation chosen for this dissertation is a class of adverbs which share the feature that they carry the suffix -ment, which developed from the Latin noun mens, meaning “spirit/thought/mood/tenor”. It is argued that the fusion of an adverb and mens establishes the general meaning [in an adjective spirit/thought/mood/tenor], i.e. the adverb retains the general quality denoted by the adjective, but the meaning targets the verb situation (at clause level) or the quality (at phrasal level) which saturates the argument of the adverb. Following tradition, the analysis adopted here, takes the verb situation to be realised by the predicate, and the quality to be realised by an adjective phrase, which may be realised by a past participle or, in rare cases, by another adverb. Since the valent is required by the lexical content of the adverb, it is assumed, following Herslund and Sørensen, that the valent is a fundamental valent. Another important feature of the adverbs which are analysed in this dissertation is that they establish a relation between two entities. This means that in addition to its fundamental valent, the adverb takes a further valent which it links with the fundamental valent. This second valent is referred to as the second valent of the adverb. The two valents are analysed as two relata in a relation. Unlike the fundamental valent, the second valent is always at phrasal level. When the adverb functions at clausal level, the second valent is realised as the prepositional object of the preposition phrase headed by à. This realisation is, however, not possible when the adverb functions at phrasal level. It is argued that this is a consequence of the fact that it is impossible to insert other constituents between the adverb and the adjective, adverb or participle which is modified by the adverb. The result is that where the second valent is realised, the adverb moves from preposition to postposition relative to its fundamental valent. In the data investigated the second valent denotes very different entities such as situations denoted by verbs and qualities, but also objects and abstract entities. The individual adverbs which are investigated here each determine their valency. In general there are different sources that allow us to uncover the core meaning of a word. The sources chosen in this dissertation are: the semantic roles assigned by the adverbs, their symmetry, elements of shared semantics or partial synonymy, their morphology and etymological roots. In order to bring together these different sources, the dissertation postulates a denotation design for each adverb. The etymology of the adverbs has been a particularly helpful in determining the relation and valency they establish. In addition to adverb and adjective suffixes, the majority of the adverbs investigated have a preposition in their synchronic morphological make-up which denotes a relation between two entities: some adverbs contain both a preposition and a morpheme from another word class, e.g. comparativement and subséquemment, while others contain only a preposition, e.g. antérieurement and postérieurement. A very small subset does not contain a preposition, but only a single adverb morpheme which denotes the relation in question, so, for instance, the adjectives par and similis, which have formed pareillement and semblablement, denote a relation between two relata. From an etymological perspective, a few adverbs, such as latéralement, do not denote a relation – so it is only through the formal realisation of the preposition phrase that the relation is established. The dissertation maps the etymological and morphological structure of the adverb and the range of functions that the adverb and its valents can have at clausal and phrasal level. The function of the adverb is relevant to the extent that the function affects its semantics and its valency structure. The effect of function is seen in some adverbs when they operate on clausal or on phrasal level and in other adverbs when they modify entire clauses or just the verb. URI: http://hdl.handle.net/10398/7944 Files in this item: 1
Annemette_Skot-Hansen.pdf (2.974Mb) -
Elming, Jakob (Frederiksberg, 2008)[More information][Less information]
Abstract: Reordering has been an important topic in statistical machine translation (SMT) as long as SMT has been around. State-of-the-art SMT systems such as Pharaoh (Koehn, 2004a) still employ a simplistic model of the reordering process to do non-local reordering. This model penalizes any reordering no matter the words. The reordering is only selected if it leads to a translation that looks like a much better sentence than the alternative. Recent developments have, however, seen improvements in translation quality following from syntax-based reordering. One such development is the pre-translation approach that adjusts the source sentence to resemble target language word order prior to translation. This is done based on rules that are either manually created or automatically learned from word aligned parallel corpora. We introduce a novel approach to syntactic reordering. This approach provides better exploitation of the information in the reordering rules and eliminates problematic biases of previous approaches. Although the approach is examined within a pre-translation reordering framework, it easily extends to other frameworks. Our approach significantly outperforms a state-of-the-art phrase-based SMT system and previous approaches to pretranslation reordering, including (Li et al., 2007; Zhang et al., 2007b; Crego & Mari˜ no, 2007). This is consistent both for a very close language pair, English-Danish, and a very distant language pair, English-Arabic. We also propose automatic reordering rule learning based on a rich set of linguistic information. As opposed to most previous approaches that extract a large set of rules, our approach produces a small set of predominantly general rules. These provide a good reflection of the main reordering issues of a given language pair. We examine the influence of several parameters that may have influence on the quality of the rules learned. Finally, we provide a new approach for improving automatic word alignment. This word alignment is used in the above task of automatically learning reordering rules. Our approach learns from hand aligned data how to combine several automatic word alignments to one superior word alignment. The automatic word alignments are created from the same data that has been preprocessed with different tokenization schemes. Thus utilizing the different strengths that different tokenization schemes exhibit in word alignment. We achieve a 38% error reduction for the automatic word alignment URI: http://hdl.handle.net/10398/7922 Files in this item: 1
jakob_elming.pdf (1.033Mb)