OPUS 4 | Search

Computer-assisted transcription and analysis of speech (2001)

Stephany, Ursula ; Bast, Conny ; Lehmann, Katrin

The two papers included in this volume have developed from work with the CHILDES tools and the Media Editor in the two research projects, "Second language acquisition of German by Russian learners", sponsored by the Max Planck Institute for Psycholinguistics, Nijmegen, from 1998 to 1999 (directed by Ursula Stephany, University of Cologne, and Wolfgang Klein, Max Planck Institute for Psycholinguistics, Nijmegen) and "The age factor in the acquisition of German as a second language", sponsored by the German Science Foundation (DFG), Bonn, since 2000 (directed by Ursula Stephany, University of Cologne, and Christine Dimroth, Max Planck Institute for Psycholinguistics, Nijmegen). The CHILDES Project has been developed and is being continuously improved at Carnegie Mellon University, Pittsburgh, under the supervision of Brian MacWhinney. Having used the CHILDES tools for more than ten years for transcribing and analyzing Greek child data there it was no question that I would also use them for research into the acquisition of German as a second language and analyze the big amount of spontaneous speech gathered from two Russian girls with the help of the CLAN programs. When in the spring of 1997, Steven Gillis from the University of Antwerp (in collaboration with Gert Durieux) developed a lexicon-based automatic coding system based on the CLAN program MOR and suitable for coding languages with richer morphologies than English, such as Modern Greek. Coding huge amounts of data then became much quicker and more comfortable so that I decided to adopt this system for German as well. The paper "Working with the CHILDES Tools" is based on two earlier manuscripts which have grown out of my research on Greek child language and the many CHILDES workshops taught in Germany, Greece, Portugal, and Brazil over the years. Its contents have now been adapted to the requirements of research into the acquisition of German as a second language and for use on Windows.

Customizing GermaNet for the use in deep linguistic processing (2001)

Siegel, Melanie ; Xu, Feiyu ; Neumann, Günter

In this paper we show an approach to the customization of GermaNet to the German HPSG grammar lexicon developed in the Verbmobil project. GermaNet has a broad coverage of the German base vocabulary and fine-grained semantic classification; while the HPSG grammar lexicon is comparatively small und has a coarse-grained semantic classification. In our approach, we have developed a mapping algorithm to relate the synsets in GermaNet with the semantic sorts in HPSG. The evaluation result shows that this approach is useful for the lexical extension of our deep grammar development to cope with real-world text understanding.

Speech transcription using MED (2001)

Lehmann, Katrin

MED (Media EDitor) is a program designed to facilitate the transcription of digitized soundfiles into textfiles. It was written by Hans Drexler and Daan Broeder, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands. [...] The aim of MED is to facilitate the transcription of sound into text using a single program. It works on the principle of the coexistence and interaction of two basic elements, the waveform display window and the text window. [...] This means that you no longer need to use both a sound editor and a word processor at the same time in order to transcribe digitized speech files. Instead, you can directly type the sound you hear (and see) via MED into the text window. Furthermore, you can directly link sound portions of the waveform display window to text portions of the text window, so that you can easily locate and listen to the original source of your transcription once the links have been set. In this function the waveform display window and the text window virtually interact with each other.

A tale of two helmets : the Negau A and B inscriptions (2001)

Markey, Tom

The goals of this exercise are essentially threefold: (1) to rescrutinize, archaeologically, epigraphically and linguistically, the pre-Roman inscriptions of the justly famous Negau A and B helmets, (2) to identify "eastward graphemic drift" in preRoman northern Italy and (3) to reconsider and perhaps identify the origin of the Germanic runes in light of (1) and (2). While moving toward these goals, we cite but a sampling of the burgeoning literature, some of which may not be generally known or easily accessible, in these rapidly expanding venues; see Ellis (1998) for a recent overview in English.

Free relative constructions in OT syntax (2001)

Vogel, Ralf

This paper is part of a research project on OT Syntax and the typology of the free relative (FR) construction. It concentrates on the details of an OT analysis and some of its consequences for OT syntax. I will not present a general discussion of the phenomenon and the many controversial issues it is famous for in generative syntax.

Mass and count in language and cognition : some evidence from language comprehension (2001)

Wiese, Heike ; Piñango, Maria Mercedes

In linguistics and the philosophy of language, the mass/count distinction has traditionally been regarded as a bi-partition on the nominal domain, where typical instances are nouns like "beef" (mass) vs."cow" (count). In the present paper, we argue that this partition reveals a system that is based on both syntactic features and conceptual features, and present experimental evidence suggesting that the discrimination of the two kinds of features has a psychological reality.

The renaissance of the theatre of memory (2001)

Matussek, Peter

Giulio Camillo (1480 - 1544) was as well-known in his era as Bill Gates is now. Just like Gates he cherished a vision of a universal Storage and Retrieval System, and just like Microsoft Windows, his ‘Theatre of the Memory’ was, despite constant revision, never completed. Camillo’s legendary Theatre of Memory remained only a fragment, its benefits only an option for the future. When it was finished, the user - so he predicted - would have access to the knowledge of the whole universe. On account of his promising invention, Camillo’s contemporaries called him ‘the divine’. For others, like Erasmus or the Parisian scholars, he was just a ‘quack’, but also this only shows that his reception was as strong as is the case with the computer gurus of our days. Still, Camillo was forgotten immediately after his death. No trace is left of his spectacular databank - except a short treatise which he dictated on his deathbed and which was formulated in the future tense: ‘L’Idea del Theatro’ (1550). ...

From chunks to function-argument structure : a similarity-based approach (2001)

Kübler, Sandra ; Hinrichs, Erhard

Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. Such larger structures are not only desirable for a deeper syntactic analysis. They also constitute a necessary prerequisite for assigning function-argument structure. The present paper offers a similaritybased algorithm for assigning functional labels such as subject, object, head, complement, etc. to complete syntactic structures on the basis of prechunked input. The evaluation of the algorithm has concentrated on measuring the quality of functional labels. It was performed on a German and an English treebank using two different annotation schemes at the level of function argument structure. The results of 89.73% correct functional labels for German and 90.40%for English validate the general approach.

TüSBL : a similarity-based chunk parser for robust syntactic processing (2001)

Kübler, Sandra ; Hinrichs, Erhard

Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. The TüSBL parser extends current chunk parsing techniques by a tree-construction component that extends partial chunk parses to complete tree structures including recursive phrase structure as well as function-argument structure. TüSBLs tree construction algorithm relies on techniques from memory-based learning that allow similarity-based classification of a given input structure relative to a pre-stored set of tree instances from a fully annotated treebank. A quantitative evaluation of TüSBL has been conducted using a semi-automatically constructed treebank of German that consists of appr. 67,000 fully annotated sentences. The basic PARSEVAL measures were used although they were developed for parsers that have as their main goal a complete analysis that spans the entire input.This runs counter to the basic philosophy underlying TüSBL, which has as its main goal robustness of partially analyzed structures.

Harman Dahl's legacy (2001)

Balkee, Raj

It was midnight on Friday 31, December 1999. Harman Dahl fell off his seat at the sound of all hell letting loose around him. He held on to the bench on which he had dozed off and wobbled onto his feet. His senses returned, even though he was still tipsy, under the influence of alcohol. He had been drinking with colleagues for most of the day. ...

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

10 search hits