Refine
Year of publication
Document Type
- Article (185)
- Part of Periodical (69)
- Preprint (62)
- Book (37)
- Part of a Book (29)
- Conference Proceeding (27)
- Working Paper (15)
- Report (8)
- Doctoral Thesis (4)
- Other (3)
Language
- English (443) (remove)
Has Fulltext
- yes (443)
Keywords
- Computerlinguistik (28)
- Deutsch (20)
- Syntax (16)
- Japanisch (15)
- new species (11)
- Grammatik (10)
- Multicomponent Tree Adjoining Grammar (9)
- Optimalitätstheorie (9)
- Maschinelle Übersetzung (8)
- Syntaktische Analyse (8)
Institute
- Extern (443) (remove)
The article consists in a comparative reading of three novels: Um rio chamado tempo by Mia Couto, Le pain des corbeaux by Lhoussain Azergui and Paw królowej by Dorota Masłowska. In spite of the difference of the historical circumstances of Mozambique, Morocco and Poland, these three books meet at an intersecting point: the emergence of an intelligentsia that uses literacy and writing as an instrument to deconstruct the post-colonial concept of nation and to operate a trans-colonial renegotiation of identity. By the notion of trans-colonial, I understand the opposition against new kinds of symbolic violence that emerged after the end of the colonial period; here this new form of oppression is related to the concept of national unity – an artificial construct that leaves no place for a dualism or pluralism of cultural reality (two shores of the Zambezi river, Arab and Berber dualism in Morocco, "small homelands" in Poland).
The young heroes of the novels grasp the pen in order to break through the falseness or the taboos created by the fathers, establishing, at the same time, the relation of solidarity with the world of the grandfathers. The act of writing becomes an actualization of the ancestral universe of magic. The settlement of accounts with the parental generation concerns the vision of nation built upon the resistance against the colonizer (it also refers to the Polish cultural formation, based on the tradition of uprisings and resistance against the Russians).
The impact of naval sonar on beaked whales is of increasing concern. In recent years the presence of gas and fat embolism consistent with decompression sickness (DCS) has been reported through postmortem analyses on beaked whales that stranded in connection with naval sonar exercises. In the present study, we use basic principles of diving physiology to model nitrogen tension and bubble growth in several tissue compartments during normal div ng behavior and for several hypothetical dive profiles to assess the risk of DCS. Assuming that normal diving does not cause nitrogen tensions in excess of those shown to be safe for odontocetes, the modeling indicates that repetitive shallow dives, perhaps as a consequence of an extended avoidance reaction to sonar sound, can indeed pose a risk for DCS and that this risk should increase with the duration of the response. If the model is correct, then limiting the duration of sonar exposure to minimize the duration of any avoidance reaction therefore has the potential to reduce the risk of DCS.
The taxonomic position of Onthophagus (Palaeonthophagus) lemuroides d’Orbigny, 1898 and Onthophagus
(Palaeonthophagus) fortigibber Reitter, 1909 is discussed (Coleoptera: Scarabaeidae: Scarabaeinae: Onthophagini).
A key to the species is given. Photos of type specimens of the two taxa and significant chromatic varieties, and
drawings of aedeagi are presented.
As editor of the next iteration of the Köchel Catalogue, I have to deal with the current (sixth) edition’s Appendix C, devoted to "Doubtful and Misattributed Works." My goal is to reduce the potentially vast dimensions of that appendix to only those works for which some connection to Mozart cannot be ruled out. In the decades since 1964, when the current edition of Köchel was published, many of the works listed in Appendix C have been convincingly attributed to other composers. Other works therein can confidently be dismissed as never having had any meaningful connection to Mozart. Yet even after removing the reattributed and trivially misattributed works from the appendix, we are left with a handful of works that may possibly have had something to do with Mozart, even if clear evidence one way or the other remains elusive. One must, of course, be cautious in removing questionable and doubtful works from the catalogue, as the present case-study will illustrate. The work under consideration, catalogued as K6 Anh. C 9.07, is an unaccompanied piece for three or four voices with the text "Venerabilis barba capucinorum." ...
Six new species ofTrichoptera are described and figured, belonging to the families Goeridae and Leptoceridae. The goerid species are Goera baishanzuensis new species and Goera recta new species. The leptocerid species are Setodes chlorinus new species, Ceraclea (Athripsodina) semicircularis new species, Ceraclea (Athripsodina) brachyclada new species, and Ceraclea (Athripsodina) vaciva new species (Leptoceridae).
Popular culture is always in process; its meanings can never be identified in a text, for texts are activated, or made meaningful, only in social relations and in intertextual relations. This activation of the meaning potential of a text can occur only in the social and cultural relationship into which it enters. (Fiske, 1991a: 3)
Evidence is presented that the subspecies Chrysobothris thoracica guadeloupensis Descarpentries, 1981
(Coleoptera: Buprestidae) should be recognized at the species level. Character evidence is provided to separate C.
guadeloupensis, new status, from C. thoracica Fabricius, 1798. Both species are illustrated with habitus photographs
and images of the male genitalia.
In linguistics and the philosophy of language, the mass/count distinction has traditionally been regarded as a bi-partition on the nominal domain, where typical instances are nouns like "beef" (mass) vs."cow" (count). In the present paper, we argue that this partition reveals a system that is based on both syntactic features and conceptual features, and present experimental evidence suggesting that the discrimination of the two kinds of features has a psychological reality.
Body image dissatisfaction is a serious, global problem that negatively affects life satisfaction. Several claims have been made about the possible psychological benefits of naturist activities, but very little empirical research has investigated these benefits or any plausible explanations for them. In three studies—one large-scale, cross-sectional study (n = 849), and 2 prospective studies (n = 24, n = 100) this research developed and applied knowledge about the possible benefits of naturist activities. It was found that more participation in naturist activities predicted greater life satisfaction—a relationship that was mediated by more positive body image, and higher self-esteem (Study 1). Applying these findings, it was found that participation in actual naturist activities led to an increase in life satisfaction, an effect that was also mediated by improvements in body image and self-esteem (Studies 2 and 3). The potential benefits of naturism are discussed, as well as possible future research, and implications for the use of naturist activities.
Acoma howdenorum, Acoma westcotti, Acoma quadrilaminata, and Acoma cimarron (Coleoptera: Scarabaeidae: Melolonthinae), all new species, are described from Yuma County, Arizona, USA, and Baja California Sur, Baja California (Norte), and Sonora, Mexico, respectively. Habitus of the four new species is illustrated, and an updated key to the described species in the genus is provided. Distribution and variation of Acoma glabrata Cazier are also discussed.
The Orizabus Fairmaire (Coleoptera: Scarabaeidae: Dynastinae: Pentodini) of the USA are reviewed. Orizabus pinalicus new species and O. mcclevei new species are described. Lectotypes are here designated for eight species names: Bothynus pyriformis LeConte, Pseudaphonus lucidus Casey, Orizabus snowii Horn, Orizabus cultripes Fairmaire, Orizabus isodonoides Fairmaire, Orizabus sallei Fairmaire, Orizabus fontinalis Casey, and Orizabus ponderosus Casey. Illustrations of diagnostic characters and a key to the five included species are presented. The Mexican species O. isodonoides and O. rubricollis Prell are also illustrated for comparison to the new species.
The female of Nothopleurus subsulcatus (Dalman, 1823) (Coleoptera: Cerambycidae: Prioninae: Macrotomini) is described for the first time, and the female of Strongylaspis bullata Bates, 1872 is redescribed. Color photographs of the habitus of both, and key characters for the former are included. New distributional records within Mexico for N. subsulcatus and Strongylaspis championi Bates, 1884 are given.
An additional 137 species and two tribes are added to the cerambycid fauna of Bolivia while 12 species are deleted. This brings the total number of species known from Bolivia to 1,561. Comments and statistics regarding the growth of knowledge on the Bolivian Cerambycid fauna and species endemicity are included.
Ein Vordenker, der in der internationalen Diskussion um « cultural translation » so gut wie nie diskutiert wird, ist Antonio Gramsci. Der Philosoph aus Sardinien, von Kindes Tagen an in Zweisprachigkeit (Sardisch-Italienisch) geübt, hat ein feines Sensorium für kulturelle Differenzen ausgebildet. In seinen Gefängnisjahren übersetzt er – als intellektuelles Training – aus dem Russischen und dem Deutschen ins Italienische, und in den Gefängnisheften setzt er sich wiederholt mit dem Begriff der traducibilità (Übersetzbarkeit) auseinander: Übersetzbarkeit von Sprachen, aber auch von Kulturen. Der Artikel geht den Linien nach, die von Gramscis Überlegungen zu der aktuellen Diskussion gezogen werden können, und diskutiert am Ende vergleichend die Positionen Homi K. Bhabhas und Gayatri Spivaks.
This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text sorts / data types, encoded annotation levels, and linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and the way how these heterogeneous data are integrated into one resource on the other hand.
The first step in methanol metabolism in methylotrophic yeasts, the oxidation of methanol and higher alcohols with molecular oxygen to formaldehyde and hydrogen peroxide, is catalysed by alcohol oxidase (AOX), a 600-kDa homo-octamer containing eight FAD cofactors. When these yeasts are grown with methanol as the carbon source, AOX forms large crystalline arrays in peroxisomes. We determined the structure of AOX by cryo-electron microscopy at a resolution of 3.4 Å. All residues of the 662-amino acid polypeptide as well as the FAD are well resolved. AOX shows high structural homology to other members of the GMC family of oxidoreductases, which share a conserved FAD binding domain, but have different substrate specificities. The preference of AOX for small alcohols is explained by the presence of conserved bulky aromatic residues near the active site. Compared to the other GMC enzymes, AOX contains a large number of amino acid inserts, the longest being 75 residues. These segments are found at the periphery of the monomer and make extensive inter-subunit contacts which are responsible for the very stable octamer. A short surface helix forms contacts between two octamers, explaining the tendency of AOX to form crystals in the peroxisomes.
This paper reports the results of a corpus investigation on case conflicts in German argument free relative constructions. We investigate how corpus frequencies reflect the relative markedness of free relative and correlative constructions, the relative markedness of different case conflict configurations, and the relative markedness of different conflict resolution strategies. Section 1 introduces the conception of markedness as used in Optimality Theory. Section 2 introduces the facts about German free relative clauses, and section 3 presents the results of the corpus study. By and large, markedness and frequency go hand in hand. However, configurations at the highest end of the markedness scale rarely show up in corpus data, and for the configuration at the lowest end we found an unexpected outcome: the more marked structure is preferred.
Weak function word shift
(2004)
The fact that object shift only affects weak pronouns in mainland Scandinavian is seen as an instance of a more general observation that can be made in all Germanic languages: weak function words tend to avoid the edges of larger prosodic domains. This generalisation has been formulated within Optimality Theory in terms of alignment constraints on prosodic structure by Selkirk (1996) in explaining thedistribution of prosodically strong and weak forms of English functionwords, especially modal verbs, prepositions and pronouns. But a purely phonological account fails to integrate the syntactic licensing conditions for object shift in an appropriate way. The standard semantico-syntactic accounts of object shift, onthe other hand, fail to explain why it is only weak pronouns that undergo object shift. This paper develops an Optimality theoretic model of the syntax-phonology interface which is based on the interaction of syntactic and prosodic factors. The account can successfully be applied to further related phenomena in English and German.
German dialects vary in which of the possible orders of the verbs in a 3-verb cluster they allow. In a still ongoing empirical investigation that I am undertaking together with Tanja Schmid, University of Stuttgart (Schmid and Vogel (2004)) we already found that each of the six logically possible permutations of the 3-verb cluster in (1) can be found in German dialects.
This paper argues for a particular architecture of OT syntax. This architecture hasthree core features: i) it is bidirectional, the usual production-oriented optimisation (called ‘first optimisation’ here) is accompanied by a second step that checks the recoverability of an underlying form; ii) this underlying form already contains a full-fledged syntactic specification; iii) especially the procedure checking for recoverability makes crucial use of semantic and pragmatic factors. The first section motivates the basic architecture. The second section shows with two examples, how contextual factors are integrated. The third section examines its implications for learning theory, and the fourth section concludes with a broader discussion of the advantages and disadvantages of the proposed model.
This paper is part of a research project on OT Syntax and the typology of the free relative (FR) construction. It concentrates on the details of an OT analysis and some of its consequences for OT syntax. I will not present a general discussion of the phenomenon and the many controversial issues it is famous for in generative syntax.
The aim of this paper is the exploration of an optimality theoretic architecture for syntax that is guided by the concept of "correspondence": syntax is understood as the mechanism of "translating" underlying representations into a surface form. In minimalism, this surface form is called "Phonological Form" (PF). Both semantic and abstract syntactic information are reflected by the surface form. The empirical domain where this architecture is tested are minimal link effects, especially in the case of "wh"-movement. The OT constraints require the surface form to reflect the underlying semantic and syntactic representations as maximally as possible. The means by which underlying relations and properties are encoded are precedence, adjacency, surface morphology and prosodic structure. Information that is not encoded in one of these ways remains unexpressed, and gets lost unless it is recoverable via the context. Different kinds of information are often expressed by the same means. The resulting conflicts are resolved by the relative ranking of the relevant correspondence constraints.
The argument that I tried to elaborate on in this paper is that the conceptual problem behind the traditional competence/performance distinction does not go away, even if we abandon its original Chomskyan formulation. It returns as the question about the relation between the model of the grammar and the results of empirical investigations – the question of empirical verification The theoretical concept of markedness is argued to be an ideal correlate of gradience. Optimality Theory, being based on markedness, is a promising framework for the task of bridging the gap between model and empirical world. However, this task not only requires a model of grammar, but also a theory of the methods that are chosen in empirical investigations and how their results are interpreted, and a theory of how to derive predictions for these particular empirical investigations from the model. Stochastic Optimality Theory is one possible formulation of a proposal that derives empirical predictions from an OT model. However, I hope to have shown that it is not enough to take frequency distributions and relative acceptabilities at face value, and simply construe some Stochastic OT model that fits the facts. These facts first of all need to be interpreted, and those factors that the grammar has to account for must be sorted out from those about which grammar should have nothing to say. This task, to my mind, is more complicated than the picture that a simplistic application of (not only) Stochastic OT might draw.
In the past, a divide could be seen between ’deep’ parsers on the one hand, which construct a semantic representation out of their input, but usually have significant coverage problems, and more robust parsers on the other hand, which are usually based on a (statistical) model derived from a treebank and have larger coverage, but leave the problem of semantic interpretation to the user. More recently, approaches have emerged that combine the robustness of datadriven (statistical) models with more detailed linguistic interpretation such that the output could be used for deeper semantic analysis. Cahill et al. (2002) use a PCFG-based parsing model in combination with a set of principles and heuristics to derive functional (f-)structures of Lexical-Functional Grammar (LFG). They show that the derived functional structures have a better quality than those generated by a parser based on a state-of-the-art hand-crafted LFG grammar. Advocates of Dependency Grammar usually point out that dependencies already are a semantically meaningful representation (cf. Menzel, 2003). However, parsers based on dependency grammar normally create underspecified representations with respect to certain phenomena such as coordination, apposition and control structures. In these areas they are too "shallow" to be directly used for semantic interpretation. In this paper, we adopt a similar approach to Cahill et al. (2002) using a dependency-based analysis to derive functional structure, and demonstrate the feasibility of this approach using German data. A major focus of our discussion is on the treatment of coordination and other potentially underspecified structures of the dependency data input. F-structure is one of the two core levels of syntactic representation in LFG (Bresnan, 2001). Independently of surface order, it encodes abstract syntactic functions that constitute predicate argument structure and other dependency relations such as subject, predicate, adjunct, but also further semantic information such as the semantic type of an adjunct (e.g. directional). Normally f-structure is captured as a recursive attribute value matrix, which is isomorphic to a directed graph representation. Figure 5 depicts an example target f-structure. As mentioned earlier, these deeper-level dependency relations can be used to construct logical forms as in the approaches of van Genabith and Crouch (1996), who construct underspecified discourse representations (UDRSs), and Spreyer and Frank (2005), who have robust minimal recursion semantics (RMRS) as their target representation. We therefore think that f-structures are a suitable target representation for automatic syntactic analysis in a larger pipeline of mapping text to interpretation. In this paper, we report on the conversion from dependency structures to fstructure. Firstly, we evaluate the f-structure conversion in isolation, starting from hand-corrected dependencies based on the TüBa-D/Z treebank and Versley (2005)´s conversion. Secondly, we start from tokenized text to evaluate the combined process of automatic parsing (using Foth and Menzel (2006)´s parser) and f-structure conversion. As a test set, we randomly selected 100 sentences from TüBa-D/Z which we annotated using a scheme very close to that of the TiGer Dependency Bank (Forst et al., 2004). In the next section, we sketch dependency analysis, the underlying theory of our input representations, and introduce four different representations of coordination. We also describe Weighted Constraint Dependency Grammar (WCDG), the dependency parsing formalism that we use in our experiments. Section 3 characterises the conversion of dependencies to f-structures. Our evaluation is presented in section 4, and finally, section 5 summarises our results and gives an overview of problems remaining to be solved.
In this paper, we investigate the usefulness of a wide range of features for their usefulness in the resolution of nominal coreference, both as hard constraints (i.e. completely removing elements from the list of possible candidates) as well as soft constraints (where a cumulation of violations of soft constraints will make it less likely that a candidate is chosen as the antecedent). We present a state of the art system based on such constraints and weights estimated with a maximum entropy model, using lexical information to resolve cases of coreferent bridging.
We adopt Markert and Nissim (2005)’s approach of using the World Wide Web to resolve cases of coreferent bridging for German and discuss the strength and weaknesses of this approach. As the general approach of using surface patterns to get information on ontological relations between lexical items has only been tried on English, it is also interesting to see whether the approach works for German as well as it does for English and what differences between these languages need to be accounted for. We also present a novel approach for combining several patterns that yields an ensemble that outperforms the best-performing single patterns in terms of both precision and recall.
When a statistical parser is trained on one treebank, one usually tests it on another portion of the same treebank, partly due to the fact that a comparable annotation format is needed for testing. But the user of a parser may not be interested in parsing sentences from the same newspaper all over, or even wants syntactic annotations for a slightly different text type. Gildea (2001) for instance found that a parser trained on the WSJ portion of the Penn Treebank performs less well on the Brown corpus (the subset that is available in the PTB bracketing format) than a parser that has been trained only on the Brown corpus, although the latter one has only half as many sentences as the former. Additionally, a parser trained on both the WSJ and Brown corpora performs less well on the Brown corpus than on the WSJ one. This leads us to the following questions that we would like to address in this paper: - Is there a difference in usefulness of techniques that are used to improve parser performance between the same-corpus and the different-corpus case? - Are different types of parsers (rule-based and statistical) equally sensitive to corpus variation? To achieve this, we compared the quality of the parses of a hand-crafted constraint-based parser and a statistical PCFG-based parser that was trained on a treebank of German newspaper text.
We investigate methods to improve the recall in coreference resolution by also trying to resolve those definite descriptions where no earlier mention of the referent shares the same lexical head (coreferent bridging). The problem, which is notably harder than identifying coreference relations among mentions which have the same lexical head, has been tackled with several rather different approaches, and we attempt to provide a meaningful classification along with a quantitative comparison. Based on the different merits of the methods, we discuss possibilities to improve them and show how they can be effectively combined.
Using a qualitative analysis of disagreements from a referentially annotated newspaper corpus, we show that, in coreference annotation, vague referents are prone to greater disagreement. We show how potentially problematic cases can be dealt with in a way that is practical even for larger-scale annotation, considering a real-world example from newspaper text.
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative investigation to assess which effects or problems are likely to be the most prominent. Several examples where such problems occur are discussed in more detail, and we then propose a generalisation of Poesio, Reyle and Stevenson’s Justified Sloppiness Hypothesis to provide a unified model for these cases of disagreement and argue that a deeper understanding of the phenomena involved allows to tackle problematic cases in a more principled fashion than would be possible using only pre-theoretic intuitions.
Distributional approximations to lexical semantics are very useful not only in helping the creation of lexical semantic resources (Kilgariff et al., 2004; Snow et al., 2006), but also when directly applied in tasks that can benefit from large-coverage semantic knowledge such as coreference resolution (Poesio et al., 1998; Gasperin and Vieira, 2004; Versley, 2007), word sense disambiguation (Mc- Carthy et al., 2004) or semantical role labeling (Gordon and Swanson, 2007). We present a model that is built from Webbased corpora using both shallow patterns for grammatical and semantic relations and a window-based approach, using singular value decomposition to decorrelate the feature space which is otherwise too heavily influenced by the skewed topic distribution of Web corpora.
The avifauna of the island of Flores and its satellite islands from Komodo to Alor is reviewed, combining historical data with recent observations. Recent surveys have added substantially to the data base, especially of the resident forest species, and endangered and endemic taxa, as well as adding a number of migrant and maritime species to the island list. Of particular interest are the rare forest endemics Wallace's Hanging-parrot Loriculus flosculus, the almost unknown Flores Scopsowl Otus alfredi, Flores Monarch Monarcha sacerdotum and Flores Crow Corvus florensis. An appeal is made for further surveys over the eastern part of the island and the eastern island chain.
Hybrid robust deep and shallow semantic processing for creativity support in document production
(2004)
The research performed in the DeepThought project (http://www.project-deepthought.net) aims at demonstrating the potential of deep linguistic processing if added to existing shallow methods that ensure robustness. Classical information retrieval is extended by high precision concept indexing and relation detection. We use this approach to demonstrate the feasibility of three ambitious applications, one of which is a tool for creativity support in document production and collective brainstorming. This application is described in detail in this paper. Common to all three applications, and the basis for their development is a platform for integrated linguistic processing. This platform is based on a generic software architecture that combines multiple NLP components and on robust minimal recursive semantics (RMRS) as a uniform representation language.
Transforming constituent-based annotation into dependency-based annotation has been shown to work for different treebanks and annotation schemes (e.g. Lin (1995) has transformed the Penn treebank, and Kübler and Telljohann (2002) the Tübinger Baumbank des Deutschen (TüBa-D/Z)). These ventures are usually triggered by the conflict between theory-neutral annotation, that targets most needs of a wider audience, and theory-specific annotation, that provides more fine-grained information for a smaller audience. As a compromise, it has been pointed out that treebanks can be designed to support more than one theory from the start (Nivre, 2003). We argue that information can also be added to an existing annotation scheme so that it supports additional theory-specific annotations. We also argue that such a transformation is useful for improving and extending the original annotation scheme with respect to both ambiguous annotation and annotation errors. We show this by analysing problems that arise when generating dependency information from the constituent-based TüBa-D/Z.
Following on the ADEA/APNET study on inter-African Book trade that was commissioned in 1999, ADEA tasked APNET to facilitate the production of national book industry updates in each country. The updates are aimed at encouraging commercial development of inter-African book trade and to make available to the public, total systematic and current situations on the book trade in each country.