Linguistik
Refine
Year of publication
Document Type
- Article (186)
- Preprint (69)
- Part of a Book (65)
- Working Paper (40)
- Conference Proceeding (33)
- Book (24)
- Review (12)
- Part of Periodical (7)
- Course Material (1)
- Report (1)
Language
- Croatian (150)
- English (141)
- German (120)
- Portuguese (9)
- Turkish (7)
- mis (4)
- French (3)
- Italian (2)
- Multiple languages (1)
- Spanish (1)
Has Fulltext
- yes (438)
Is part of the Bibliography
- no (438)
Keywords
- Kroatisch (50)
- Linguistik (50)
- Rezension (48)
- Deutsch (35)
- Computerlinguistik (32)
- Syntax (19)
- Japanisch (18)
- Grammatik (17)
- Namenkunde (17)
- Rezensionen (17)
Institute
- Extern (438) (remove)
Im weiteren Teil dieses Einleitungsartikels werde ich […] auf einige offene Fragen in der Argumentationstheorie generell eingehen und dann auf solche, die speziell durch die beiden Arbeiten in diesem Arbeitspapier aufgeworfen wurden. Danach werde ich auf die Wahl des Datenmaterials eingehen und auf die speziellen Probleme, die das gewählte Medium (Internet-Forum) mit sich bringt. Anschließend werden sowohl konvergente als auch divergente Ergebnisse der beiden Arbeiten diskutiert, letztere insbesondere in Hinblick auf die Frage, ob sie durch den unterschiedlichen Diskussionsgegenstand bedingt sind. Zum Schluss werden dann noch einige terminologische Details angesprochen.
Elision of /h, ?/ in the Shirazi Dialect of Persian (SHDP) : an optimality theory based analysis
(2010)
Until recently, many researchers have shown interest in studying lenitions, which are examples of the most common universal types of phonological processes. Elision of laryngeals (glottal fricative /h/ and glottal stop /?/) is one of the most common phonological alternations exhibited in the Shirazi dialect of Persian (SHDP) which to the knowledge of the researchers, has not been studied to date. This paper seeks to provide a description of the facts about this common phonological alternation in the addressed regional dialect of Persian and points out some main differences between the behavior of these processes in SHDP and Standard Persian (SP). The analysis is cast in an Optimal Theoretic (OT) framework (McCarthy and Prince 1995, 2001), which holds that linguistic forms are the outcome of interaction among violable universal constraints. The present study shows that the addressed processes of consonant deletion in SHDP are restricted by syllabic position and are conditioned by coda position, intervocalic position or consonant clusters. They are usually blocked in the onset, but there are cases where reduction is allowed in the onset of the stressed syllable. Thus, the study adds SHDP to the list of languages which permit lenition in the onset of the stressed syllable. The addressed processes of elision are always blocked in word-initial position and laryngeal elision is always followed by Compensatory lengthening (CL), even after deletion from the onset of the stressed syllable.
Key words: lenition or weakening, laryngeal elision, phonological processes, Optimality Theory
Der Präteritumschwund dürfte eine der markantesten morphologischen Entwicklungen des Alemannischen (bzw. Oberdeutschen) bilden. Sein Verlauf in schweizerdeutschen Dialekten ist mit der Arbeit von JÖRG (1976) dokumentiert und ungefiibr ins 16. Jahrhundert zu datieren. Konsequenz der Aufgabe dieses synthetischen Verfahrens war die Verlegung der Vergangenheitskategorie in die Syntax. Dies hat zu einer starken typologischen Drift des Alemannischen in Richtung eines analytischen und zusätzlich klammernden Sprachtyps geführt: Das Perfekt ist zweigliedrig (finites Auxiliar + infinites Vollverb), das Plusquamperfekt sogar dreigliedrig (sogenanntes doppeltes Perfekt). Finites und infinites Verb können durch ganze Satzglieder, Adverbien etc. voneinander getrennt sein, sind also unter Umständen weit voneinander entfernt, was das Ausdrucksverfahren nicht gerade vereinfacht. Der Präteritumschwuud kontrastiert in eigentümlicher Weise mit dem Erhalt, ja sogar dem sekundären Ausbau synthetischer Konjunktivformen (sowohl Konjunktiv I als auch II), die weiteres morphologisches Charakteristikum des Alemannischen sind, doch nicht Thema dieses Beitrags (hierzu s. NÜBLING 1997).
U radu se raspravlja o etimologiji hrvatske riječi jaram i srodnih riječi u ostalim slavenskim jezicima. Prikazuje se da je ta riječ u baltoslavenskome bila množinski oblik imenice koja je dala hrvatsko rame. Raspravljaju se formalne teškoće te etimologije i analiziraju se brojne usporedne izvedenice u slavenskome.
U članku se raspravlja o istrorumunjskim nazivima koji su u bilo kakvoj svezi sa stablom (općenito). Obrađeni su oblici dio opsežnijega korpusa od preko 8000 oblika koji smo sami prikupili (istraživanja su obavljana u više navrata od 1985. godine – posljednje provjere i dopune korpusa obavljene su tekuće, 2010., godine i to u svim mjestima gdje se i danas govori istrorumunjski: Žejane, Šušnjevica, Nova Vas, Jesenovik, Letaj, Brdo, Škabići, Trkovci, Zankovci, Miheli, Kostrčan). Uz svaku obrađenu riječ navode se odgovarajuće iz svih dostupnih nam istrorumunjskih repertoara. Za svaku riječ daje se etimologijsko tumačenje do kojeg se dolazi usporedbom pojedinog termina s odgovarajućim čakavskim i istromletačkim okolnim govorima, a ako je riječ domaća, daju se i paralele iz ostala tri rumunjska dijalekta. U obrađenoj građi prevladavaju posuđenice iz čakavskih govora. Domaćih je riječi 14, no za neke se to ne može s potpunom sigurnošću utvrditi jer se u potpunosti poklapaju s čakavskim ekvivalentima. Izravnih posuđenica iz (istro)mletačkoga nema.
U ovome se članku obrađuju posuđenice mletačkoga podrijetla u sjevernočakavskom govoru Boljuna u sjeveroistočnoj Istri. Cilj rada bio je etimološki obraditi pridjeve i imenice iz semantičke domene karakternih osobina koji nisu bili uvršteni u Skokov Etimologijski rječnik ni u Vinjine Jadranske etimologije. Polazišna građa ekscerpirana je iz rukopisnoga Rječnika boljunskih govora Ivana Francetića, provjerena je na terenu te je etimološkom i leksičkom analizom dovedena u vezu s istromletačkim, venecijanskim, tršćanskim i talijanskim (etymologia proxima) te s latinskim ili drugim etimonom (etymologia remota), a na sinkronijskoj i dijatopijskoj razini s rječničkim potvrdama u ostalim čakavskim govorima Istre, Kvarnera i Dalmacije.
Opisuje se i analizira tvorba etnika i ktetika u kajkavskom narječju. Raščlamba se temelji na podatcima prikupljenima terenskim istraživanjima posljednjih gotovo pedeset godina u Upitnicima za Hrvatski jezični atlas (HJA), koji se izrađuje u Institutu za hrvatski jezik i jezikoslovlje, te na podatcima iz dijalektnih rječnika.
In this paper, we investigate the role of sub-optimality in training data for part-of-speech tagging. In particular, we examine to what extent the size of the training corpus and certain types of errors in it affect the performance of the tagger. We distinguish four types of errors: If a word is assigned a wrong tag, this tag can belong to the ambiguity class of the word (i.e. to the set of possible tags for that word) or not; furthermore, the major syntactic category (e.g. "N" or "V") can be correctly assigned (e.g. if a finite verb is classified as an infinitive) or not (e.g. if a verb is classified as a noun). We empirically explore the decrease of performance that each of these error types causes for different sizes of the training set. Our results show that those types of errors that are easier to eliminate have a particularly negative effect on the performance. Thus, it is worthwhile concentrating on the elimination of these types of errors, especially if the training corpus is large.
This paper is concerned with the tagging of spatial expressions in German newspaper articles, assigning a meaning to the expression and classifying the usages of the spatial expression and linking the derived referent to an event description. In our system, we implemented the activation of concepts in a very simple fashion, a concept is activated once (with a cost depending on the item that activated it) and is left activated thereafter. As an example, a city also activates the nodes for the region and the country it is part of, so that cities from one country are chosen over cities from different countries. A test corpus of 12 German newspaper articles was tested regarding several disambiguation strategies. Disambiguation was carried out via a beam search to find an approximately cost-optimal solution for the conflict set of potential grounding candidates for the tagged spatial expression. Test showed that the disambiguation strategies improved accuracy significantly.
This paper proposes a compositional semantics for lexicalized tree adjoining grammars (LTAG). Tree-local multicompnent derivations allow seperation of semantiv contribution of a lexical item into one component contributing to the predicate argument structure and second a component contributing to scope semantics. Based on this idea a syntx-semantics interface is presented where the compositional semantics depends only on the derivation structure. It is shown that the derivation structure allows an appropriate amount of underspecification. This is illustrated by investigating underspecified representations for quantifier scpoe ambiguities and related phenomena such as adjunct scope and island constraints.
In this paper we propose a compositional semantics for lexicalized tree-adjoining grammar (LTAG). Tree-local multicomponent derivations allow separation of the semantic contribution of a lexical item into one component contributing to the predicate argument structure and a second component contributing to scope semantics. Based on this idea a syntax-semantics interface is presented where the compositional semantics depends only on the derivation structure. It is shown that the derivation structure (and indirectly the locality of derivations) allows an appropriate amount of underspecification. This is illustrated by investigating underspecified representations for quantifier scope ambiguities and related phenomena such as adjunct scope and island constraints.
TT-MCTAG lets one abstract away from the relative order of co-complements in the final derived tree, which is more appropriate than classic TAG when dealing with flexible word order in German. In this paper, we present the analyses for sentential complements, i.e., wh-extraction, thatcomplementation and bridging, and we work out the crucial differences between these and respective accounts in XTAG (for English) and V-TAG (for German).
Wiederholt ist auf das onomastische Dokumentations- und Forschungspotential digital gespeicherter Telefonanschlüsse hingewiesen worden. Auch sind auf dieser Basis bereits Untersuchungen zum Inventar und zur Verbreitung deutscher Familiennamen entstanden. Durch neue Software zur Auswertung digitaler Telefonanschlüsse ergeben sich inzwischen fast unbegrenzte Möglichkeiten, das Familiennamensystem Deutschlands erstmals überhaupt zuverlässig zu erfassen, zu dokumentieren und auf bestimmte Phänomene hin zu befragen. In Minutenschnelle ist es nun beispielsweise möglich, alle Komposita auf -müller in Listen zusammenzustellen und in Karten deutschlandweit in ihrer Verbreitung sichtbar zu machen.
This paper addresses the problem ofconstraints for relative quantifier sope, in partiular in inverse linking readings wherecertain scope orders are exluded. We show how to account for such restrictions in the Tree Adjoining Grammar (TAG) framework by adopting a notion offlexible composition. In the semantics we use for TAG we introduce quantifier sets that group quantifiers that are "glued" together in the sense that no other quantifieran scopally intervene between them. Theflexible composition approach allows us to obtain the desired quantifier sets and thereby the desiredconstraints for quantifier sope.
Fluch- und Schimpfwortschätze sind aus kontrastiver Perspektive bisher kaum analysiert worden, sieht man von einer Vielzahl populärwissenschaftlicher Publikationen ab. Wissenschaftliche Publikationen beziehen sich meist auf eine Einzelsprache und greifen bei der Erklärung der Motive oft zu kurz, weil sie gerade benachbarte Kulturen und Sprachen (auch Dialektgebiete) zu wenig im Blick haben (Dundes 1983). Der vorliegende Beitrag leistet eine vergleichende Zusammenstellung der Fluch- und Schimpfwortschätze dreier mehr oder weniger benachbarter Sprachen, des (nördlichen) Niederländischen, des Deutschen und des Schwedischen, also zweier eng verwandter westgermanischer und einer nordgermanischen Sprache.
Die Idee, das Isländische - eine archaische, am Nordwestrand des germanischen Sprachgebiets gelegene skandinavische Inselsprache - auf die Möglichkeiten des Sexusausdrucks hin zu untersuchen, entstand imZusammenhang einer kontrastiven Arbeit zum Sexusausdruck im Deutschen und Schwedischen (siehe Nübling 2000). Das Schwedische verfügt nur noch über zwei Genera, das sog. Utrum (das aus dem Zusammenfall von Femininum und Maskulinum hervorgeht) und das Neutrum.
This article examines the expression of natural gender in Icelandic nouns denoting human beings. Particular attention will be paid to the system's symmetry with regards to nouns denoting women and men. Our society consists more or less exactly of half women and half men. One would therefore assume that systems for terms denoting persons would also be symmetrically organised. Yet this assumption could not be further from the truth, and not just in single isolated cases, but in many languages: I will attempt to show that Icelandic has numerous methods for referring to women, but also many barriers and idiosyncrasies.
Nakon kratkoga prikaza geografskoga položaja zagorskoga mjesta Šemnice Gornje u radu se na osnovi vlastitoga terenskog istraživanja i dostupne literature iznose fonološka obilježja govora toga mjesta. Opisuje se naglasni sustav i unutar toga razlike koje se mogu uočiti u odnosu na osnovnu kajkavsku akcentuaciju, te obilježja samoglasničkoga i suglasničkoga sustava.
Ključne riječi: Šemnica Gornja ; govor ; naglasni sustav; samoglasnički i suglasnički sustav
Bis heute bildet die Morphologie keinen Schwerpunkt der Dialektlinguistik. Dies wird immer wieder moniert. H. Tatzreiter (1994) kommt nach seinem Streifzug durch die "Bibliographie zur Grammatik der deutschen Dialekte" von P. Wiesinger / E. Raffin (1982) zu dem Ergebnis, "daß die Leistungskurve im grammatischen Bereich ,von der Lautlehre über die Formen- und Wortbildungslehre bis zur Satzlehre' steil abfällt" (S. 30 bzw. P. Wiesinger / E. Raffin 1982, S. XXIX). Ein weiteres Problem sieht er in der besonders durch die angelsächsische Tradition motivierten Vernachlässigung der Morphologie die zwischen der phonologischen, lexikalischen und syntaktischen Ebene ein gefährdetes Dasein fristet" (S. 30): "So lange die Morphologie sich nicht aus der 'Umklammerung' der Phonologie und Syntax lösen kann, um eigenständig als Forschungsobjekt zu gelten, wird es um die umfassende Erforschung und Darstellung schlecht bestellt sein" (S. 34).
This paper is part of a research project on OT Syntax and the typology of the free relative (FR) construction. It concentrates on the details of an OT analysis and some of its consequences for OT syntax. I will not present a general discussion of the phenomenon and the many controversial issues it is famous for in generative syntax.
Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. Such larger structures are not only desirable for a deeper syntactic analysis. They also constitute a necessary prerequisite for assigning function-argument structure. The present paper offers a similaritybased algorithm for assigning functional labels such as subject, object, head, complement, etc. to complete syntactic structures on the basis of prechunked input. The evaluation of the algorithm has concentrated on measuring the quality of functional labels. It was performed on a German and an English treebank using two different annotation schemes at the level of function argument structure. The results of 89.73% correct functional labels for German and 90.40%for English validate the general approach.
Transforming constituent-based annotation into dependency-based annotation has been shown to work for different treebanks and annotation schemes (e.g. Lin (1995) has transformed the Penn treebank, and Kübler and Telljohann (2002) the Tübinger Baumbank des Deutschen (TüBa-D/Z)). These ventures are usually triggered by the conflict between theory-neutral annotation, that targets most needs of a wider audience, and theory-specific annotation, that provides more fine-grained information for a smaller audience. As a compromise, it has been pointed out that treebanks can be designed to support more than one theory from the start (Nivre, 2003). We argue that information can also be added to an existing annotation scheme so that it supports additional theory-specific annotations. We also argue that such a transformation is useful for improving and extending the original annotation scheme with respect to both ambiguous annotation and annotation errors. We show this by analysing problems that arise when generating dependency information from the constituent-based TüBa-D/Z.
In the past, a divide could be seen between ’deep’ parsers on the one hand, which construct a semantic representation out of their input, but usually have significant coverage problems, and more robust parsers on the other hand, which are usually based on a (statistical) model derived from a treebank and have larger coverage, but leave the problem of semantic interpretation to the user. More recently, approaches have emerged that combine the robustness of datadriven (statistical) models with more detailed linguistic interpretation such that the output could be used for deeper semantic analysis. Cahill et al. (2002) use a PCFG-based parsing model in combination with a set of principles and heuristics to derive functional (f-)structures of Lexical-Functional Grammar (LFG). They show that the derived functional structures have a better quality than those generated by a parser based on a state-of-the-art hand-crafted LFG grammar. Advocates of Dependency Grammar usually point out that dependencies already are a semantically meaningful representation (cf. Menzel, 2003). However, parsers based on dependency grammar normally create underspecified representations with respect to certain phenomena such as coordination, apposition and control structures. In these areas they are too "shallow" to be directly used for semantic interpretation. In this paper, we adopt a similar approach to Cahill et al. (2002) using a dependency-based analysis to derive functional structure, and demonstrate the feasibility of this approach using German data. A major focus of our discussion is on the treatment of coordination and other potentially underspecified structures of the dependency data input. F-structure is one of the two core levels of syntactic representation in LFG (Bresnan, 2001). Independently of surface order, it encodes abstract syntactic functions that constitute predicate argument structure and other dependency relations such as subject, predicate, adjunct, but also further semantic information such as the semantic type of an adjunct (e.g. directional). Normally f-structure is captured as a recursive attribute value matrix, which is isomorphic to a directed graph representation. Figure 5 depicts an example target f-structure. As mentioned earlier, these deeper-level dependency relations can be used to construct logical forms as in the approaches of van Genabith and Crouch (1996), who construct underspecified discourse representations (UDRSs), and Spreyer and Frank (2005), who have robust minimal recursion semantics (RMRS) as their target representation. We therefore think that f-structures are a suitable target representation for automatic syntactic analysis in a larger pipeline of mapping text to interpretation. In this paper, we report on the conversion from dependency structures to fstructure. Firstly, we evaluate the f-structure conversion in isolation, starting from hand-corrected dependencies based on the TüBa-D/Z treebank and Versley (2005)´s conversion. Secondly, we start from tokenized text to evaluate the combined process of automatic parsing (using Foth and Menzel (2006)´s parser) and f-structure conversion. As a test set, we randomly selected 100 sentences from TüBa-D/Z which we annotated using a scheme very close to that of the TiGer Dependency Bank (Forst et al., 2004). In the next section, we sketch dependency analysis, the underlying theory of our input representations, and introduce four different representations of coordination. We also describe Weighted Constraint Dependency Grammar (WCDG), the dependency parsing formalism that we use in our experiments. Section 3 characterises the conversion of dependencies to f-structures. Our evaluation is presented in section 4, and finally, section 5 summarises our results and gives an overview of problems remaining to be solved.
“Funktionsverbgefüge” diye nitelendirilen bir ad ve eylemden oluşan Almanca işlevsel ad-eylem kümeleri, hem sözdizimsel hem de anlamsal bakımdan farklı özellikler gösterir. Bu nedenle, yabancı dil olarak Almanca öğretiminde öğrenme güçlüklerine yol açan bu sözcük kümelerinin öğretim biçimi daha da önem kazanmıştır. Bu çalışmada, öncelikle Almanca işlevsel ad-eylem kümelerinin (Funktionsverbgefüge) sözdizimsel ve anlamsal özellikleri ve buna bağlı olarak onların öğretim biçimi konulaştırılmaktadır. Bilişsel ve eklektik yöntem ilkeleri temel alınarak bu sözcük kümelerinin metin bağlamında sırasıyla tanıma, anlama, dizgeleştirme ve etkin kullanma biçiminde aktarılmasına ilişkin öneriler sunulmaktadır.
This demo abstract describes the SmartWeb Ontology-based Information Extraction System (SOBIE). A key feature of SOBIE is that all information is extracted and stored with respect to the SmartWeb ontology. In this way, other components of the systems, which use the same ontology, can access this information in a straightforward way. We will show how information extracted by SOBIE is visualized within its original context, thus enhancing the browsing experience of the end user.
Wenn wie im Falle des Instituts für Angewandte Linguistik und Translatologie der Universität Leipzig eine mehr als zehnjährige Germanistische Institutspartnerschaft mit gleich zwei russischen Partnern – den Übersetzer-Fakultäten der Linguistischen Universitäten Moskau und Pjatigorsk – nunmehr ihren Abschluss findet, so bietet es sich natürlich an zu fragen, was die GIP-Langzeitkooperation beiden Seiten an messbaren wissenschaftlichen, wissenschaftsmethodischen und curricularen Ergebnissen, an „Zuwächsen“ im Sinne der Nachwuchsförderung, des Austauschs von Dozenten und Studierenden gebracht hat. Die Bilanz – von uns dargelegt im Jubiläumsband 52 der Dokumente & Materialien des Deutschen Akademischen Austausch Dienstes – kann sich durchaus sehen lassen und rechtfertigt nicht nur die aufgewandten Mittel, sondern auch die kontinuierliche Arbeit, den nachhaltigen Einsatz und die vielfältigen Initiativen der zahlreichen Beteiligten auf beiden Seiten.
U radu se opisuje fonologija, morfologija i leksik govora Jurkova Sela u Žumberku, doseljeničkoga čakavskoga ikavsko-ekavskoga govora. Iako nema zamjenicu ča u samostalnoj upotrebi, govor čuva većinu temeljnih čakavskih crta. Ipak, stabilnost je sustava u nekim elementima narušena – što pokazuje supostojanje određenih dubleta u prozodiji i morfologiji. Leksik, uz ostalo, karakterizira prisutnost većeg broja germanizama, na žumberačkome prostoru očekivanih.
U radu je ponuđena raščlamba stilskih i govorničkih figura u poeziji i u putopisima fra Ivana Franje Jukića, angažiranoga franjevačkoga pisca i borca za političku samostalnsot Bosne. Autor je utvrdio da Jukić u svoj književni izraz unosi elemente narodnih govora, što se posebno zapaža u uporabi pučkih fraza i kolokacija. S druge strane, izbor tzv. knjiških figura otkriva utjecaj franjevačke tradicije, posebno jezika starijih franjevačkih ljetopisa.
This paper is concerned with developing Joan Bybee's proposals regarding the nature of grammatical meaning and synthesizing them with Paul Hopper's concept of grammar as emergent. The basic question is this: How much of grammar may be modeled in terms of grammaticalization? In contradistinction to Heine, Claudi & Hünnemeyer (1991), who propose a fairly broad and unconstrained framework for grammaticalization, we try to present a fairly specific and constrained theory of grammaticalization in order to get a more precise idea of the potential and the problems of this approach. Thus, while Heine et al. (1991:25) expand – without discussion – the traditional notion of grammaticalization to the clause level, and even include non-segmental structure (such as word order), we will here adhere to a strictly 'element-bound' view of grammaticalization: where no grammaticalized element exists, there is no grammaticalization. Despite this fairly restricted concept of grammaticalization, we will attempt to corroborate the claim that essential aspects of grammar may be understood and modeled in terms of grammaticalization. The approach is essentially theoretical (practical applications will, hopefully, follow soon) and many issues are just mentioned and not discussed in detail. The paper presupposes a familiarity with the basic facts of grammaticalization and it does not present any new facts.
To reach even language users not acquainted to the use of grammars the Institut für Deutsche Sprache in Mannheim (Germany) looked for new way to handle grammatical problems. Instead of confronting users with abstractions frequent difficulties of German grammar are introduced in form of exemplary questions like „Which form should be used or preferred: Anfang dieses Jahre or Anfang diesen Jahres?” Looking through the long list of such questions even laymen may find solutions of grammatical problems they might not be able to formulate as such.
Japanese is often taken to be strictly head-final in its syntax. In our work on a broad-coverage, precision implemented HPSG for Japanese, we have found that while this is generally true, there are nonetheless a few minor exceptions to the broad trend. In this paper, we describe the grammar engineering project, present the exceptions we have found, and conclude that this kind of phenomenon motivates on the one hand the HPSG type hierarchical approach which allows for the statement of both broad generalizations and exceptions to those generalizations and on the other hand the usefulness of grammar engineering as a means of testing linguistic hypotheses.
U radu se preispituju uobičajena određenja homonimije i kriteriji razgraničenja homonimije od srodnih pojava. Homonimiji se pristupa kao praktičnomu leksikografskom problemu te se daju konkretni primjeri leksikografske obradbe homonimnih natuknica iz Školskog rječnika hrvatskog jezika koji se izrađuje u Institutu za hrvatski jezik i jezikoslovlje.
In the last decade, the Penn treebank has become the standard data set for evaluating parsers. The fact that most parsers are solely evaluated on this specific data set leaves the question unanswered how much these results depend on the annotation scheme of the treebank. In this paper, we will investigate the influence which different decisions in the annotation schemes of treebanks have on parsing. The investigation uses the comparison of similar treebanks of German, NEGRA and TüBa-D/Z, which are subsequently modified to allow a comparison of the differences. The results show that deleted unary nodes and a flat phrase structure have a negative influence on parsing quality while a flat clause structure has a positive influence.
The present article illustrates that the specific articulatory and aerodynamic requirements for voiced but not voiceless alveolar or dental stops can cause tongue tip retraction and tongue mid lowering and thus retroflexion of front coronals. This retroflexion is shown to have occurred diachronically in the three typologically unrelated languages Dhao (Malayo-Polynesian), Thulung (Sino-Tibetan), and Afar (East-Cushitic). In addition to the diachronic cases, we provide synchronic data for retroflexion from an articulatory study with four speakers of German, a language usually described as having alveolar stops. With these combined data we supply evidence that voiced retroflex stops (as the only retroflex segments in a language) did not necessarily emerge from implosives, as argued by Haudricourt (1950), Greenberg (1970), Bhat (1973), and Ohala (1983). Instead, we propose that the voiced front coronal plosive /d/ is generally articulated in a way that favours retroflexion, that is, with a smaller and more retracted place of articulation and a lower tongue and jaw position than /t/.
How to compare treebanks
(2008)
Recent years have seen an increasing interest in developing standards for linguistic annotation, with a focus on the interoperability of the resources. This effort, however, requires a profound knowledge of the advantages and disadvantages of linguistic annotation schemes in order to avoid importing the flaws and weaknesses of existing encoding schemes into the new standards. This paper addresses the question how to compare syntactically annotated corpora and gain insights into the usefulness of specific design decisions. We present an exhaustive evaluation of two German treebanks with crucially different encoding schemes. We evaluate three different parsers trained on the two treebanks and compare results using EVALB, the Leaf-Ancestor metric, and a dependency-based evaluation. Furthermore, we present TePaCoC, a new testsuite for the evaluation of parsers on complex German grammatical constructions. The testsuite provides a well thought-out error classification, which enables us to compare parser output for parsers trained on treebanks with different encoding schemes and provides interesting insights into the impact of treebank annotation schemes on specific constructions like PP attachment or non-constituent coordination.
Prema opisima u suvremenim hrvatskim gramatikama dalo bi se zaključiti da hrvatski koordinativne složenice ili ne poznaje ili da ih je toliko malo da ne traže opis. U članku se podsjeća da je u starijim gramatikama o njima bilo riječi, a da svojom suvremenom količinom i različitim ostvarajima (imeničke, pridjevske, priložne, sa spojnicima -o- i -0-) gramatički opis itekako zaslužuju. Pokazuje se zbog kojih se svojih odlika takve složenice mogu smatrati riječima, a ne spojevima riječi, sintagmama. Na primjeru jezika Anke Žagar pokazuje se da model koordinativnih složenica kao potencija može unutar poezije poprimiti i jezičnostvaralačke inačice.
Razmatra se mogućnost hrvatskoga posvojnog pridjeva da bude antecedent relativnoj zamjenici, mogućnost koja se u slavenskim jezicima sve više gubi, odnosno mjesto posvojnoga pridjeva u toj funkciji zauzima genitiv. Potvrdama se pokazuje da ta mogućnost u pisanome hrvatskome (još) postoji. Provedena anketa s izvornim govornicima pokazuje ipak da takve konstrukcije kao prihvatljive ovjerava tek manji dio suvremenih govornika. Analiziraju se tipološki neobična svojstva relativnih rečenica s posvojnim pridjevom kao antecedentom, osobito to da se u njima posvojni pridjev vlada kao padežni oblik imenice, a ne njezin derivat. Ključne riječi: posvojni pridjev, antecedent, relativna rečenica, genitiv, slavenski jezici
Hybrid robust deep and shallow semantic processing for creativity support in document production
(2004)
The research performed in the DeepThought project (http://www.project-deepthought.net) aims at demonstrating the potential of deep linguistic processing if added to existing shallow methods that ensure robustness. Classical information retrieval is extended by high precision concept indexing and relation detection. We use this approach to demonstrate the feasibility of three ambitious applications, one of which is a tool for creativity support in document production and collective brainstorming. This application is described in detail in this paper. Common to all three applications, and the basis for their development is a platform for integrated linguistic processing. This platform is based on a generic software architecture that combines multiple NLP components and on robust minimal recursive semantics (RMRS) as a uniform representation language.
Auf der Grundlage eines deutsch-türkischen Sprachvergleichs werden in diesem Beitrag die Idiomatischen Verwendungsweisen der Wortgruppe mit "nehmen" dargestellt und ihre Äquivalenzen im Türkischen ermittelt. Dabei geht es vor allem darum, übersetzungsdidaktisch relevante Probleme und Möglichkeiten herauszuarbeiten.
Iločki jezikoslovni razmišljaji : (Marko Samardžija: Devet iločkih priopćenja i jedno warszawsko)
(2010)
U radu se analizira jezik opisan u Della Bellinoj gramatici u odnosu na jezik jednoga od književnih djela koja su mu bila uzorom – Suze sina razmetnoga Ivana Gundulića. Istraživanje je usmjereno na imeničke oblike u obama djelima. Sličnosti i razlike komentiraju se za svaku deklinaciju posebno, i to sustavno za svaki padež. Budući da su neke uvjetovane formom Della Bellina književnoga predloška, pritom se upozorava na stvarne i prividne razlike.
While the sortal constraints associated with Japanese numeral classifiers are wellstudied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broadcoverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
While the sortal constraints associated with Japanese numeral classifiers are well-studied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broad-coverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
J. Melvinger u radu o supstandardnome prijedložnom infinitivu (1982.) ne spominje mogućnost infinitivne kondenzacije posljedičnih ustrojstava, ni prijedložnog ni besprijedložnog infinitiva, iako donosi primjere u kojima je riječ o infinitivnoj prijedložnoj konstrukciji koja je priložna oznaka posljedice, a ne priložna oznaka načina, kako ona tvrdi: Kožnata jakna smiješna, a šal oko vrata škaklja za poludjeti. Tu mogućnost ne spominje ni u svojoj disertaciji (iako navodi primjere koje mi razumijevamo kao posljedične konstrukcije), a ne navodi je ni M. Ivić.
Das vorliegende Arbeitspapier ist das Skript einer Vorlesung, die ich während des Wintersemesters 1986/87 am Institut für Sprachwissenschaft der Universität zu Köln gehalten habe. […] Das Arbeitspapier gliedert sich in zwei Teile. Im ersten Teil, Kapitel 1 - 4, werden die bei der Untersuchung und Beschreibung einer Sprache auftretenden soziolinguistischen Probleme besprochen, während im zweiten Teil, Kapitel 5 - 11, behandelt wird, wie eine Grammatik geschrieben werden sollte. Es geht dabei also nicht um die grammatische Analyse sprachlicher Daten, sondern um die Darstellung einer Sprache, d.h. um die schriftstellerische Aufgabe des Linguisten, des Grammatikers im eigentlichen Sinn.
Intimität und Geschlecht : zur Syntax und Pragmatik der Anrede im Liebesbrief des 20. Jahrhunderts
(2000)
Die Trennung der Lebenswelt in Privatsphäre und Öffentlichkeit käme der Verortung von Intimität entgegen. Es scheint aber, als ob Intimität nicht einem klar abgegrenzten Bereich zugeordnet werden kann, sondern nunmehr als relationale Kategorie zu fassen ist. Gerade der historische Vergleich (Vgl. CORBIN 1992) erlaubt weder einheitlich räumliche oder körperliche noch ästhetische Kriterien zur Abgrenzung von Intimität. ...
Those principles of Naturalness as postulated by Mayerthaler (1981) claim to make predtictions about the direction of language change possible. It is true that the majority of morphological changes can be accounted for by these principles. However, systematic violations of these rules can be found in of all things, some of most frequent, elementary verbs such as HAVE, BE, BECOME, COME, GO, GIVE, TAKE, etc. Their irregularities cannot be accounted for solely - as Naturalness Theory would have it - by conflicts between phonological and morphological Naturalness. Rather, they have been systematically built up through other efficient strategies. This "regularity of irregularity" is the focus of this paper, which demonstrates several particularly well-beaten paths to irregularization through contrastive diachronic investigations of frequent verbs in different Germanic languages. lrregularity, a term laden with negative connotations, is substituted by the term differentiation, which names the actual function directly. Because differentiation typically correlates with word brevity, this constellation should be considered an ideal compromise between hearer and speaker interests. A further question to be addressed is which individual categories are expressed through irregularization. It is concluded that this process is guided by token frequency and degree of relevance.
This paper presents a comparative study of probabilistic treebank parsing of German, using the Negra and TüBa-D/Z treebanks. Experiments with the Stanford parser, which uses a factored PCFG and dependency model, show that, contrary to previous claims for other parsers, lexicalization of PCFG models boosts parsing performance for both treebanks. The experiments also show that there is a big difference in parsing performance, when trained on the Negra and on the TüBa-D/Z treebanks. Parser performance for the models trained on TüBa-D/Z are comparable to parsing results for English with the Stanford parser, when trained on the Penn treebank. This comparison at least suggests that German is not harder to parse than its West-Germanic neighbor language English.
In this text, we describe the development of a broad coverage grammar for Japanese that has been built for and used in different application contexts. The grammar is based on work done in the Verbmobil project (Siegel 2000) on machine translation of spoken dialogues in the domain of travel planning. The second application for JACY was the automatic email response task. Grammar development was described in Oepen et al. (2002a). Third, it was applied to the task of understanding material on mobile phones available on the internet, while embedded in the project DeepThought (Callmeier et al. 2004, Uszkoreit et al. 2004). Currently, it is being used for treebanking and ontology extraction from dictionary definition sentences by the Japanese company NTT (Bond et al. 2004).
Jagić o Maretiću
(2008)
We present a solution for the representation of Japanese honorifical information in the HPSG framework. Basically, there are three dimensions of honorification. We show that a treatment is necessary that involves both the syntactic and the contextual level of information. The japanese grammar is part of a machine translation system.
A comprehensive investigation of Japanese particle was missing up to now. General implications were set up without the fact that a comprehensive analysis was carried out. [...] We offer a lexicalist treatment of the problem. Instead of assuming different phrase structure rules we state a type hierarchy of Japanese particles. This makes a uniform treatment of phrase structure as well as a differentiation of subcategorization patterns possible.
U radu se nastoje prikazati i kontekstualizirati Dujmušićevi jezikoslovni doprinosi očuvanju hrvatskoga standardnog jezika. Puristički se radovi, među kojima je najzanimljiviji i najopširniji “Antibarbarus hrvatskoga jezika”, klasificiraju prema Thomasovoj (1991) kategorizaciji purističke djelatnosti. Dujmušić je zanimljiv ne samo zbog izazovne, potpune anonimnosti, nego i zbog toga što je u vrijeme opozicije između vukovskoga i antivukovskoga purizma bio na onoj slabijoj, antivukovskoj strani. Znanstvena je recepcija Dujmušićeva rada potpuno izostala, što znači da njegove jezikoslovne prinose valja i prikazati i evaluirati.
Predmet ovog rada su kajkavizmi u Tkonskom zborniku – glagoljskom rukopisu koji je početkom 16. stoljeća pisan na frankopanskim posjedima. Utvrđeno je da su u tom rukopisu prisutni kajkavizmi na svim razinama: fonološkoj, morfološkoj, leksičkoj i sintaktičkoj. Najviše je kajkavizama na leksičkoj razini, a oni se mogu podijeliti u dvije skupine: 1. zajednički čakavsko- kajkavski sloj, npr. betegь, gdo, nigdar, hiniti, hud, kaštigati, lotar itd.; 2. kajkavski sloj, npr. fajtati, gorup, nekoteri, pokrivača, škoda, špotati, tanac itd. Prva je kategorija leksema interpolirana u gotovo svim dijelovima CTk, a druga je najčešća u Cvetu od kreposti i Muci. Tkonski zbornik čuva jedno ogromno leksičko bogatstvo, a pri usporedbi pojedinih leksema s onima u hrvatskoglagoljskim misalima i brevijarima, zaključeno je da su neki od njih potvrđeni i ranije, npr. betegь, kaštigati, praviti, gorup, tanac itd. To je potvrda o kontinuitetu hrvatskoglagoljske književnosti. Interpolacija kajkavizama nije ujednačena u svim dijelovima zbornika, kajkavske su intervencije najčešće u Cvetu od kreposti (f. 67 – 85) i u Muci Spasitelja našega (f. 109 – 161). Na temelju provedenog istraživanja može se zaključiti da je Tkonski zbornik rukopis sastavljen iz različitih dijelova, koji nisu nastali u istom razdoblju, ni na istom mjestu. Budući da kajkavizme u pojedinim dijelovima nalazimo na svim razinama (Cvet od kreposti i Muka), može se pretpostaviti da su oni nastali u sjevernom području, tj. bliže kajkavskom.
U radu se iznosi pokušaj razvrstavanja glagola s elementom se u valencijskome rječniku hrvatskih glagola. Kao predložak poslužila je obrada iste vrste glagola u češkome elektroničkom valencijskom rječniku VALLEX, kao i prototipno-kontekstualna analiza povratnih glagola Branimira Belaja. Glagoli se razvrstavaju na temelju gramatičkih i semantičkih kriterija.
Ključne riječi: valencija; valencijski rječnik; glagoli sa se; element se kao čestica i kao zamjenica
Um dicionário contribui para a permanência e a padronização duma língua. O desenvolvimento das línguas moçambicanas serve para enriquecer e fortalecer esta nação. Alem disso, facilita a transição do povo para a aprendizagem da língua portuguesa. A ortografia usada neste dicionário segue as recomendações de NELIMO, o Núcleo de Estudo de Línguas Moçambicanas. A única excepção é o uso da letra j para o som ‘dj’ ou ‘tj’: NELIMO recomenda que seja escrita com c. Estamos abertos para receber quaisquer sugestões que eventualmente surgirem pela parte dos prezados leitores.
Die vorliegende Arbeit soll sich mit dem „Zusammenziehen von Wörtern“ beschäftigen, das als typisch für die „Pottsprache“ […] angesehen wird. Dieses Zusammenziehen soll innerhalb der Klitisierungsforschung anhand zweier Fälle untersucht werden. Zum einen sollen reduzierte Formen der Pronomina und zum anderen reduzierte Artikelformen, nämlich die des bestimmten und des unbestimmten Artikels, als Untersuchungsgegenstand dienen. Dieses soll auf einer empirischen Basis, dass heißt auf der Basis von erhobenen und analysierten Sprachdaten, geschehen. Der erste Schritt soll dabei eine Darstellung der hier behandelten Sprachvarietät sein. […] Der zweite Schritt besteht in einer Darstellung der Theorie der Klitisierung […] Nachdem der Hintergrund dieser Arbeit dargestellt worden ist, folgt die eigentliche Analyse. Zunächst wird die Klitisierung von Pronomina untersucht […], dann die von Artikelformen […]. Beide Phänomene werden nacheinander auf ihre Eigenschaften hin untersucht, um dann zum Schluss zu einer Hypothese aus der bisherigen Forschung, nämlich die der flektierten Präpositionen, Stellung zu beziehen […]. Abschließend soll versucht werden die Ergebnisse dieser Arbeit in den Forschungsstand bei der Erforschung von Klitisierung auf der einen Seite und der Varietät Ruhrdeutsch auf der anderen Seite einzuordnen […].
Književnojezična norma franjevačkih pisaca 18. St. : sastavnica jezičnostandardizacijskih procesa
(2007)
Važnom sastavnicom hrvatskoga predstandardnoga jezika smatra se koine franjevačke književnosti 18. st. Izrasla iz pisane prakse bosanskih franjevaca 17. st., obogaćena u jeziku hrvatskih franjevaca izraznim sredstvima pučkeknjiževnosti, već je u 18. st. pokazivala obilježja standardiziranosti: polifunkcinonalnost, preskriptivnost i neovisnost o organskim idiomima. Koine je opisana u franjevačkim gramatikama, što je naznaka normativnih tendencija.
U radu će biti riječi o imenicama koje označuju mjeru i koje se redovito pojavljuju u akuzativu iako bi sintaktički na tome mjestu trebao doći koji drugi oblik. Učestalom upotrebom u akuzativnome obliku te imenice gube svoje osnovno morfološko obilježje – promjenjivost, a time i svoju nedvojbenu pripadnost imenicama kao vrsti riječi i nameću pitanje kako ih obraditi u rječniku.
Ich werde zunächst auf neuere Theorien zur Abgrenzung von Komposition und Derivation eingehen, um – darauf aufbauend –einen eigenen Lösungsvorschlag anhand von Sprachdaten auszuarbeiten. Dabei werde ich mich nicht auf das Deutsche beschränken, sondern ein Modell skizzieren, das auch eine gewisse übereinzelsprachliche Gültigkeit besitzt . Das Sprachmaterial entstammt allerdings in erster Linie indogermanischen Sprachen, da sich hier das Problem besonders augenfällig stellt. Es wäre jedoch interessant, das vorgestellte Modell an einer größeren Zahl von Sprachtypen zu überprüfen (und entsprechend zu modifizieren). In einem dritten Abschnitt schließlich möchte ich versuchen, die beobachteten Phänomene (und somit mein Modell) ansatzweise in einen Erklärungszusammenhang zu bringen. Das Hauptgewicht soll jedoch auf die Beschreibung der Phänomene selbst, d. h. den zweiten Teil meiner Ausführungen gelegt werden.
Die vorliegende Arbeit ist eine kritische Auseinandersetzung mit dem Hofstedeschen Ansatz. Dabei soll in erster Linie das Werk von Hofstede selbst einer wissenschaftstheoretisch-methodologischen Prüfung unterzogen werden. Bei sehr populären Standardansätzen, die sowohl in der Praxis einen großen Anklang finden als auch in der wissenschaftlichen Gemeinschaft ständig rezipiert und weiterentwickelt werden, bleibt es natürlich nicht aus, dass durch Vereinfachungen oder Uminterpretationen in der Literatur Inkonsistenzen entstehen, die so im Originalwerk nicht enthalten sind. In dieser Arbeit soll es im Wesentlichen nicht um solche Probleme der Hofstedeschen Rezeption gehen. Vielmehr werde ich die Argumentation von Hofstede selbst in seinen eigenen Schriften […] einer detaillierten kritischen Analyse zu unterziehen, um auf diese Weise zu prüfen, ob bestimmte gravierende Probleme schon im Originalwerk angelegt sind.
Politeness has become a key qualification in intercultural competence and didactics. The paper presents parts of an empirical research of the development and shaping of verbal politeness in critical incidents investigating the way German and Turkish students of the German language deal with criticism and complimenting. The findings show that Turkish students of German as a foreign language avoid direct criticism and prefer manners considered to be polite in German. Complimenting is an expression of their own positive feelings and acts as “messages about oneself”, whereas the German students prefer “meritorious praise” referring to merits. The discriminating effects of migration within the Turkish students are smaller than expected perhaps because of the increase of transcultural knowledge. This should give new ideas for the didactics of politeness.
U ovome radu analizira se dio korpusa hrvatskih i ruskih frazema s kulinarskim elementima kao komponentаma i onih koji u svom semantičkom talogu imaju sliku povezanu s jelom. Cilj rada je prikazati simbolički, metaforički i konotativni potencijal hrane kao frazeološke komponente putem analize načina izgradnje frazeološkog značenja, te istaknuti najočitije sličnosti i najzanimljivije razlike između ovakvog tipa frazeologije u hrvatskom i ruskom jeziku.
Extremely short verbs can be found in various Germanic languages and dialects; the roots of these verbs do not have a final consonant «C)-C-V), and they always have a monosyllabic infinitive and usually monosyllabic finite forms as well. Examples for these kinds of short verbs are Swiss German hä'to have', gä 'to go', gifii 'to give', nifif 'to take' which correspond to the Swedish verbs ha, ga, ge and ta. The last example shows that such shore verb formations also occur with verbs which do not share the same etymology. Apart from shortness, short verbs are characterized by a high degree of irregularity, often even by suppletion, which sometimes develops against sound laws. Furthermore they are among the most used verbs and often tend to grammaticalization. The present paper compares the short verbs of seven Germanic languages; in addition, it describes their various ways of development and strategies of differentiation. Moreover, it exarnines the question of why some languages and dialects (e.g., Swiss German, Frisian, Swedish, Norwegian) have many shore verbs while others (New High German, Icelandic, Faroese) do not. Finally, the paper discusses the contribution of shore verbs to questions concerning linguistic change and the morphological organization of languages.
The medium of (oral) language is mostly disregarded (or overlooked) in contemporary media theories. This "ignoring of language" in media studies is often accompanied by an inadequate transport model of communication, and it converges with an "ignoring of mediality" in mentalistic theories of language. In the present article it will be argued that this misleading opposition of language and media can only be overcome if one already regards oral language, not just written language, as a medium of the human mind. In my argumentation I fall back on Wittgenstein’s conception of language games to try to show how Wittgenstein’s ideas can help us to clear up the problem of the mediality of language and also to show to what extent the mentalistic conception of Chomskyan provenance cannot be adequate to the phenomenon of language.
Iako se prevedenicama aktiviraju vlastite izražajne mogućnosti jezika, one su također predmet purističkih reakcija. Cilj je rada analizirati latentni utjecaj engleskoga jezika na različite jezične razine kao pojavu koja je prisutna u hrvatskome i u drugim europskim jezicima. Primjeri pokazuju da se radi o rasprostranjenoj pojavi koja proizlazi iz doslovnoga i nemarnoga prijevoda, nepoznavanja norme vlastitoga jezika i pomodnoga slijeda engleske jezične norme.
In syntax, the trend nowadays is towards lexicalized grammar formalisms. It is now widely accepted that dividing words into wordclasses may serve as a laborsaving mechanism - but at the same time, it discards all detailed information on the idiosyncratic behavior of words. And that is exactly the type of information that may be necessary in order to parse a sentence. For learning approaches, however, lexicalized grammars represent a challenge for the very reason that they include so much detailed and specific information, which is difficult to learn. This paper will present an algorithm for learning a link grammar of German. The problem of data sparseness is tackled by using all the available information from partial parses as well as from an existing grammar fragment and a tagger. This is a report about work in progress so there are no representative results available yet.
Leksikografska obradba polisemnih naziva (na primjeru naziva društvenih znanstvenih disciplina)
(2009)
U radu se razmatra problem obradbe polisemnih naziva u općim i terminološkim rječnicima u hrvatskom jeziku na primjeru nazivlja društvenih znanstvenih disciplina. Nazivi iz rječničkoga korpusa (terminoloških i općih rječnika) uspoređuju se s potvrdama naziva u publicističkome funkcionalnom stilu (korpus Hrvatske jezične riznice), a posebno se analiziraju primjeri determinologizacije u publicističkome funkcionalnom stilu. Provedenom je analizom potvrđena sustavnost obradbe polisemnih naziva u općim i terminološkim rječnicima, ali su potvrđene i određene pogreške u obradbi naziva u općim rječnicima (netočno definirana značenja ili izostavljena pojedina česta značenja). U skladu s provedenom analizom daje se i prijedlog obradbe naziva u općem i terminološkom rječniku te se izdvajaju načela koja su bitna pri strukturiranju definicija u općim i terminološkim rječnicima u hrvatskome jeziku.
U radu se prikazuje i analizira leksikografski status brojevnih riječi u Rječniku hrvatskoga kajkavskoga književnog jezika. Prilaže se popis brojevnih riječi obrađenih u rječniku, utvrđuje se u kojoj su mjeri u rječničkome članku zastupljeni elementi gramatičkoga opisa i navode li se oni dosljedno. Analiziraju se elementi definicije brojevnih riječi i njezina koherentnost.
Leksičke funkcije kao pokazatelji značenjskih odnosa u kolokacijskim svezama hrvatskoga jezika
(2008)
U radu se na primjerima kolokacijskih sveza hrvatskoga jezika analiziraju leksičke funkcije koje je unutar svojega teorijskoga modela značenje – tekst razradio Igor Meljčuk. Na uzorku od desetak leksičkih funkcija koje predstavljaju opće značenjske modele primjenjive u svim jezicima opisuju se značenjski odnosi na osnovi kojih nastaju kolokacijske sveze, dakle, jezične jedinice koje po svojoj strukturi nadilaze razinu jedne riječi, tj. leksema.
The article addresses the growing importance of corpus-based research in the field of German foreign language acquisition. German corpora in general and learner corpora in particular are briefly introduced. A short overview of existing German learner corpora is followed by a detailed description of the error-annotated learner corpus Falko, a learner corpus of advanced learner German, which is accessible via internet (without any prior registration) and free of charge. Finally, a short example analysis demonstrates some of the functionalities of Falko. The aim of the article is to encourage researchers to employ corpora as helpful tools in their own work.
Die Geschichte der Beziehungen zwischen Literaturwissenschaft und Linguistik im Rahmen der Germanistik in den letzten 50 Jahren ist durchaus wechselvoll: einer zunehmenden Abkühlung, ja Entfremdung auf der einen Seite steht auf der anderen das wachsende Interesse an gemeinsam fruchtbar zu beackernden Arbeitsfeldern gegenüber. Ein Streifzug durch die Jahrgänge der Siegener Zeitschrift für Literaturwissenschaft und Linguistik (LiLi) seit den frühen 70er Jahren gibt davon ebenso Zeugnis wie aktuelle Projekte kritischer Kooperation (Kasten/Neuland/Schönert 1997, Hoffmann/Kessler Hrsg. 2003) oder der Versuch einer wissenschaftsgeschichtlichen Aufarbeitung des Verhältnisses der beiden Fächer durch das Marbacher Literaturarchiv (Haß/König Hrsg. 2003). Im folgenden Beitrag wird ein kurzer Blick auf die diesbezügliche Situation in der Schweiz geworfen und ein konzeptueller Zugriff auf mögliche Berührungspunkte exemplarisch skizziert.
The special issue of The Linguistic Review on "The Role of Linguistics in Cognitive Science" presents a variety of viewpoints that complement or contrast with the perspective offered in Foundations of Language (Jackendoff 2002a). The present article is a response to the special issue. It discusses what it would mean to integrate linguistics into cognitive science, then shows how the parallel architecture proposed in Foundations seeks to accomplish this goal by altering certain fundamental assumptions of generative grammar. It defends this approach against criticisms both from mainstream generative grammar and from a variety of broader attacks on the generative enterprise, and it reflects on the nature of Universal Grammar. It then shows how the parallel architecture applies directly to processing and defends this construal against various critiques. Finally, it contrasts views in the special issue with that of Foundations with respect to what is unique about language among cognitive capacities, and it conjectures about the course of the evolution of the language faculty.
Die Fachsprachen existieren nicht als "selbständiges Sprachsystem" mit eigener grammatischer, Struktur und eigenem Wortschatz. Sie stellen nur Teile des Gesamtsystems der jeweiligen Nationalsprache dar, die häufig als Gemeinsprache bezeichnet wird. Die Fachsprachen sind vielmehr "durch Differenzierung und Erweiterung aus der Gemeinsprache" hervorgegangen, wobei die Gemeinsprache "die lexikalische Basis und das grammatische Gerüst für die Fachsprachen liefert". In diesem Sinne sind sie in erster Linie durch einen spezifischen Fachwortschatz und spezifische Verwendung gemeinsprachlicher grammatischer, morphologischer sowie lexikalischer Mittel oder die Häufigkeit bestimmter syntaktischer Strukturen und bestimmter Wortbildungstypen gekennzeichnet. […] Eine Fachsprache läßt sich sowohl von anderen Fachsprachen abgrenzen, als auch in sich differenzieren, weil sie auf verschiedenen kommunikativ-funktionellen Ebenen völlig unterschiedliche Besonderheiten und Funktionsstile besitzt. Bei der Fachabgrenzung zeigen sich große Schwierigkeiten, weil durch die Fortentwicklung der Wissenschaft ständig neue Fachgebiete entstehen, die verschiedene Disziplinen übergreifen und die gleichzeitig weiter untergliedert werden müssen. Trotz alledem könnten die Unterschiede zwischen den einzelnen Fachsprachen darin bestehen, daß jede Fachsprache ihre eigenen Merkmale besitzt und die allgemeinen fachsprachlichen Eigenschaften nicht in gleichem Maße darstellt. […] Die Fachsprachen können unter verschiedenen bzw. kommunikativen, funktionellen, pragmatischen, stilistischen, fach- oder textbezogenen Gesichtspunkten betrachtet werden. Und daher werden sie unterschiedlich beschrieben. In diesem Sinne gibt es keine einheitliche Fachsprache. Jeder Fachbereich verfügt über seine eigene Fachsprache und damit über seine eigene Fachterminologie.
A lot of interest has recently been paid to constraint-based definitions and extensions of Tree Adjoining Grammars (TAG). Examples are the so-called quasi-trees, D-Tree Grammars and Tree Description Grammars. The latter are grammars consisting of a set of formulars denoting trees. TDGs are derivation based where in each derivation step a conjunction is built of the old formular, a formular of the grammar and additional equivalences between node names of the two formulars. This formalism is more powerfull than TAGs. TDGs offer the advantages of MC-TAG and D-Tree Grammars for natural languages and they allow underspecification. However the problem is that TDGs might be unnecessarily powerfull for natural languages. To solve this problem, in this paper, I will propose a local TDGs, a restricted version of TDGs. Local TDGs still have the advantages of TDGs but they are semilinear and therefore more appropriate for natural languages. First, the notion of the semilinearity is defined. Then local TDGs are introduced, and, finally, semilinearity of local Tree Description Languages is proven.
LTAG semantics for questions
(2004)
This papers presents a compositional semantic analysis of interrogatives clauses in LTAG (Lexicalized Tree Adjoining Grammar) that captures the scopal properties of wh- and nonwh-quantificational elements. It is shown that the present approach derives the correct semantics for examples claimed to be problematic for LTAG semantic approaches based on the derivation tree. The paper further provides an LTAG semantics for embedded interrogatives.
This paper sets up a framework for LTAG (Lexicalized Tree Adjoining Grammar) semantics that brings together ideas from different recent approaches addressing some shortcomings of TAG semantics based on the derivation tree. Within this framework, several sample analyses are proposed, and it is shown that the framework allows to analyze data that have been claimed to be problematic for derivation tree based LTAG semantics approaches.