Zur Problematik der primären Verarbeitung von elektronischen Datenmengen
Some thoughts on the primary extraction of electronic data
- Das Thema des Beitrags ist praktisch orientiert und knüpft an das VEGA Projekt „Verbale Kollokationen im Deutschen und Slowakischen“ unter der Leitung von Prof. Peter Ďurčo am Institut für Germanistik der Universität der hl. Kyrill und Method in Trnava an. Das Projekt setzt sich zum Ziel, verbale Kollokationen zu analysieren und zu beschreiben. Es setzt also voraus, dass man die Kollokabilität der sprachlichen Mittel definieren und messen kann...
- The contribution deals with the primary extraction of electronic data. For a project that aims at describing verbal collocations, a number of methods are presented which help to find (lexically speaking) the ideal number of verbs for further analysis. The initial concept of the project assumes that collocability can be clearly defined and measured. A first list of verbs has been generated, based on the frequency of verbs in a corpus. Generally, frequency is the first and most commonly used criterion when it comes to creating a basis to start research from. This raises the question whether all the verbs which are extracted based on frequency really are part of the core vocabulary. Consequently this contribution suggests the following additional criteria to extract data, which are the semantic and the instructive criterion. Applying these criteria helps to outline a lexicography of the core vocabulary, which then allows the specific collocation profiles of the extracted verbs to be analysed further in the context of this project.
Author: | Monika BanášováORCiDGND |
---|---|
URN: | urn:nbn:de:hebis:30:3-390090 |
ISSN: | 1338-0796 |
Parent Title (German): | Slowakische Zeitschrift für Germanistik |
Publisher: | Verband der Deutschlehrer und Germanisten der Slowakei |
Place of publication: | Bratislava |
Document Type: | Article |
Language: | German |
Date of Publication (online): | 2016/12/19 |
Year of first Publication: | 2014 |
Publishing Institution: | Universitätsbibliothek Johann Christian Senckenberg |
Release Date: | 2016/12/19 |
Tag: | corpus analysis; corpus linguistic; frequency |
Volume: | 6 |
Issue: | 2 |
Page Number: | 6 |
First Page: | 79 |
Last Page: | 84 |
HeBIS-PPN: | 415268133 |
Dewey Decimal Classification: | 8 Literatur / 83 Deutsche und verwandte Literaturen / 830 Literaturen germanischer Sprachen; Deutsche Literatur |
Sammlungen: | Germanistik / GiNDok |
Zeitschriften / Jahresberichte: | Slowakische Zeitschrift für Germanistik / Slowakische Zeitschrift für Germanistik 6.2014,2 |
: | urn:nbn:de:hebis:30:3-390158 |
Licence (German): | ![]() |