TY - JOUR A1 - Banášová, Monika T1 - Zur Problematik der primären Verarbeitung von elektronischen Datenmengen T1 - Some thoughts on the primary extraction of electronic data T2 - Slowakische Zeitschrift für Germanistik N2 - Das Thema des Beitrags ist praktisch orientiert und knüpft an das VEGA Projekt „Verbale Kollokationen im Deutschen und Slowakischen“ unter der Leitung von Prof. Peter Ďurčo am Institut für Germanistik der Universität der hl. Kyrill und Method in Trnava an. Das Projekt setzt sich zum Ziel, verbale Kollokationen zu analysieren und zu beschreiben. Es setzt also voraus, dass man die Kollokabilität der sprachlichen Mittel definieren und messen kann... N2 - The contribution deals with the primary extraction of electronic data. For a project that aims at describing verbal collocations, a number of methods are presented which help to find (lexically speaking) the ideal number of verbs for further analysis. The initial concept of the project assumes that collocability can be clearly defined and measured. A first list of verbs has been generated, based on the frequency of verbs in a corpus. Generally, frequency is the first and most commonly used criterion when it comes to creating a basis to start research from. This raises the question whether all the verbs which are extracted based on frequency really are part of the core vocabulary. Consequently this contribution suggests the following additional criteria to extract data, which are the semantic and the instructive criterion. Applying these criteria helps to outline a lexicography of the core vocabulary, which then allows the specific collocation profiles of the extracted verbs to be analysed further in the context of this project. KW - corpus linguistic KW - corpus analysis KW - frequency Y1 - 2016 UR - http://publikationen.ub.uni-frankfurt.de/frontdoor/index/index/docId/39009 UR - https://nbn-resolving.org/urn:nbn:de:hebis:30:3-390090 SN - 1338-0796 VL - 6 IS - 2 SP - 79 EP - 84 PB - Verband der Deutschlehrer und Germanisten der Slowakei CY - Bratislava ER -