TY - CONF A1 - Kasperek, Gerwin A1 - Abrami, Giuseppe A1 - Driller, Christine A1 - Lücking, Andy A1 - Mehler, Alexander A1 - Martínez-Muñoz, Carlos Alberto A1 - Pachzelt, Adrian T1 - Application of BIOfid tools for extracting data from biodiversity literature T2 - SPNHC 2022, Edinburgh 5th-10th June 2022 N2 - In an ideal world, extraction of machine-readable data and knowledge from natural-language biodiversity literature would be done automatically, but not so currently. The BIOfid project has developed some tools that can help with important parts of this highly demanding task, while certain parts of the workflow cannot be automated yet. BIOfid focuses on the 20th century legacy literature, a large part of which is only available in printed form. In this workshop, we will present the current state of the art in mobilisation of data from our corpus, as well as some challenges ahead of us. Together with the participants, we will exercise or explain the following tasks (some of which can be performed by the participants themselves, while other tasks currently require execution by our specialists with special equipment): Preparation of text files as an input; pre-processing with TextImager/TextAnnotator; semiautomated annotation and linking of named entities; generation of output in various formats; evaluation of the output. The workshop will also provide an outlook for further developments regarding extraction of statements from natural-language literature, with the long-term aim to produce machine-readable data from literature that can extend biodiversity databases and knowledge graphs. Y1 - 2022 UR - http://publikationen.ub.uni-frankfurt.de/frontdoor/index/index/docId/69677 UR - https://nbn-resolving.org/urn:nbn:de:hebis:30:3-696777 CY - Frankfurt am Main ER -