Terminology evolution in web archiving: Open issues

  • The correspondence between the terminology used for querying and the one used in content objects to be retrieved, is a crucial prerequisite for effective retrieval technology. However, as terminology is evolving over time, a growing gap opens up between older documents in (long-term) archives and the active language used for querying such archives. Thus, technologies for detecting and systematically handling terminology evolution are required to ensure "semantic" accessibility of (Web) archive content on the long run. As a starting point for dealing with terminology evolution this paper formalizes the problem and discusses issues, first ideas and relevant technologies.

Download full text files

Export metadata

Metadaten
Author:Nina Tahmasebi, Tereza Iofciu, Thomas RisseORCiDGND, Claudia Niederée, Wolf Siberski
URN:urn:nbn:de:hebis:30:3-550512
URL:http://www.l3s.de/~risse/pub/iwaw2008.pdf
Parent Title (English):Proceedings of the 8th International Web Archiving Workshop in conjunction with ECDL 2008, Aarhus, Denmark, September 2008
Publisher:International Web Archiving Workshop
Place of publication:Aarhus, Denmark
Document Type:Conference Proceeding
Language:English
Year of Completion:2008
Date of first Publication:2008/08/22
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2020/07/22
Tag:Information Extraction; Semantics; Terminology Evolution; Web Archives
Volume:2008
Page Number:6
Note:
This work is licence under a Attribution- NonCommercial -NoDerivs 2.0 France Creative Commons Licence.
HeBIS-PPN:467628270
Institutes:Zentrale Einrichtung / Universitätsbibliothek
CCS-Classification:H. Information Systems / H.3 INFORMATION STORAGE AND RETRIEVAL / H.3.1 Content Analysis and Indexing / Linguistic processing
H. Information Systems / H.3 INFORMATION STORAGE AND RETRIEVAL / H.3.6 Library Automation / Large text archives
Dewey Decimal Classification:0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik
Sammlungen:Universitätspublikationen
Licence (German):License LogoCreative Commons - Namensnennung, Nicht kommerziell, Keine Bearbeitung 2.0