The constantly growing amount of Web content and the success of the Social Web lead to increasing needs for Web archiving. These needs go beyond the pure preservation of Web pages. Web archives are turning into “community memories” that aim at building a better understanding of the public view on, e.g., celebrities, court decisions and other events. Due to the size of the Web, the traditional “collect-all” strategy is in many cases not the best method to build Web archives. In this paper, we present the ARCOMEM (From Collect-All Archives to Community Memories) architecture and implementation that uses semantic information, such as entities, topics and events, complemented with information from the Social Web to guide a novel Web crawler. The resulting archives are automatically enriched with semantic meta-information to ease the access and allow retrieval based on conditions that involve high-level concepts. (Future Internet 2014, 6, 689.)
The web and the social web play an increasingly important role as an information source for Members of Parliament and their assistants, journalists, political analysts and researchers. They provide crucial background information, such as reactions to political events and comments made by the general public. The case study presented in this paper is driven by two European parliaments (the Greek and the Austrian parliament) and targets an effective exploration of political web archives. In this paper, we describe the semantic technologies deployed to ease the exploration of archived web and social web content, and we present evaluation results.