Refine
Document Type
- Article (3)
- Conference Proceeding (1)
Language
- English (4)
Has Fulltext
- yes (4)
Is part of the Bibliography
- no (4)
Keywords
- web archiving (2)
- architecture (1)
- enrichment (1)
- entity and event extraction (1)
- parliament libraries (1)
- semantic content analysis (1)
- social Web (1)
- text analysis (1)
- topic detection (1)
- web crawler (1)
Institute
- Universitätsbibliothek (3)
- Physik (1)
The constantly growing amount of Web content and the success of the SocialWeb lead to increasing needs for Web archiving. These needs go beyond the pure preservationo of Web pages. Web archives are turning into “community memories” that aim at building a better understanding of the public view on, e.g., celebrities, court decisions and other events. Due to the size of the Web, the traditional “collect-all” strategy is in many cases not the best method to build Web archives. In this paper, we present the ARCOMEM (From Future Internet 2014, 6 689 Collect-All Archives to Community Memories) architecture and implementation that uses semantic information, such as entities, topics and events, complemented with information from the Social Web to guide a novel Web crawler. The resulting archives are automatically enriched with semantic meta-information to ease the access and allow retrieval based on conditions that involve high-level concepts.
Web archives created by the Internet Archive (IA) (https://archive.org), national libraries and other archiving services contain large amounts of information collected for a time period of over twenty years. These archives constitute a valuable source for research in many disciplines, including the digital humanities and the historical sciences by offering a unique possibility to look into past events and their representation on the Web.
Most Web archive services aim to capture the entire Web (IA) or national top-level domains and are therefore broad in their scope, diverse regarding the topics they contain and the time intervals they cover. Due to the large size and the broad scope it is difficult for interested researchers to locate relevant information in the archives as search facilities are very limited. Many users are more interested in studying smaller and topically coherent event-centric collections of documents contained in a Web archive [1,2]. Such collections can reflect specific events such as elections, or natural disasters, e.g. the Fukushima nuclear disaster (2011) or the German federal elections.
It is proposed to install an experimental setup in the fixed-target hall of the Nuclotron with the final goal to perform a research program focused on the production of strange matter in heavyion collisions at beam energies between 2 and 6 A GeV. The basic setup will comprise a large acceptance dipole magnet with inner tracking detector modules based on double-sided Silicon micro-strip sensors and GEMs. The outer tracking will be based on the drift chambers and straw tube detector. Particle identification will be based on the time-of-flight measurements. This setup will be sufficient perform a comprehensive study of strangeness production in heavy-ion collisions, including multi-strange hyperons, multi-strange hypernuclei, and exotic multi-strange heavy objects. These pioneering measurements would provide the first data on the production of these particles in heavy-ion collisions at Nuclotron beam energies, and would open an avenue to explore the third (strangeness) axis of the nuclear chart. The extension of the experimental program is related with the study of in-medium effects for vector mesons decaying in hadronic modes. The studies of the NN and NA reactions for the reference is assumed.
The web and the social web play an increasingly important role as an information source for Members of Parliament and their assistants, journalists, political analysts and researchers. It provides important and crucial background information, like reactions to political events and comments made by the general public. The case study presented in this paper is driven by two European parliaments (the Greek and the Austrian parliament) and targets an effective exploration of political web archives. In this paper, we describe semantic technologies deployed to ease the exploration of the archived web and social web content and present evaluation results.