OPUS 4 | Search

Looking into Pandora's box: the content of Sci-Hub and its usage [version 1; referees: 2 approved, 2 approved with reservations] (2017)

Tzovaras, Bastian Greshake

Despite the growth of Open Access, potentially illegally circumventing paywalls to access scholarly publications is becoming a more mainstream phenomenon. The web service Sci-Hub is amongst the biggest facilitators of this, offering free access to around 62 million publications. So far it is not well studied how and why its users are accessing publications through Sci-Hub. By utilizing the recently released corpus of Sci-Hub and comparing it to the data of ~28 million downloads done through the service, this study tries to address some of these questions. The comparative analysis shows that both the usage and complete corpus is largely made up of recently published articles, with users disproportionately favoring newer articles and 35% of downloaded articles being published after 2013. These results hint that embargo periods before publications become Open Access are frequently circumnavigated using Guerilla Open Access approaches like Sci-Hub. On a journal level, the downloads show a bias towards some scholarly disciplines, especially Chemistry, suggesting increased barriers to access for these. Comparing the use and corpus on a publisher level, it becomes clear that only 11% of publishers are highly requested in comparison to the baseline frequency, while 45% of all publishers are significantly less accessed than expected. Despite this, the oligopoly of publishers is even more remarkable on the level of content consumption, with 80% of all downloads being published through only 9 publishers. All of this suggests that Sci-Hub is used by different populations and for a number of different reasons, and that there is still a lack of access to the published scientific record. A further analysis of these openly available data resources will undoubtedly be valuable for the investigation of academic publishing.

Sci-Hub provides access to nearly all scholarly literature [v2] (2017)

Himmelstein, Daniel S. ; Rodriguez Romero, Ariel ; McLaughlin, Stephen Reid ; Tzovaras, Bastian Greshake ; Greene, Casey S.

The website Sci-Hub provides access to scholarly literature via full text PDF downloads. The site enables users to access articles that would otherwise be paywalled. Since its creation in 2011, SciHub has grown rapidly in popularity. However, until now, the extent of Sci-Hub’s coverage was unclear. As of March 2017, we find that Sci-Hub’s database contains 68.9% of all 81.6 million scholarly articles, which rises to 85.2% for those published in toll access journals. Coverage varies by discipline, with 92.8% coverage of articles in chemistry journals compared to 76.3% for computer science. Coverage also varies by publisher, with the coverage of the largest publisher, Elsevier, at 97.3%. Our interactive browser at greenelab.github.io/scihub allows users to explore these findings in more detail. We find Sci-Hub preferentially covers popular, paywalled content, containing 96.2% of citations to toll access journals since 2015. For recently requested articles by Unpaywall users, oaDOI provided access to 48.8% whereas Sci-Hub contained 81.5%. Together, oaDOI and Sci-Hub covered 94.1%, demonstrating that gaps in Sci-Hub’s coverage, especially for open access articles, can be filled using licit services. For the first time, nearly all scholarly literature is available gratis to anyone with an Internet connection. Sci-Hub’s scope suggests the subscription publishing model is becoming unsustainable.

Open sharing of genomic data : who does it and why? (2017)

Häusermann, Tobias ; Tzovaras, Bastian Greshake ; Blasimme, Alessandro ; Irdam, Darja ; Richards, Martin ; Vayena, Effy

We explored the characteristics and motivations of people who, having obtained their genetic or genomic data from Direct-To-Consumer genetic testing (DTC-GT) companies, voluntarily decide to share them on the publicly accessible web platform openSNP. The study is the first attempt to describe open data sharing activities undertaken by individuals without institutional oversight. In the paper we provide a detailed overview of the distribution of the demographic characteristics and motivations of people engaged in genetic or genomic open data sharing. The geographical distribution of the respondents showed the USA as dominant. There was no significant gender divide, the age distribution was broad, educational background varied and respondents with and without children were equally represented. Health, even though prominent, was not the respondents’ primary or only motivation to be tested. As to their motivations to openly share their data, 86.05% indicated wanting to learn about themselves as relevant, followed by contributing to the advancement of medical research (80.30%), improving the predictability of genetic testing (76.02%) and considering it fun to explore genotype and phenotype data (75.51%). Whereas most respondents were well aware of the privacy risks of their involvement in open genetic data sharing and considered the possibility of direct, personal repercussions troubling, they estimated the risk of this happening to be negligible. Our findings highlight the diversity of DTC-GT consumers who decide to openly share their data. Instead of focusing exclusively on health-related aspects of genetic testing and data sharing, our study emphasizes the importance of taking into account benefits and risks that stretch beyond the health spectrum. Our results thus lend further support to the call for a broader and multi-faceted conceptualization of genomic utility.

Research led by participants: a new social contract for a new kind of research (2015)

Vayena, Effy ; Brownsword, Roger ; Edwards, Sarah Jane ; Tzovaras, Bastian Greshake ; Kahn, Jeffrey P. ; Ladher, Navjoyt ; Montgomery, Jonathan ; O'Connor, Daniel ; O'Neill, Onora ; Richards, Martin P. ; Rid, Annette ; Sheehan, Mark ; Wicks, Paul ; Tasioulas, John

In recent years, there have been prominent calls for a new social contract that accords a more central role to citizens in health research. Typically, this has been understood as citizens and patients having a greater voice and role within the standard research enterprise. Beyond this, however, it is important that the renegotiated contract specifically addresses the oversight of a new, path-breaking approach to health research: participant-led research. In light of the momentum behind participant-led research and its potential to advance health knowledge by challenging and complementing traditional research, it is vital for all stakeholders to work together in securing the conditions that will enable it to flourish.

Characterization of microsatellite loci in the lichen-forming fungus cetraria aculeata (parmeliaceae, ascomycota) (2016)

Lutsak, Tetiana ; Fernández Mendoza, Fernando ; Tzovaras, Bastian Greshake ; Dal Grande, Francesco ; Ebersberger, Ingo ; Ott, Sieglinde ; Printzen, Christian

Premise of the study: Polymorphic microsatellite markers were developed for the lichen species Cetraria aculeata (Parmeliaceae) to study fine-scale population diversity and phylogeographic structure. Methods and Results: Using Illumina HiSeq and MiSeq, 15 fungus-specific microsatellite markers were developed and tested on 81 specimens from four populations from Spain. The number of alleles ranged from four to 13 alleles per locus with a mean of 7.9, and average gene diversities varied from 0.40 to 0.73 over four populations. The amplification rates of 10 markers (CA01– CA10) in populations of C. aculeata exceeded 85%. The markers also amplified across a range of closely related species, except for locus CA05, which did not amplify in C. australiensis and C. "panamericana," and locus CA10 which did not amplify in C. australiensis. Conclusions: The identified microsatellite markers will be used to study the genetic diversity and phylogeographic structure in populations of C. aculeata in western Eurasia.

openSNP : a crowdsourced web resource for personal genomics (2014)

Tzovaras, Bastian Greshake ; Bayer, Philipp E. ; Rausch, Helge ; Reda, Julia

Genome-wide association studies are widely used to correlate phenotypic traits with genetic variants. These studies usually compare the genetic variation between two groups to single out certain Single Nucleotide Polymorphisms (SNPs) that are linked to a phenotypic variation in one of the groups. However, it is necessary to have a large enough sample size to find statistically significant correlations. Direct-To-Consumer (DTC) genetic testing can supply additional data: DTC-companies offer the analysis of a large amount of SNPs for an individual at low cost without the need to consult a physician or geneticist. Over 100,000 people have already been genotyped through Direct-To-Consumer genetic testing companies. However, this data is not public for a variety of reasons and thus cannot be used in research. It seems reasonable to create a central open data repository for such data. Here we present the web platform openSNP, an open database which allows participants of Direct-To-Consumer genetic testing to publish their genetic data at no cost along with phenotypic information. Through this crowdsourced effort of collecting genetic and phenotypic information, openSNP has become a resource for a wide area of studies, including Genome-Wide Association Studies. openSNP is hosted at http://www.opensnp.org, and the code is released under MIT-license at http://github.com/gedankenstuecke/snpr.

Sci-Hub provides access to nearly all scholarly literature [v3] (2018)

Himmelstein, Daniel S. ; Rodriguez Romero, Ariel ; Levernier, Jacob ; Munro, Thomas Anthony ; McLaughlin, Stephen Reid ; Tzovaras, Bastian Greshake ; Greene, Casey S.

The website Sci-Hub enables users to download PDF versions of scholarly articles, including many articles that are paywalled at their journal’s site. Sci-Hub has grown rapidly since its creation in 2011, but the extent of its coverage was unclear. Here we report that, as of March 2017, Sci-Hub’s database contains 68.9% of the 81.6 million scholarly articles registered with Crossref and 85.2% of articles published in toll access journals. We find that coverage varies by discipline and publisher and that Sci-Hub preferentially covers popular, paywalled content. For toll access articles, green open access via licit services is quite limited, while Sci-Hub provides greater coverage than a major research university. Our interactive browser at https://greenelab.github.io/scihub allows users to explore these findings in more detail. For the first time, nearly all scholarly literature is available gratis to anyone with an Internet connection, suggesting the toll access business model will become unsustainable.

Sci-Hub provides access to nearly all scholarly literature (2019)

Himmelstein, Daniel S. ; Rodriguez Romero, Ariel ; Levernier, Jacob ; Munro, Thomas Anthony ; McLaughlin, Stephen Reid ; Tzovaras, Bastian Greshake ; Greene, Casey S.

The website Sci-Hub enables users to download PDF versions of scholarly articles, including many articles that are paywalled at their journal’s site. Sci-Hub has grown rapidly since its creation in 2011, but the extent of its coverage has been unclear. Here we report that, as of March 2017, Sci-Hub’s database contains 68.9% of the 81.6 million scholarly articles registered with Crossref and 85.1% of articles published in toll access journals. We find that coverage varies by discipline and publisher, and that Sci-Hub preferentially covers popular, paywalled content. For toll access articles, we find that Sci-Hub provides greater coverage than the University of Pennsylvania, a major research university in the United States. Green open access to toll access articles via licit services, on the other hand, remains quite limited. Our interactive browser at https://greenelab.github.io/scihub allows users to explore these findings in more detail. For the first time, nearly all scholarly literature is available gratis to anyone with an Internet connection, suggesting the toll access business model may become unsustainable.

Sci-Hub provides access to nearly all scholarly literature [v1] (2017)

Himmelstein, Daniel S. ; Rodriguez Romero, Ariel ; McLaughlin, Stephen Reid ; Tzovaras, Bastian Greshake ; Greene, Casey S.

The website Sci-Hub provides access to scholarly literature via full text PDF downloads. The site enables users to access articles that would otherwise be paywalled. Since its creation in 2011, Sci-Hub has grown rapidly in popularity. However, until now, the extent of Sci-Hub's coverage was unclear. As of March 2017, we find that Sci-Hub's database contains 68.9% of all 81.6 million scholarly articles, which rises to 85.2% for those published in closed access journals. Furthermore, Sci-Hub contains 77.0% of the 5.2 million articles published by inactive journals. Coverage varies by discipline, with 92.8% coverage of articles in chemistry journals compared to 76.3% for computer science. Coverage also varies by publisher, with the coverage of the largest publisher, Elsevier, at 97.3%. Our interactive browser at https://greenelab.github.io/scihub allows users to explore these findings in more detail. Finally, we estimate that over a six-month period in 2015–2016, Sci-Hub provided access for 99.3% of valid incoming requests. Hence, the scope of this resource suggests the subscription publishing model is becoming unsustainable. For the first time, the overwhelming majority of scholarly literature is available gratis to anyone with an Internet connection.

A multi-disciplinary perspective on emergent and future innovations in peer review [version 1; referees: 2 approved with reservations] (2017)

Peer review of research articles is a core part of our scholarly communication system. In spite of its importance, the status and purpose of peer review is often contested. What is its role in our modern digital research and communications infrastructure? Does it perform to the high standards with which it is generally regarded? Studies of peer review have shown that it is prone to bias and abuse in numerous dimensions, frequently unreliable, and can fail to detect even fraudulent research. With the advent of Web technologies, we are now witnessing a phase of innovation and experimentation in our approaches to peer review. These developments prompted us to examine emerging models of peer review from a range of disciplines and venues, and to ask how they might address some of the issues with our current systems of peer review. We examine the functionality of a range of social Web platforms, and compare these with the traits underlying a viable peer review system: quality control, quantified performance metrics as engagement incentives, and certification and reputation. Ideally, any new systems will demonstrate that they out-perform current models while avoiding as many of the biases of existing systems as possible. We conclude that there is considerable scope for new peer review initiatives to be developed, each with their own potential issues and advantages. We also propose a novel hybrid platform model that, at least partially, resolves many of the technical and social issues associated with peer review, and can potentially disrupt the entire scholarly communication system. Success for any such development relies on reaching a critical threshold of research community engagement with both the process and the platform, and therefore cannot be achieved without a significant change of incentives in research environments.

Author(s)
Title
Additional Person(s)
Referee(s)
Abstract
Fulltext

Open Access

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

16 search hits