Citation: Eisen JA, Ganley E, MacCallum CJ (2014) Open Science and Reporting Animal Studies: Who's Accountable? PLoS Biol 12(1): e1001757. doi:10.1371/journal.pbio.1001757
Published: January 7, 2014
Copyright: © 2014 Eisen et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Competing interests: Jonathan Eisen is chair of the PLOS Biology Advisory Board. Catriona MacCallum is an employee of PLOS whose salary is supported by PLOS income derived from the publication of open-access papers.
What responsibility do scientists have to report the experimental work and analyses they do on animals fully and transparently, and what responsibilities do funders, journal editors, and reviewers have to ensure that what is reported is done so appropriately? While the answer to both of these questions might seem obvious, the accumulating evidence suggests that the actual reporting done in publications is far from ideal. As Baker et al. discuss in this issue of PLOS Biology , 86%–87% of experimental articles do not give any indication that the animals in the study were properly randomized, and 95% do not demonstrate that their study had a sample size sufficient to detect an effect of the treatment were there to be one ,. Moreover, they show that 13% of studies of rodents with experimental autoimmune encephalomyelitis (an animal model of multiple sclerosis) failed to report any statistical analyses at all, and 55% included inappropriate statistics . And while you might expect that publications in “higher ranked” journals would have better reporting and a more rigorous methodology, Baker et al. reveal that higher ranked journals (with an impact factor greater than ten) are twice as likely to report either no or inappropriate statistics.
Poor reporting has at least four equally serious, interconnected consequences (e.g., –). The first is increasing evidence that experimental pre-clinical work purportedly demonstrating the impact of a particular drug or intervention on an animal model fails in translation ,: follow-up clinical work in humans shows either no effect, for example, or that there are side effects that were never detected in the animal model. The second is that “underpowered” studies—designed with too small a sample size—will either be too small to detect an effect, which may result in a false-negative result, or will demonstrate effects only by chance—a false positive. Underpowered studies can be cumulatively useful for inclusion in meta-analyses of interventions, but only if they are reported fully and properly. The third consequence of poor reporting is that badly reported studies cannot be validated—there are often insufficient details to replicate the study design or analyses. Finally, poor reporting leads to publication bias: the pressure to publish only positive results means that negative studies are not reported or there is a bias to include selective analyses that report significant effects ,,.
For research, if being “open” means maximizing the number of people an article can reach and minimizing the difficulties that readers have in using the information within it , then full and transparent reporting is a fundamental component of open access and open science. The ability to build on any study—to reuse information from any of its component parts—relies on accurate reporting of all the methods and analyses, and full release of any and all data and results from the study. While we can most easily see the consequences for pre-clinical experimental work on animals, the principle applies to any type of study or hypothesis tested. Where animals are involved, however, there is also an ethical obligation to ensure that the work is sound and the animals are not wasted (even if that counterintuitively means increasing the sample size of animals used).
PLOS (and other publishers such as Nature research journals and BioMed Central) endorsed the use of a set of standardized reporting guidelines for animal studies involving in vivo experiments—the ARRIVE guidelines —in 2010. As noted in an editorial in PLOS Biology at the time , while we encouraged authors of articles published in PLOS journals to follow them, we did not mandate their use. In this issue, Baker et al. show that the guidelines had little impact on how well experimental details were reported in experimental autoimmune encephalomyelitis studies in PLOS journals (primarily PLOS ONE) and Nature in the two years after introducing the guidelines, compared with the previous two years.
The authors note that endorsing the guidelines is meaningless unless journals actually implement them. They also conclude that asking authors and journals to implement all 20 items of the ARRIVE guidelines is outside the current reporting norms in biology and that some of the guidelines are already incorporated as part of the standard institutional ethical review process and need not be additionally reported in published articles. Based on analyses of neurological studies, Baker et al. , and Landis et al.  have therefore called for a core set of reporting standards that includes randomization, blinding, sample-size estimation, and the handling of all data. Some journals, such as Nature research journals, have recently mandated a customized checklist that includes some of these elements in response .
PLOS publishes a huge number of whole-animal-related research articles, including observational as well as in vivo experimental work. The animals range from worms, beetles, and flies to horses and primates. There are hundreds of articles in PLOS Biology and several thousand articles in PLOS ONE that present in vivo experimental work. We agree with Baker et al. that to be effective, journals need to implement, rather than just endorse, reporting standards. But we also need to ensure that reporting standards are consistent within and across fields. The ARRIVE guidelines have been endorsed by 300 journals , different funding agencies in the UK , and the National Research Council Institute for Laboratory Animal Research in the US , and they provide a way to set community standards and help shift the consensus about reporting norms. In the UK, the Wellcome Trust, the Biotechnology and Biological Sciences Research Council, and the Medical Research Council have mandated compliance with the guidelines as a condition of funding . Applying such standards has been most effective in the past when journals universally adopt them, such as international standards for describing new species (e.g., for plants ) or the CONSORT guidelines for reporting randomized controlled clinical trials (http://www.consort-statement.org/). If the guidelines are held and maintained by an independent and publicly funded body (such as the National Centre for the Replacement, Refinement and Reduction of Animals in Research (http://www.nc3rs.org.uk/), which hosts the ARRIVE guidelines), they are easier to update, easier for authors or reviewers to find, and easier for authors and funders to refer to in a variety of research outputs (e.g., in grant applications as well as articles). PLOS has a dedicated collection that highlights articles that have discussed guidelines or include related research on guidelines , and PLOS ONE recently discussed on its blog how staff check that articles are reported appropriately before they are assessed by editors and reviewers .
Because reporting guidelines are not an accepted norm in biology, Baker et al.  suggest that core elements of quality experimental design and reporting be adopted in the first instance. Authors, however, should already have to hand all the information requested in the ARRIVE guidelines as an intrinsic component of their research, and there seems little justification not to include all such information as part of a publication. The absence of blinding or randomization or a power calculation in an experimental study can indicate poor experimental design , but it does not necessarily mean the study is wrong or should not be published or that the data cannot be used. What is missing or not missing, however, must be reported so that others can make use of the information appropriately. Likewise, there is no reason not to include standard statements about the housing or welfare or gender of the animals involved—gender-specific outcomes are well known, but other effects are less well known: in some studies differences in housing affected the way animals responded to analgesics ,. Unless such variables are listed, there will be no opportunity to test such differences or include the study in a meta-analysis, where the conditions need to be the same. In general, release of all information associated with an article is critical for any type of research, but it seems to us that, for ethical reasons, it is extra critical in regards to animal research. These are not difficult or unreasonable requests. Authors, funders, and journals are all accountable in this process—supporting good science and open access has never been simpler.
The ARRIVE guidelines now come with a useful checklist for authors , which authors can submit alongside their manuscript with the relevant sections completed. All the journals at PLOS are editorially independent and are making their own decisions about whether and how to implement the guidelines. PLOS Medicine has already mandated that authors submit the checklist for the few relevant articles it publishes , and PLOS Medicine editors will take the checklist items into account as they evaluate the research. PLOS ONE is currently encouraging the use of the checklist and is likely to mandate its inclusion in all submissions reporting experimental in vivo work starting in 2014. The checklist will be made available to editors and reviewers. At PLOS Biology, we are discussing how best to proceed with members of our Editorial Board and will work with our authors to include correct and comprehensive reporting of all the necessary experimental information. We strongly encourage authors to submit the completed checklist to enhance reusability of data, and we welcome feedback on this from the community.
- 1. Baker D, Lidster K, Sottomayor A, Amor S (2013) Two years later: journals are not yet enforcing the ARRIVE guidelines on reporting standards for pre-clinical animal studies. PLoS Biol 11: e1001756 doi:10.1371/journal. pbio.1001756.
- 2. Vesterinen HM, Sena ES, Ffrench-Constant C, Williams A, Chandran S, et al. (2010) Improving the translational hit of experimental treatments in multiple sclerosis. Mult Scler 16: 1044–1055 doi:10.1177/1352458510379612.
- 3. Sena ES, van der Worp HB, Bath PMW, Howells DW, Macleod MR (2010) Publication bias in reports of animal stroke studies leads to major overstatement of efficacy. PLoS Biol 8: e1000344 doi:10.1371/journal.pbio.1000344.
- 4. Rice ASC, Cimino-Brown D, Eisenach JC, Kontinen VK, Lacroix-Fralish ML, et al. (2008) Animal models and the prediction of efficacy in clinical trials of analgesic drugs: a critical appraisal and call for uniform reporting standards. Pain 139: 243–247.
- 5. Van der Worp HB, Howells DW, Sena ES, Porritt MJ, Rewell S, et al. (2010) Can animal models of disease reliably inform human studies? PLoS Med 7: e1000245 doi:10.1371/journal.pmed.1000245.
- 6. Tsilidis KK, Panagiotou OA, Sena ES, Aretouli E, Evangelou E, et al. (2013) Evaluation of excess significance bias in animal studies of neurological diseases. PLoS Biol 11: e1001609 doi:10.1371/journal.pbio.1001609.
- 7. Kilkenny C, Parsons N, Kadyszewski E, Festing MFW, Cuthill IC, et al. (2009) Survey of the quality of experimental design, statistical analysis and reporting of research using animals. PLoS ONE 4: e7824 doi:10.1371/journal.pone.0007824.
- 8. Ioannidis JPA (2005) Why most published research findings are false. PLoS Med 2: e124 doi:10.1371/journal.pmed.0020124.
- 9. Button KS, Ioannidis JPA, Mokrysz C, Nosek BA, Flint J, et al. (2013) Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci 14: 365–376 Available: http://www.nature.com/nrn/journal/v14/n5/full/nrn3475.html. Accessed 3 June 2013.
- 10. The PLOS Medicine Editors (2013) Better Reporting of Scientific Studies: Why It Matters. PLoS Med 10: e1001504 doi:10.1371/journal.pmed.1001504.
- 11. The PLoS Medicine Editors (2008) Better reporting, better research: guidelines and guidance in PLoS Medicine. PLoS Med 5: e99 doi:10.1371/journal.pmed.0050099.
- 12. Hackam DG, Redelmeier DA (2006) Translation of research evidence from animals to humans. JAMA 296: 1731–1732 doi:10.1001/jama.296.14.1731.
- 13. Landis SC, Amara SG, Asadullah K, Austin CP, Blumenstein R, et al. (2012) A call for transparent reporting to optimize the predictive value of preclinical research. Nature 490: 187–191 doi:10.1038/nature11556.
- 14. Neylon C (2013) Architecting the future of research communication: building the models and analytics for an open access future. PLoS Biol 11: e1001691 doi:10.1371/journal.pbio.1001691.
- 15. Kilkenny C, Browne W, Cuthill I, Emerson M, Altman DG (2010) Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. PLoS Biol 8: e1000412 doi:10.1371/journal.pbio.1000412.
- 16. MacCallum CJ (2010) Reporting animal studies: good science and a duty of care. PLoS Biol 8: e1000413 doi:10.1371/journal.pbio.1000413.
- 17. Baker D, Amor S (2012) Publication guidelines for refereeing and reporting on animal use in experimental autoimmune encephalomyelitis. J Neuroimmunol 242: 78–83 doi:10.1016/j.jneuroim.2011.11.003.
- 18. Announcement: reducing our irreproducibility. Nature 496: 398–398 Available: http://www.nature.com/news/announcement-reducing-our-irreproducibility-1.12852. Accessed 5 November 2013.
- 19. National Centre for the Replacement, Refinement and Reduction of Animals in Research (2013) Journals supporting the ARRIVE guidelines. Available: http://www.nc3rs.org.uk/page.asp?id=1796. Accessed 5 November 2013.
- 20. National Centre for the Replacement, Refinement and Reduction of Animals in Research (2013) Funders supporting the ARRIVE guidelines. Available: http://www.nc3rs.org.uk/page.asp?id=1797. Accessed 5 November 2013.
- 21. US National Research Council Institute for Laboratory Animal Research (2011) Guidance for the description of animal research in scientific publications. Washington (District of Columbia): National Academies Press. Available: http://www.nap.edu/catalog.php?record_id=13241. Accessed 5 November 2013.
- 22. Knapp S, McNeill J, Turland NJ (2011) Changes to publication requirements made at the XVIII International Botanical Congress in Melbourne—what does e-publication mean for you? BMC Evol Biol 11: 250 Available: http://www.biomedcentral.com/1471-2148/11/250. Accessed 5 November 2013.
- 23. (2013) PLOS collections: reporting guidelines collection. Available: http://www.ploscollections.org/reportingguidelines. Accessed 5 November 2013.
- 24. PLOS ONE Editors (2013) Editorial highlight: reporting standards at PLOS ONE. Available: http://blogs.plos.org/everyone/2013/10/17/editorial-highlight-reporting-standards-at-plos-one/. Accessed 5 November 2013.
- 25. Rice ASC, Morland R, Huang W, Currie GL, Sena ES, et al. (2013) Transparency in the reporting of in vivo pre-clinical pain research: the relevance and implications of the ARRIVE (Animal Research: Reporting In Vivo Experiments) guidelines. Scand J Pain 4: 58–62 Available: http://www.scandinavianjournalpain.com/article/S1877-8860%2813%2900005-0/fulltext. Accessed 5 November 2013.
- 26. National Centre for the Replacement, Refinement and Reduction of Animals in Research (2013) The ARRIVE guidelines checklist. Animal research: reporting in vivo experiments. Available: http://www.nc3rs.org.uk/ARRIVEchecklist/. Accessed 5 November 2013.
- 27. PLOS Medicine Editors (2013) Translating translational research into global health gains. PLoS Med 10: e1001493 doi:10.1371/journal.pmed.1001493. Accessed 5 November 2013.