Indian Journal of Cancer
Home  ICS  Feedback Subscribe Top cited articles Login 
Users Online :834
Small font sizeDefault font sizeIncrease font size
Navigate here
Resource links
   Similar in PUBMED
   Search Pubmed for
   Search in Google Scholar for
   Article in PDF (296 KB)
   Citation Manager
   Access Statistics
   Reader Comments
   Email Alert *
   Add to My List *
* Registration required (free)  

  In this article

 Article Access Statistics
    PDF Downloaded91    
    Comments [Add]    

Recommend this journal


  Table of Contents  
Year : 2020  |  Volume : 57  |  Issue : 3  |  Page : 229-230

A call for transparency in data reporting

1 Associate Editor, Indian Journal of Surgery, Tata Memorial Hospital, Mumbai, India
2 Director, Tata Memorial Hospital, Mumbai, India
3 Editor, Indian Journal of Cancer

Date of Submission04-Aug-2020
Date of Decision04-Aug-2020
Date of Acceptance04-Aug-2020
Date of Web Publication10-Aug-2020

Correspondence Address:
Vinay H Deshmane
Editor, Indian Journal of Cancer

Login to access the Email id

Source of Support: None, Conflict of Interest: None

DOI: 10.4103/ijc.IJC_882_20

Rights and Permissions

How to cite this article:
Shah SR, Pramesh C S, Deshmane VH. A call for transparency in data reporting. Indian J Cancer 2020;57:229-30

How to cite this URL:
Shah SR, Pramesh C S, Deshmane VH. A call for transparency in data reporting. Indian J Cancer [serial online] 2020 [cited 2021 Jan 25];57:229-30. Available from:

“It is a capital mistake to theorize before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts.”

― Sir Arthur Conan Doyle in 'A Scandal in Bohemia' The Adventures of Sherlock Holmes

A physician from the early twentieth century transported in time would hardly recognize or comprehend modern medical literature. The good doctor would be used to penning her/his personal experience in eloquent sentences. An elegant turn of phrase would be used to convince readers. We have moved far beyond this. While we lament the fact that the beauty of informal language has given way to the rigid formality of modern scientific communication, our focus in this editorial is more to do with the data within. Today, tightly structured articles bristle with statistics and use complex tables and graphs to present as much of the data as possible in summary form. Educated readers can interpret these and make their own judgments on the validity and weight of the conclusion reached. The data generating these statistics is often massive involving numerous parameters in large numbers of patients. The very aim of this exercise is to reduce the 'literature' in medical literature and attempt to increase the science; however the outcome might often be the reverse.

The ongoing COVID-19 pandemic has brought great scrutiny upon published medical literature. Faced with a rapidly spreading, previously un-encountered problem, never has it been more imperative to produce good quality scientific evidence on diagnosis, prevention, and intervention with great rapidity. Social media ensured wide dissemination of the findings often before any form of peer review; papers were dissected by unofficial reviewers, many biased for or against the conclusions. The desire to be first to discover new aspects of the disease often led to premature submissions, and journals hurried to publish potentially practice and life-altering data. The stately measured submission and peer review process of many weeks and months became a frenzied race over a few days. The biggest faux-pas were possibly by two leading journals publishing papers [1],[2] based on data provided by a medical informatics company – Surgisphere. When the company could not share the data for independent review of its veracity, it ultimately led to the retraction of these articles. This caused immense damage to the very basis of scientific research as these retractions were extensively covered by the lay press, and the reputation of these leading journals suffered.

Journal editors take pride in putting across medical literature in the most transparent fashion. The loss of literary style of issues of the past has been stimulated by the need for more transparent data presentation. Each type of article will have its own checklist to ensure that the structure is as complete as possible. The EQUATOR (Enhancing the QUAlity and Transparency Of health Research) network site lists all these. The International Committee of Medical Journal Editors (ICMJE) have ensured that transparency is maintained by standard reporting for their journals. They also require that all clinical trials be listed in a public registry along with its protocol to ensure transparency of methodology. The ICMJE statement on data sharing is the next step.[3] Every article submitted after July 2018 and any clinical trial after January 2019 is required to submit a data sharing plan. This explicitly states how much of the data is freely accessible and for what duration of time, as well as any supplemental information like the protocol and data analysis plan. Provision for secondary use of data is to be mentioned. Articles depending on secondary data are encouraged to collaborate or at the least, acknowledge the original work.

When journals were printed exclusively on paper, data sharing would have been a distant dream. Today, all databases are maintained electronically. Ultimate transparency is provided when the actual data recorded for each and every patient is available so it is easy to see what was collected and what is missing. Any flaws in the presentation of the truth in the summary statistics and by the statistical analysis technique would be exposed. Further, data fabrication would be a particular challenge if the full database was exposed for all to see. The presence of raw data would facilitate meta-analysis.

The Surgisphere data was put into doubt post-publication by researchers who questioned the number of cases in Australia in the time period reported and the drug availability in various countries. These are specific facts which may not have been available to the reviewers of the article. The retraction was precipitated by authors not being able to independently verify the data and analysis. This is an extreme case where even the first author had no transparent access to the raw data. Implementation of data transparency would have obviated this. If all the recorded parameters in the multitude of papers published in the pandemic were accessible to all, it would be easy to pool data and arrive at conclusions more rapidly. Similarly, it would be easy to discount poorly collected, incomplete data where confounders were ignored. The need for complete transparency would ensure that appropriate and ethical utilization of resources could be verified; matching with the prospectively published methodology would also ensure against selective reporting. Open scrutiny would also tighten data collection and presentation, and premature publication may be prevented.

Would there be harm in declaring the data in a transparent manner? Clearly, if there were parameters recorded where patient identity could be revealed, this would be a problem. However, masking these details in a database is now quite easy. Certain commercial entities would worry about intellectual property and the revelation of 'trade secrets'. Similarly, in academic research, the lack of exclusivity for the primary researchers to mine the data further might disincentivize original research. However, in its purest sense, science should be open and transparent for the greater benefit of all. Privately owned pharmaceutical firms like Novartis, Merck, and AstraZeneca have their own policy encouraging data transparency. If research has been done with public funds, the public would have the absolute right to access the data so collected, and therefore, there can be no objection to data transparency. Several organizations including the Wellcome Trust, the Centers for Disease Control and Prevention, and the Canadian Institutes of Health Research actively encourage the use of common data repositories for researchers to share data.

In summary, the benefits of transparency and sharing of data in the public domain far outweigh the perceived drawbacks. Modern science and researchers should embrace the philosophy of open data for the larger good of scientific progress and humanity itself.

Not only should research be done well, it should also be seen to be done well.

“In God we trust.. all others need to bring data.”

- WE Deming (attribution disputed)

  References Top

Mehra MR, Desai SS, Ruschitzka F, Patel AN. Hydroxychloroquine or chloroquine with or without a macrolide for treatment of COVID-19: a multinational registry analysis. Lancet 2020;S0140-6736:31180-6.  Back to cited text no. 1
Mehra MR, Desai SS, Kuy S, Henry TD, Patel AN. Cardiovascular disease, drug therapy, and mortality in Covid-19. N Engl J Med 2020;382:e102.  Back to cited text no. 2
Taichman DB, Backus J, Baethge C, Bauchner H, Leeuw PW, Drazen JM, et al. Sharing clinical trial data: A proposal from the International Committee of Medical Journal Editors. Ann Intern Med 2016;164:505-6.  Back to cited text no. 3


Print this article  Email this article


  Site Map | What's new | Copyright and Disclaimer
  Online since 1st April '07
  2007 - Indian Journal of Cancer | Published by Wolters Kluwer - Medknow