|
|
||||||||
1 Department of Clinical Epidemiology and Biostatistics, Academic Medical Center-University of Amsterdam, P. O. Box 22700, 1100 DE Amsterdam, The Netherlands.
Reprinted from
www.consort-statement.org/stardstatement.htm.
Accessed April 8, 2003.
Abstract
|
|
|---|
METHODS. The Standards for Reporting of Diagnostic Accuracy (STARD) steering committee searched the literature to identify publications on the appropriate conduct and reporting of diagnostic studies and extracted potential items into an extensive list. Researchers, editors, and members of professional organisations shortened this list during a 2-day consensus meeting with the goal of developing a checklist and a generic flow diagram for studies of diagnostic accuracy.
RESULTS. The search for published guidelines about diagnostic research yielded 33 previously published checklists, from which we extracted a list of 75 potential items. At the consensus meeting, participants shortened the list to a 25-item checklist, by using evidence whenever available. A prototype of a flow diagram provides information about the method of recruitment of patients, the order of test execution and the numbers of patients undergoing the test under evaluation, the reference standard, or both.
CONCLUSIONS. Evaluation of research depends on complete and accurate reporting. If medical journals adopt the checklist and the flow diagram, the quality of reporting of studies of diagnostic accuracy should improve to the advantage of clinicians, researchers, reviewers, journals, and the public.
|
|
|---|
|
In studies of diagnostic accuracy, the outcomes from one or more tests under evaluation are compared with outcomes from the reference standard, both measured in individuals who are suspected of having the condition of interest. The term test refers to any method for obtaining additional information on a patient's health status. It includes information from history and physical examination, laboratory tests, imaging tests, function tests, and histopathology. The condition of interest or target condition can refer to a particular disease or to any other identifiable condition that may prompt clinical actions, such as further diagnostic testing, or the initiation, modification or termination of treatment. In this framework, the reference standard is considered to be the best available method for establishing the presence or absence of the condition of interest. The reference standard can be a single method, or a combination of methods, to establish the presence of the target condition. It can include laboratory tests, imaging tests, and pathology, but also dedicated clinical follow-up of participants. The term accuracy refers to the amount of agreement between the information from the test under evaluation, referred to as the index test, and the reference standard. Diagnostic accuracy can be expressed in many ways, including sensitivity and specificity, likelihood ratios, diagnostic odds ratio, and the area under a receiver operator characteristic (ROC) curve [46].
Several potential threats to the internal and external validity of a study of diagnostic accuracy exist. A survey of studies of diagnostic accuracy published in four major medical journals between 1978 and 1993 revealed that the methodological quality was mediocre at best [7]. However, assessments were hampered because many reports lacked information on key elements of design, conduct, and analysis of diagnostic studies [7]. The absence of essential information about the design and conduct of diagnostic studies has been confirmed by authors of meta-analyses [8, 9]. As in any other type of research, flaws in study design can lead to biased results. One report showed that diagnostic studies with specific design features are associated with biased, optimistic estimates of diagnostic accuracy compared with studies without such deficiencies [10].
At the 1999 Cochrane Colloquium meeting in Rome, the Cochrane Diagnostic and Screening Test Methods Working Group discussed the low methodological quality and substandard reporting of diagnostic test evaluations. The Working Group felt that the first step towards correcting these problems was to improve the quality of reporting of diagnostic studies. Following the successful CONSORT (consolidated standards of reporting trials) initiative [1113], the Working Group aimed to develop a checklist of items that should be included in the report of a study of diagnostic accuracy.
The objective of the Standards for Reporting of Diagnostic Accuracy (STARD) initiative is to improve the quality of reporting of studies of diagnostic accuracy. Complete and accurate reporting allows the reader to detect the potential for bias in the study (internal validity) and to assess the generalisability and applicability of the results (external validity).
|
|
|---|
Subsequently, the STARD steering committee convened a 2-day consensus meeting for invited experts from the following interest groups: researchers, editors, methodologists, and professional organisations. The aim of the conference was to reduce the extended list of potential items, as appropriate, and to discuss the optimum format and phrasing of the checklist. The selection of items to retain was based on evidence whenever possible.
The meeting format consisted of a mixture of small group sessions and plenary sessions. Each small group focused on a group of related items of the list. The suggestions of the small groups were then discussed in plenary sessions. Overnight, a first draft of the STARD checklist was assembled on the basis of suggestions from the small groups and additional remarks from the plenary sessions. All meeting attendees discussed this version the next day and made additional changes. The members of the STARD group could suggest further changes through a later round of comments by email.
Potential users field-tested the conference version of the checklist and flow diagram, and additional comments were collected. This version was placed on the CONSORT website with a call for comments. The STARD steering committee discussed all comments and assembled the final checklist.
|
|
|---|
The STARD group received valuable comments and remarks during the various stages of evaluation after the conference, which resulted in the version of the STARD checklist that appears in Table 1.
|
|
|
|---|
We arranged the items under the usual headings of a medical research article, but this is not intended to dictate the order in which they have to appear within an article.
The guiding principle in the development of the STARD checklist was to select items that would help readers to judge the potential for bias in the study and to appraise the applicability of the findings. Two other general considerations shaped the content and format of the checklist. First, the STARD group believes that one general checklist for studies of diagnostic accuracy, rather than different checklists for each speciality, is likely to be more widely disseminated and perhaps accepted by authors, peer reviewers, and journal editors. Although the evaluation of an imaging test differs from that of a test in the laboratory, we felt that these differences were more of degree than in kind. The second consideration was the development of a checklist specifically aimed at studies of diagnostic accuracy. We did not include general issues in the reporting of research findings, such as the recommendations contained in the Uniform Requirements for Manuscripts Submitted to Biomedical Journals [14].
Wherever possible, the STARD group based the decision to include an item on evidence linking the item to biased estimates (internal validity) or to variations in measures of diagnostic accuracy (external validity). The evidence varied from narrative articles that explained theoretical principles and papers that presented the results from statistical modelling to empirical evidence derived from diagnostic studies. For several items, the evidence was rather limited.
A separate background document explains the meaning and rationale of each item and briefly summarises the type and amount of evidence [15]. This background document should enhance the use, understanding and dissemination of the STARD checklist.
The STARD group put considerable effort into the development of a flow diagram for diagnostic studies. A flow diagram has the potential to communicate vital information about the design of a study and the flow of participants in a transparent manner [16]. A comparable flow diagram has become an essential element in the CONSORT standards for reporting of randomised trials [12, 16]. The flow diagram could be even more essential in diagnostic studies, in view of the variety of designs employed in diagnostic research. Flow diagrams in the reports of studies of diagnostic accuracy indicate the process of sampling and selecting participants (external validity), the flow of participants in relation to the timing and outcomes of tests, the number of participants who do not receive either the index test or the reference standard, or both (potential for verification bias [1719], and the number of patients at each stage of the study, which provides the correct denominator for proportions (internal consistency).
The STARD group plans to measure the impact of the statement on the quality of published reports on diagnostic accuracy using a before-and-after assessment [l3]. Updates of the STARD initiative's documents will be provided when new evidence on sources of bias or variability becomes available. We welcome any comments, whether on content or form, to improve the current version.
Acknowledgments
Members of the STARD Steering Committee
Patrick Bossuyt
Academic Medical Center, Dept. of Clinical Epidemiology, Amsterdam, The Netherlands
Constantine Gatsonis
Brown University, Centre for Statistical Sciences, Providence, RI, United States of America
Les Irwig
Screening and Test Evaluation Program, School of Public Health, University of Sydney, Australia
David Moher
Chalmers Research, Group Ottawa, ON, Canada
Riekie de Vet
Free University, Institute for Research in Extramural Medicine, Amsterdam, The Netherlands
David Bruns
Clinical Chemistry, Charlottesville, VA, United States of America
Paul Glasziou
Mayne Medical School, Dept. of Social & Preventive Medicine, Herston, Australia
Jeroen Lijmer
Academic Medical Center, Dept. of Clinical Epidemiology, Amsterdam, The Netherlands
Drummond Rennie
Journal of the American Medical Association, Chicago, IL, United States of America
Members of the STARD Group
Doug Altman
Institute of Health Sciences, Centre for Statistics in Medicine, Oxford, United Kingdom
Colin Begg
Memorial Sloan-Kettering Cancer Center, Dept. Epidemiology & Biostatistics, New York, NY, United States of America
Harry Büller
Academic Medical Center, Dept. of Vascular Medicine, Amsterdam, The Netherlands
Frank Davidoff
Annals of Internal Medicine, Philadelphia, PA, United States of America
Paul Dieppe
Dept. Social Medicine, University of Bristol, Bristol, United Kingdom
Rijk van Ginkel
Academic Medical Center, Dept. of Clinical Epidemiology, Amsterdam, The Netherlands
Gordon Guyatt
McMaster University, Clinical Epidemiology and Biostatistics, Hamilton, ON, Canada
Richard Horton
The Lancet, London, United Kingdom
Stuart Barton
British Medical Journal, BMA House, London, United Kingdom
William Black
Dartmouth Hitchcock Medical Center, Dept. of Radiology, Lebanon, NH, United States of America
Gregory Campbell
US FDA, Center for Devices and Radiological Health, Rockville, MD, United States of America
Jon Deeks
Institute of Health Sciences, Centre for Statistics in Medicine, Oxford, United Kingdom
Kenneth Fleming
John Radcliffe Hospital, Oxford, United Kingdom
Afina Glas
Academic Medical Center, Dept. of Clinical Epidemiology, Amsterdam, The Netherlands
James Hanley
McGill University, Dept. Epidemiology & Biostatistics, Montreal, QC, Canada
Myriam Hunink
Erasmus Medical Center, Dept. Epidemiology & Biostatistics, Rotterdam, The Netherlands
Jos Kleijnen
NHS Centre for Reviews and Dissemination, York, United Kingdom
Erik Magid
Amager Hospital, Dept. Clinical Biochemistry, Copenhagen, Denmark
Matthew McQueen
Hamilton Civic Hospitals, Dept. of Laboratory Medicine, Hamilton, ON, Canada
John Overbeke
Nederlands Tijdschrift voor Geneeskunde, Amsterdam, The Netherlands
Anthony Proto
Radiology, Editorial Office, Richmond, VA, United States of America
David Sackett
Trout Research and Education Centre, Irish Lake, ON, Canada
Harold Sox
Annals of Internal Medicine, Philadelphia, PA, United States of America
Stephan Walter
McMaster University, Clinical Epidemiology and Biostatistics, Hamilton, ON, Canada
Andre Knottnerus
Maastricht University, Netherlands School of Primary Care Research, Maastricht, The Netherlands
Barbara McNeil
Harvard Medical School, Dept. of Health Care Policy, Boston, MA, United States of America
Andrew Onderdonk
Channing Laboratory, Boston, MA, United States of America
Christopher Price
St Bartholomew's - Royal London School of Medicine and Dentistry, London, United Kingdom
Hans Reitsma
Academic Medical Center, Dept. of Clinical Epidemiology, Amsterdam, The Netherlands
Gerard Sanders
Academic Medical Center, Dept. of Clinical Chemistry, Amsterdam, The Netherlands
Sharon Straus
Mt. Sinai Hospital, Toronto, ON, Canada
|
|
|---|
This article has been cited by other articles:
![]() |
W. B. DeMartini, P. R. Eby, S. Peacock, and C. D. Lehman Utility of Targeted Sonography for Breast Lesions That Were Suspicious on MRI Am. J. Roentgenol., April 1, 2009; 192(4): 1128 - 1134. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. B. Meijer, Y. L. O, J. Geleijns, and L. J. M. Kroft Meta-Analysis of 40- and 64-MDCT Angiography for Assessing Coronary Artery Stenosis Am. J. Roentgenol., December 1, 2008; 191(6): 1667 - 1675. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Oliver, A. Papaioannou, L. Giangregorio, L. Thabane, K. Reizgys, and G. Foster A systematic review and meta-analysis of studies using the STRATIFY tool for prediction of falls in hospital patients: how well does it work? Age Ageing, November 1, 2008; 37(6): 621 - 627. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. van Randen, S. Bipat, A. H. Zwinderman, D. T. Ubbink, J. Stoker, and M. A. Boermeester Acute Appendicitis: Meta-Analysis of Diagnostic Performance of CT and Graded Compression US Related to Prevalence of Disease Radiology, October 1, 2008; 249(1): 97 - 106. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Fletcher, S. M. Kymes, M. Gould, N. Alazraki, R. E. Coleman, V. J. Lowe, C. Marn, G. Segall, L. A. Thet, K. Lee, et al. A Comparison of the Diagnostic Accuracy of 18F-FDG PET and CT in the Characterization of Solitary Pulmonary Nodules J. Nucl. Med., February 1, 2008; 49(2): 179 - 185. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Liu, X. Zhao, J. Huang, C. J. Francois, D. Tuite, X. Bi, D. Li, and J. C. Carr Comparison of 3D Free-Breathing Coronary MR Angiography and 64-MDCT Angiography for Detection of Coronary Stenosis in Patients with High Calcium Scores Am. J. Roentgenol., December 1, 2007; 189(6): 1326 - 1332. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Suzuki, S. Furui, K. Okinaga, T. Sakamoto, J. Murata, A. Furukawa, and Y. Ohnaka Differentiation of Femoral Versus Inguinal Hernia: CT Findings Am. J. Roentgenol., August 1, 2007; 189(2): W78 - W83. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. M Kymes, K. Lee, J. W Fletcher, and SNAP (CSP 027) Study Group Assessing diagnostic accuracy and the clinical value of positron emission tomography imaging in patients with solitary pulmonary nodules (SNAP) Clinical Trials, February 1, 2006; 3(1): 31 - 42. [Abstract] [PDF] |
||||
![]() |
L. Tooth, R. Ware, C. Bain, D. M. Purdie, and A. Dobson Quality of Reporting of Observational Longitudinal Research Am. J. Epidemiol., February 1, 2005; 161(3): 280 - 288. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. T. Zivian, R. Gershater, W. K. Erly, B. C. Ashdown, and R. W. Lucio General Radiologists' Diagnostic Accuracy: Incomplete Presentation of Data Casts Doubt on Study's Conclusions Am. J. Roentgenol., February 1, 2005; 184(2): 697 - 699. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |