AJR Women's Imaging Online
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kliewer, M. A.
Right arrow Articles by Provenzale, J. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kliewer, M. A.
Right arrow Articles by Provenzale, J. M.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Hotlight (NEW!)
Right arrow
What's Hotlight?
AJR 2005; 184:1731-1735
© American Roentgen Ray Society

Reviewing the Reviewers: Comparison of Review Quality and Reviewer Characteristics at the American Journal of Roentgenology

Mark A. Kliewer1, Kelly S. Freed2, David M. DeLong3, Perry J. Pickhardt1 and James M. Provenzale3

1 Department of Radiology, University of Wisconsin, 600 Highland Ave., Madison, WI 53792.
2 Lehigh Valley Hospital, Allentown, PA.
3 Department of Radiology, Box 3808, Duke University Medical Center, Durham, NC 27710.

Received August 12, 2004; accepted after revision September 23, 2004.

 
Address correspondence to M. A. Kliewer.


Abstract
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 
OBJECTIVE. The purpose of our study was to determine which manuscript reviewer characteristics are most strongly associated with reviewer performance as judged by editors of the American Journal of Roentgenology (AJR).

MATERIALS AND METHODS. At the AJR, manuscript reviews are rated by the journal editors on a subjective scale from 1 (lowest) to 4, on the basis of the value, thoroughness, and punctuality of the critique. We obtained all scores for AJR reviewers and determined the average score for each reviewer. We also sent a questionnaire to 989 reviewers requesting specific information regarding the age, sex, radiology subspecialty, number of years serving as a reviewer, academic rank, and practice type of the reviewer. The demographic profiles were correlated with the average quality score for each reviewer. Statistical analysis included correlation analysis and analysis of variance modeling. Reviewer quality scores were also correlated with the scoring of individual reviews and ultimate disposition of 196 manuscripts sent to the AJR during the same period.

RESULTS. Responses to the questionnaire were obtained from 821 reviewers (83.0%), for whom quality scores were available for 714 (87.0%). Correlation analysis shows that the quality score of reviewers strongly correlated with younger age (p = 0.001). A statistically significant correlation between quality score and practice type was seen (p = 0.008), with reviewers from academic institutions receiving higher scores. No significant correlation was found between quality score and sex (p = 0.72), years of reviewing (p = 0.26), academic rank (p = 0.10), or the ultimate disposition of the manuscript (p = 0.40). The quality score of the reviewers showed no variation by subspecialty (p = 0.99).

CONCLUSION. The highest-rated AJR reviewers tended to be young and from academic institutions. The quality of peer review did not correlate with the sex, academic rank, or subspecialty of the reviewer.


Introduction
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 
Manuscript reviewers are the essential but largely unseen agents of the peer review process. Reviewers are charged with the task of judging whether a manuscript is important, scientifically valid, coherent and readable, and appropriate for a particular journal. The success of peer review hinges on the skill, discernment, dedication, and fair-mindedness of a large coterie of expert reviewers. Of course, not all experts are equally skilled in this task. But who are the best reviewers? Are there characteristics of reviewers that would tend to predict the quality of their reviews?

Despite the central importance of reviewers to the peer review process, only four studies have attempted to identify the characteristics of a good reviewer, and to our knowledge no study has examined the radiology literature [1-4]. Therefore, we undertook a study to look for correlations between the professional and demographic profile of reviewers and the quality of their reviews. We hypothesized that reviewer performance might be related to a number of characteristics, including age, sex, subspecialty, number of years reviewing, academic rank, and type of practice (academic or private). The goal of the study was to determine if particular reviewer attributes tend to predict the quality of manuscript review.


Materials and Methods
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 
We sent a questionnaire to all 989 reviewers for the American Journal of Roentgenology (AJR) who were enrolled in the AJR reviewer database in 1998-1999 (Appendix 1). The reviewers were asked to identify their age, sex, principal subspecialty, number of years reviewing for the AJR, academic rank, and practice type (academic or private).


View this table:
[in this window]
[in a new window]

 
APPENDIX 1. Demographic Questionnaire Sent to AJR Reviewers

 

The AJR database at that time listed the following information: reviewer name, institution, average length of time for reviews, length of time since last review, and quality scores for all manuscript reviews provided by either the editor in chief, one of the two associate editors, or individuals serving an internship in the AJR office by virtue of having been awarded the ARRS Melvin M. Figley Fellowship in Radiology Journalism (all of whom are hereafter referred to as editors). These quality scores are average ratings of reviewer performance by the AJR editors. For every review received by the journal, reviewer performance is evaluated by one of the AJR editors and rated on a scale of 1-4, with 4 being the highest score. Scores are based on the level of sophistication of the commentary, the quality of the suggestions for manuscript improvement, the amount of detail, and the punctuality of the review [5-7]. These scores are subjective in nature and are not based on well-defined criteria. Editors are not blinded to reviewer identity or previous scores. Using these AJR data, we calculated the average quality score for each reviewer. For the database constructed specifically for this project, reviewers were identified only by a number code: reviewer names were purged to protect the confidentiality and privacy of the reviewers. Only the editor in chief and associate editors had access to the list correlating reviewer name and quality score.



View larger version (13K):
[in this window]
[in a new window]
[as a PowerPoint slide]
 
Fig. 1. Graph shows downward trend in AJR editors' assessment of reviewer performance when correlated with reviewer age. Older reviewers tended to receive lower quality of review scores from AJR editors. The higher score for the 80- to 89-year-old category is likely anomalous, because only a single reviewer is in this age category.

 
Eight hundred twenty-one reviewers responded to the questionnaire, representing 83.0% of all potential reviewers. These data were recorded in a database and correlated with current reviewer scores. Using Kendall tau rank correlation coefficients, we also studied the relationship between reviewer attributes (age, sex, years of reviewing, academic rank, and practice type) and the average quality score given to reviews by the AJR editors. The relationship between the quality score and the subspecialty of a reviewer was evaluated using the analysis of variance test.

Over a contemporaneous 6-month interval, 196 major papers submitted to the AJR were collected and entered into a database in which were recorded the reviewer scores of the manuscripts, the demographic profiles of the assigned reviewers, and the ultimate disposition of the manuscripts. The quality score of each reviewer was correlated with the overall score given by the reviewer to individual manuscripts and also with the ultimate disposition of the manuscript using Kendall tau correlation analysis. The overall score is the rating of the manuscript on a 10-point scale, with 1-4 recommending rejection, 5-6 recommending rejection with the opportunity to revise, and 7-10 recommending acceptance. Similarly, the ultimate disposition of a manuscript was coded as rejection, rejection with the opportunity to revise, or acceptance.

All statistical tests were considered significant at p values of 0.05 or less.


Results
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 
Reviewer scores were available for 714 reviewers (87.0%) who responded to the questionnaires. The respondents without scores were either no longer active reviewers, recently retired from professional life, or on hiatus from reviewing. For the group with scores, the mean age was 46 years (median, 45 years) and the mean number of years reviewing was 6.8 years (median, 5 years). The reviewers were 557 men (78%) and 157 women (22%). There were 664 reviewers (93%) from academic institutions and 50 (7%) from private practice. Of the reviewers with academic affiliations, there were 10 (1.5%) instructors, 180 (27%) assistant professors, 174 (26%) associate professors, and 300 (45%) full professors. Dividing the reviewers into age categories of 10-year intervals, we found one (0.1%) reviewer in the 20- to 29-year age range; 194 (27%), 30-39 years; 280 (39%), 40-49 years; 187 (26%), 50-59 years; 45 (6%), 60-69 years; six (0.8%), 70-79 years; and one (0.1%), 80-89 years. The distribution of the reviewers based on years reviewing was 306 (43%), 0-4 years; 171 (24%), 5-9 years; 149 (21%), 10-14 years; 57 (8%), 15-19 years; 24 (3.4%), 20-24 years; and seven (1%), 25 or more years of experience. Finally, the 731 subspecialties identified by the reviewers were bone (n = 96, 13%), chest (97, 13%), gastrointestinal (97, 13%), genitourinary (48, 6.6%), mammography (47, 6.4%), angiography and interventional (61, 8.3%), neuroradiology (74, 10%), pediatrics (67, 9.2%), obstetrics and gynecology (11, 1.5%), heart (10, 1.4%), sonography (51, 6.9%), MRI (32, 4.4%), physics (6, 0.8%), statistics (1, 0.1%), computers (7, 1.0%), economics (4, 0.5%), nuclear medicine (18, 2.5%), and emergency radiology (4, 0.5%).

The average quality score for the composite group ranged from 1 to 4 (mean, 3.4; median, 3.5). The number of reviews on which the average quality score was based ranged from 1 to 13 manuscript reviews (mean, 3.9 reviews; median, 4.0).

The average quality score of these reviewers strongly correlated with age (p = 0.001); older reviewers generally received lower quality scores. The decline of quality score for older reviewers is seen in Figure 1. Further, we found a statistically significant correlation between quality score and practice type (p = 0.008); reviewers from academia typically rated higher than those in private practice. The mean quality score of reviewers with academic affiliations was 3.41, and the mean quality score of reviewers in private practice was 3.26. We found no significant correlation between quality score and sex (p = 0.72), years of reviewing (p = 0.26), or academic rank (p = 0.10). Although the quality score of AJR reviewers did not correlate with years of reviewing when the entire data set was analyzed, we found a notable difference in the quality scores of reviewers with more than 25 years of service (mean quality score, 3.19) when compared with the quality scores of reviewers with shorter periods of service (mean quality score, 3.37-3.51). However, the number of reviewers with at least 25 years of reviewing experience was small (n = 7). Last, the quality score of reviewers showed no variation by subspecialty (p = 0.99).

The average quality score of a reviewer was not significantly correlated with either the overall score of the manuscript (r = 0.05, p = 0.33) or the ultimate disposition of the manuscript in the peer review process (r = 0.06, p = 0.40).


Discussion
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 
The assessment by AJR editors of the quality of reviews correlated with age of reviewers and practice type but not with other reviewer attributes (sex, years of reviewing, subspecialty, or academic rank). In our analysis, older reviewers tended to receive lower quality scores, and reviewers from academia rated higher than those in private practice. Furthermore, a notable decline was seen in the quality scores of reviewers with 25 or more years of experience when compared with the quality scores of reviewers with shorter periods of service, although the sample size for the former group was relatively small. We did not find a correlation between the reviewer quality score and manuscript disposition, although this analysis is weakened by the limited number of repeat reviews from individual reviewers in the set of 196 manuscripts. It may yet be the case that reviewers tend to receive higher scores for reviewing manuscripts of lesser quality: a longer review of an inferior manuscript could be perceived as more thorough and incisive than a shorter review of a superior manuscript if only for the abundance of detail and commentary. However, more data would be needed to test this hypothesis.

Several conceivable explanations can be ventured for our finding that reviewer age and quality of review are inversely correlated. Younger reviewers may bring more enthusiasm to a review, constructing longer, more detailed reviews that draw comparisons with the existing medical literature. Younger reviewers may more avidly seek recognition and validation from well-known and influential editors. Or perhaps, younger reviewers may be more recently schooled in issues of experimental design, statistics, physics, or emerging imaging techniques. Conversely, perhaps as reviewers grow older, a gradual flagging of enthusiasm ensues; or perhaps older reviewers simply become more jaded, believing that they have heard or read some such before; or perhaps experience gained by reviewers over time does not compensate for mounting demands on their time and energy. It is also possible that older reviewers tend to be more laconic and less prone to provide exhaustive critiques or extensive advice, which would tend to result in lower scores by the editorial staff. Furthermore, older reviewers may rely more on their perceived personal authority [1]. Older reviewers may conceivably be more entrenched in their opinions, tending to harbor harsher views toward perspectives that do not coincide with their own beliefs and experiences. This phenomenon has been referred to as "confirmatory bias" [8, 9].

Whatever the explanation, the value of contributions from reviewers of younger years and more junior rank should be recognized. That this group can overcome what are, perhaps, more limited and nascent perspectives and produce thoughtful critiques of real substance is a testament to the value of youthful enthusiasm and dedication.

It is perhaps less mysterious why reviewers from academic practice might produce more compelling reviews than those from private practice. By virtue of less work intensity and protected academic time, academic radiologists tend to have a greater opportunity during the working day to prepare manuscript critiques. Academic radiologists are perhaps more likely to participate in regular journal clubs, in which the techniques of cogent criticism and close reading are taught and reinforced. And finally, academics may have greater resources from which to draw: these could be both material (university and departmental libraries, teaching files, computer and Internet databases) and intellectual (statisticians, subspecialist colleagues, physicists).

Some of our results have precedent in the medical literature. First, our finding that younger reviewers tend to produce more highly regarded reviews has also been described by other researchers outside the discipline of radiology [1-4]. Those studies, like ours, found no other characteristics of reviewers to be consistently associated with higher quality reviews. One study did find that male reviewers were more likely than female reviewers to give extreme scores on manuscripts, but this study also found that such differences between the sexes did not influence the ultimate disposition of a manuscript in the peer review process [10]. A different study from Scandinavian researchers in which reviewers rated fictitious manuscripts failed to find a systematic pattern of manuscript rating that could be attributed to either reviewer subspecialty or sex [3].

The variability between reviewers in our study further attests to the important intermediary role played by editors in the peer review process [11-14]. Editors must monitor the variation in quality and scoring tendencies of reviewers to mitigate the effects of a biased or deviant review. For many years now, the editors at the AJR have used a subjective system of rating individual reviews [5-7]. Such a system has been used by other journals and has been shown to be moderately reliable and moderately well correlated with a reviewer's ability to report manuscript flaws [15]. Editors at the AJR use the reviewer quality score as a tool to monitor reviewer performance and as a safeguard against the mishandling of a manuscript by reviewers who might be less careful than is required. Editors must ensure that every major paper—particularly one advancing complex, unexpected, or highly original interpretations—receives at least one fair and careful reading by an accomplished reviewer. They must match specific manuscripts with reviewers with particular expertise, knowledge, and skills and scrutinize the reviews for balance, persuasiveness, and clarity.

Our study has several potential limitations. First, editors were not blinded to the identity of reviewers. This lack of blinding could have created biases in various directions. Editors might hold private opinions (good or bad) of a reviewer from prior personal or professional encounters. Feasibly, because journal editors and associate editors tend to hold their positions in mid or late career, they might be expected to be more favorably disposed toward reviewers known to them and of their peer group. Similarly, Figley fellows may prefer the perspective of their younger peers. Unfortunately, the sources of the reviewer scores were not recorded, so the scoring tendencies of specific editors could not be ascertained. Second, the criteria used to judge reviews were neither standardized nor objective. The average quality score is based itself on a subjective assessment and is therefore subject to the inherent biases of the editorial staff. Editors could harbor predetermined views about a person, an institution, an imaging technique, or a field of inquiry. Editors might tend to rate more highly those reviews that coincide with their own opinions of the manuscript. Third, a selection bias doubtless exists in the evolution of the reviewer database. Over time, the editors will retire reviewers who consistently provide poor reviews or refuse to enthusiastically participate in the review process. Arguably, though, such attrition would tend to artificially enrich the ranks of older reviewers. This selection process would have the tendency to obscure differences that might exist between different types of reviewers if all potential reviewers from the radiology community were studied. Fourth, our choice of subspecialty categories was, in retrospect, less apt than it might have been. For instance, a more accurate and inclusive categorization scheme would have probably used "musculoskeletal" rather than "bone" and "breast imaging" rather than "mammography." Considering how such terms are understood in the argot of our profession, however, it is unlikely that this misstep substantially influenced our results. Fifth, we did not control the analysis for type of manuscript. Reviews of shorter manuscripts, such as case reports, may tend to be more succinct (and receive lower scores) than reviews of longer, more detailed manuscripts. However, smaller manuscripts tend to more frequently be assigned to younger reviewers gaining experience with the review process, and this might actually skew younger reviewer scores downward. And finally, because this is a cross-sectional collection of data, one cannot be certain that it is simply age that accounts for the recorded differences in rated performance. A longitudinal study showing a progressive decline in review quality would be more definitive, but such a study is not currently available.

In summary, we found that the best reviewers at the AJR tend to be younger individuals from academic institutions. Of equal importance is the recognition that reviewers showed no significant variation in skill when compared by subspecialty or sex. Clearly, superb reviewers are found throughout the discipline of radiology. The onus to keep the peer review process in good working order falls squarely on the shoulders of the editors. An essential part of their job is tracking reviewer performance and protecting authors from reviewers who tend to be deviant or extreme [11]. We believe that the editors' subjective quality rating of peer reviews of manuscripts serves as a useful tool for monitoring reviewer performance.

What role mentoring might have in improving peer review is as yet unexplored, but it is enticing to imagine the synergy that might be created if the experience of an older mentor were wedded to the enthusiasm of a neophyte. We might soon be able to find out: the AJR staff has recently implemented a program to develop young reviewers under the benign tutelage of seasoned academics. Established reviewers have been asked to identify junior staff and trainees with potential and to help them hone their critical skills. One added benefit of such a program might be greater homogeneity of reviewer scores across age groups.


Acknowledgments
 
We gratefully acknowledge the assistance of Carrie Poole in the preparation of the manuscript; the AJR staff, especially Charles Jenkins; and Lee Rogers, who patiently supported the project. Three authors were recipients of the Melvin M. Figley Fellowship in Radiology Journalism and therefore wish to thank the ARRS for this enriching training and experience.


References
Top
Abstract
Introduction
Materials and Methods
Results
Discussion
References
 

  1. Stossel TP. Reviewer status and review quality: experience of the Journal of Clinical Investigation. N Engl J Med 1985;312:658 -659[Medline]
  2. Evans AT, McNutt RA, Fletcher SW, Fletcher RH. The characteristics of peer reviewers who produce good quality reviews. J Gen Intern Med 1993;8:422 -428[Medline]
  3. Nylenna M, Riis P, Karlsson Y. Multiple blinded reviews of the same two manuscripts. JAMA1994; 272:149 -151[Abstract/Free Full Text]
  4. Black N, van Rooyen S, Godlee F, Smith R, Evans S. What makes a good reviewer and a good review for a general medical journal. JAMA 1998;280:231 -233[Abstract/Free Full Text]
  5. Rogers LF. Peer reviewers: reviewing manuscripts for the AJR. (editorial) AJR2002; 178:1051 -1052[Free Full Text]
  6. Friedman DP. Manuscript peer review at the AJR: facts, figures, and quality assessment. AJR1995; 164:1007 -1009[Abstract/Free Full Text]
  7. Polak JF. The role of the manuscript reviewer in the peer review process. AJR1995; 165:685 -688[Abstract/Free Full Text]
  8. Ernst E, Resch KL. Reviewer bias: a blinded experimental study. J Lab Clin Med1994; 124:178 -182[Medline]
  9. Mahoney MJ. Publication prejudices: an experimental study of confirmatory bias in the peer review system. Cognitive Therapy and Research 1977;1:161 -175
  10. Gilbert JR, Williams ES, Lundberg GD. Is there gender bias in JAMA's peer review process? JAMA1994; 272:139 -142[Abstract/Free Full Text]
  11. Siegelman SS. Assassins and zealots: variations in peer review. Radiology1991; 178:637 -642[Free Full Text]
  12. Relman AS. Peer review in scientific journals: what good is it? West J Med1990; 153:520 -522[Medline]
  13. Relman AS, Angell M. How good is peer review? N Engl J Med 1989;321:827 -829[Medline]
  14. Chew FS. Manuscript peer review: general concepts and the AJR process. AJR1993; 160:409 -411[Free Full Text]
  15. Callahan ML, Baxt WG, Waeckerle JF, Wears RL. Reliability of editors' subjective quality ratings of peer reviews of manuscripts. JAMA 1998;280:229 -231[Abstract/Free Full Text]

Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Am. J. Roentgenol.Home page
T. H. Berquist
Publication in the AJR: Critical Interactions among Authors, Reviewers, and Section Editors
Am. J. Roentgenol., November 1, 2008; 191(5): 1291 - 1292.
[Full Text] [PDF]


Home page
RadiologyHome page
R. G. Sheiman
The RSNA Reviewer Mentorship Program
Radiology, September 1, 2007; 244(3): 631 - 632.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kliewer, M. A.
Right arrow Articles by Provenzale, J. M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kliewer, M. A.
Right arrow Articles by Provenzale, J. M.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Hotlight (NEW!)
Right arrow
What's Hotlight?


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS