Which physical examination tests provide clinicians with the most value when examining the shoulder? Update of a systematic review with meta-analysis of individual tests

Eric J Hegedus; Adam P Goode; Chad E Cook; Lori Michener; Cortney A Myer; Daniel M Myer; Alexis A Wright

doi:10.1136/bjsports-2012-091066

Reviews

Eric J Hegedus1,
Adam P Goode2,
Chad E Cook3,
Lori Michener4,
Cortney A Myer5,
Daniel M Myer6,7,
Alexis A Wright1

¹Physical Therapy, High Point University, 833 Montlieu Ave, High Point, North Carolina, USA
²Community and Family Medicine, Duke University School of Medicine, Durham, North Carolina, USA
³Physical Therapy, Walsh University, Canton, Ohio, USA
⁴Physical Therapy, Virginia Commonwealth University, Richmond, Virginia, USA
⁵Physical Therapy, Center for Orthopedics and Sports Medicine and Akron Childrens Hospital, Akron, Ohio, USA
⁶Orthopedic Sports Medicine, Orthopedic Research of Virginia, Richmond, VA
⁷Crystal Clinic Orthopedic Center, Akron, Ohio, USA

Correspondence to High Point University Physical Therapy, 833 Montlieu Ave, High Point, North Carolina 27262, USA; ehegedus{at}highpoint.edu

Abstract

Objective To update our previously published systematic review and meta-analysis by subjecting the literature on shoulder physical examination (ShPE) to careful analysis in order to determine each test's clinical utility.

Methods This review is an update of previous work, therefore the terms in the Medline and CINAHL search strategies remained the same with the exception that the search was confined to the dates November, 2006 through to February, 2012. The previous study dates were 1966 – October, 2006. Further, the original search was expanded, without date restrictions, to include two new databases: EMBASE and the Cochrane Library. The Quality Assessment of Diagnostic Accuracy Studies, version 2 (QUADAS 2) tool was used to critique the quality of each new paper. Where appropriate, data from the prior review and this review were combined to perform meta-analysis using the updated hierarchical summary receiver operating characteristic and bivariate models.

Results Since the publication of the 2008 review, 32 additional studies were identified and critiqued. For subacromial impingement, the meta-analysis revealed that the pooled sensitivity and specificity were 72% and 60%, respectively, for the Neer test; 79% and 59%, respectively, for the Hawkins-Kennedy test; and 53% and 76%, respectively, for the painful arc. Also from the meta-analysis, regarding superior labral anterior to posterior (SLAP) tears, the test with the best sensitivity (52%) was the relocation test; the test with the best specificity (95%) was Yergason's test; and the test with the best positive likelihood ratio (2.81) was the compression-rotation test. Regarding new (to this series of reviews) ShPE tests, where meta-analysis was not possible because of lack of sufficient studies or heterogeneity between studies, there are some individual tests that warrant further investigation. A highly specific test (specificity >80%, LR+ ≥ 5.0) from a low bias study is the passive distraction test for a SLAP lesion. This test may rule in a SLAP lesion when positive. A sensitive test (sensitivity >80%, LR− ≤ 0.20) of note is the shoulder shrug sign, for stiffness-related disorders (osteoarthritis and adhesive capsulitis) as well as rotator cuff tendinopathy. There are six additional tests with higher sensitivities, specificities, or both but caution is urged since all of these tests have been studied only once and more than one ShPE test (eg, active compression, biceps load II) has been introduced with great diagnostic statistics only to have further research fail to replicate the results of the original authors. The belly-off and modified belly press tests for subscapularis tendinopathy, bony apprehension test for bony instability, olecranon-manubrium percussion test for bony abnormality, passive compression for a SLAP lesion, and the lateral Jobe test for rotator cuff tear give reason for optimism since they demonstrated both high sensitivities and specificities reported in low bias studies. Finally, one additional test was studied in two separate papers. The dynamic labral shear may be sensitive for SLAP lesions but, when modified, be diagnostic of labral tears generally.

Conclusion Based on data from the original 2008 review and this update, the use of any single ShPE test to make a pathognomonic diagnosis cannot be unequivocally recommended. There exist some promising tests but their properties must be confirmed in more than one study. Combinations of ShPE tests provide better accuracy, but marginally so. These findings seem to provide support for stressing a comprehensive clinical examination including history and physical examination. However, there is a great need for large, prospective, well-designed studies that examine the diagnostic accuracy of the many aspects of the clinical examination and what combinations of these aspects are useful in differentially diagnosing pathologies of the shoulder.

Introduction

In 2006, we reviewed shoulder physical examination (ShPE) and in 2008 our work was published in this journal.1 This publication was followed by a series of either similar or otherwise redundant publications, addressing either all of shoulder testing or components dedicated to specific pathologies.2–7 The majority of those subsequent articles did not meta-analyse the accuracy of ShPE tests, evaluate risk of bias among the studies, or identify studies unique to our 2008 publication.1 The fact that so many review articles analysed the diagnostic accuracy of clinical shoulder tests in a period of three years speaks to the need to clearly address the question: ‘Which physical examination tests provide clinicians with the most value for diagnosis when examining the shoulder?’

Since 2006, there have been many changes necessitating an update of the original article. First and foremost, the publication of diagnostic articles on the use of ShPE tests in the clinical examination has continued at a brisk pace, resulting in numerous new publications on the accuracy of established tests and the development of new tests. Next, the methodology by which a systematic review on diagnostic accuracy is conducted has been updated from the Quality of Reporting of Meta-analyses (QUOROM)8 with the publication of Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA).9 Third, the criterion standard method of performing a meta-analysis has become a unification10 of the bivariate model11 and the hierarchical summary receiver operating characteristic (HSROC) model.12 Finally, the method by which the quality of individual studies is examined has been updated from the original Quality Assessment of Diagnostic Accuracy Studies (QUADAS)13 to the newly published QUADAS-2.14 These changes over the last five years have been extensive but the goal of this systematic review and meta-analysis has remained the same: to subject the literature on ShPE tests to careful analysis in order to determine their clinical utility in adult (18 or older) patients.

Methods

This systematic review with meta-analysis was conducted and reported according to the protocol outlined by PRISMA9 using a research question framed by PICOS methodology. PICOS is a mnemonic representing population (eg, adults), intervention (eg, diagnostic test), comparison (eg, control group), outcome (eg, accuracy) and study design (eg, cohort). In order to be eligible for this review, diagnostic accuracy studies, written in English, had to report both the sensitivity and specificity of ShPE tests in adults with shoulder pain due to musculoskeletal pathology. Excluded from this review were articles using equipment or devices that are not readily available to most clinicians during physical examination and articles in which subjects were tested under anaesthesia or in which subjects were cadavers.

Study selection

Since this review is an update of our previous work,1 the terms in our Medline and CINAHL search strategies remained the same with the exception that the search was confined to the dates November, 2006 through February, 2012. Our previous study dates were 1966 – October, 2006. Further, the original search was expanded, without date restrictions, to include two new databases: EMBASE and the Cochrane Library. A hand search was also conducted which included the authors' private collections and the searching of previous systematic reviews. Two authors (EH and AW) read titles and abstracts of all database-captured articles, applying the a priori inclusion/exclusion criteria, and agreement was measured using the κ statistic (figure 1). Disagreement was then resolved by discussion between the two authors and, in the event that agreement could not be reached, a third author (CC) served as the deciding vote. With the remaining articles, the same two authors (EH and AW) read the entire paper and again, a κ value was calculated to measure agreement as to which articles to retain for final analysis (figure 1). Once the final group of 32 articles was determined, 2x2 table data were extracted and saved for meta-analysis. Only data from studies where the 2x2 data were reported, or could be inferred from stated positive likelihood ratios, negative likelihood ratios, positive predictive values, and negative predictive values, were retained for meta-analysis. If 2x2 data could not be discerned, the article was excluded from meta-analysis but still retained for systematic review and qualitative analysis.
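As an illustration of how 2x2 data might be inferred from reported likelihood ratios, the sketch below solves the two defining equations, LR+ = SN/(1 − SP) and LR− = (1 − SN)/SP, for sensitivity and specificity and then rebuilds approximate cell counts. It is not the extraction procedure used in this review; the function names, the example likelihood ratios and the group sizes are hypothetical.

```python
def sens_spec_from_lrs(lr_pos, lr_neg):
    """Solve LR+ = SN/(1-SP) and LR- = (1-SN)/SP for (SN, SP)."""
    if lr_pos == lr_neg:
        raise ValueError("LR+ and LR- must differ to identify SN and SP")
    sp = (lr_pos - 1.0) / (lr_pos - lr_neg)
    sn = lr_pos * (1.0 - sp)
    return sn, sp


def two_by_two(sn, sp, n_diseased, n_healthy):
    """Approximate (TP, FN, FP, TN), rounded to whole patients."""
    tp = round(sn * n_diseased)
    tn = round(sp * n_healthy)
    return tp, n_diseased - tp, n_healthy - tn, tn


# Hypothetical report: LR+ = 4.0, LR- = 0.25, 50 patients with and
# 50 patients without the target pathology on the reference standard.
sn, sp = sens_spec_from_lrs(lr_pos=4.0, lr_neg=0.25)
cells = two_by_two(sn, sp, n_diseased=50, n_healthy=50)
print(round(sn, 3), round(sp, 3), cells)
```

Rounding to whole patients means the rebuilt table is only approximate; the group sizes must come from the original paper.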

Figure 1

Flow diagram of the literature screening process. Note that the total of articles broken down into subgroups does not equal 32 because multiple articles addressed more than one pathognomonic category.

Quality assessment

Once the final group of articles was agreed upon, two authors (EH and AW) independently examined the quality of each article using the QUADAS-2 tool.14 QUADAS-2 is a 4-phase tool, the last phase of which assists authors of systematic reviews in rating: 1) bias and 2) applicability. The risk of bias is assessed in four key areas: patient selection, index test, reference standard, and flow and timing. Concern for applicability is assessed in three key areas: patient selection, index test, and reference standard. For both categories, risk of bias and concern for applicability, the individual criteria were classified as low risk, high risk, or unclear and the results were presented using tables from the QUADAS web site (www.quadas.org).

Statistical analysis

In order to maximise the potential for meta-analysis, we added 2x2 data from our first meta-analysis1 to data gathered from the 32 additional articles included in this review. Hierarchical summary receiver operating characteristic (HSROC) curve12 and bivariate11 models were used to combine estimates of sensitivity (SN), specificity (SP), positive likelihood ratios (LR+), negative likelihood ratios (LR−) and diagnostic OR (DOR) with their 95% CI. Sensitivity measures the proportion of actual positives which are correctly identified as such (eg, the percentage of sick people who are correctly identified as having the condition). Specificity measures the proportion of negatives which are correctly identified (eg, the percentage of healthy people who are correctly identified as not having the condition). The positive likelihood ratio (LR+) indicates how much the odds of the disease increase when a test is positive.15 The negative likelihood ratio (LR−) indicates how much the odds of the disease decrease when a test is negative.15 The diagnostic OR expresses the strength of association between the test result and disease. These models, in the absence of covariates, are different parameterisations of the same model10 and take into account the correlation between sensitivity and specificity and both the within-study and the between-study variances.16 The 95% prediction region, which is provided graphically, is the region expected to contain, with the given probability (ie, 95%), the true sensitivity and specificity of a future study.17 DerSimonian-Laird18 random-effects models were used where fewer than four studies were eligible for statistical pooling. Heterogeneity was explored graphically with forest plots and statistically with Cochran's Q, with p<0.10 indicating significant heterogeneity. When appropriate, meta-regression or subgroup analysis using study level characteristics was used to explore heterogeneity, with p<0.10 indicating a significant difference in stratified estimates. A p value of <0.10 was chosen as the threshold for significance in stratified estimates because of the low power of the test used to detect such differences.19 A value of 0.5 was added to all four cells of the 2x2 table when a zero was encountered in any cell, as suggested by Cox.20
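The single-study quantities defined above can be computed directly from a 2x2 table. The following is a minimal sketch, not the Stata code used for this review; the function name and the example counts are hypothetical. It applies the 0.5 continuity correction to all four cells whenever any cell is zero, as described above.

```python
def diagnostic_accuracy(tp, fn, fp, tn):
    """Sensitivity, specificity, LR+, LR- and DOR from one 2x2 table.

    If any cell is zero, 0.5 is added to all four cells first.
    """
    cells = [tp, fn, fp, tn]
    if any(c == 0 for c in cells):
        tp, fn, fp, tn = (c + 0.5 for c in cells)
    sn = tp / (tp + fn)            # true positives / all with disease
    sp = tn / (tn + fp)            # true negatives / all without disease
    return {
        "sensitivity": sn,
        "specificity": sp,
        "lr_pos": sn / (1 - sp),   # odds multiplier after a positive test
        "lr_neg": (1 - sn) / sp,   # odds multiplier after a negative test
        "dor": (tp * tn) / (fp * fn),
    }


# Hypothetical study: 40 TP, 10 FN, 20 FP, 30 TN.
plain = diagnostic_accuracy(40, 10, 20, 30)
# Hypothetical study with a zero cell (no false negatives).
corrected = diagnostic_accuracy(10, 0, 5, 15)
print(plain)
print(corrected)
```

These are per-study estimates only; the pooled values in this review come from the HSROC and bivariate models, which additionally model between-study variance and the correlation between sensitivity and specificity.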

Publication bias was analysed statistically with the Egger test,21 with p<0.05 indicating significant publication bias. Threshold effects were tested using Spearman correlation coefficients.22 Studies with an influential effect on summary estimates were identified with Cook's distance (Cook's D) and standardised residuals according to Rabe-Hesketh,23 with sensitivity analyses to determine if influential studies should be removed from the analyses. All statistical analyses were conducted in Stata 11 (Stata, College Station, Texas, USA) by one of the authors (AG).

Results

New Studies/Tests/Pathologies

In reference to our previous meta-analysis,1 there were 32 new studies addressing the diagnostic accuracy of ShPE tests of the shoulder in adults (figure 1). A summary of the characteristics of each study is presented in table 1.

Table 1

Summary of studies

Twelve of these studies26,28,29,35,38,39,45–49,53 added 13 new tests to the literature, the majority of which attempted to detect a SLAP lesion. New tests were defined as those for which diagnostic accuracy statistics were reported for the first time in peer-reviewed literature. Clinically, many of these tests are not new. The 32 studies addressed the categories of: Rotator cuff tears (RCTs), Tendinopathy, Subacromial impingement, Instability, Labral tears, Biceps pathology, Stiffness-related disorders and Other. The most frequent topics of focus were RCTs, Tendinopathy, Subacromial impingement and Labral tears. Many would consider tendinopathy and impingement different labels for the same syndrome and further, that both labels capture a continuum of disease that includes RCTs. We concur with this thought but separated these pathologic entities in order to simplify analysis. Therefore, the rotator cuff tear group included those studies where diagnostic accuracy was examined inclusive of any size of tear or classification system used. Three studies25,30,33 in the RCT category addressed full-thickness tears, one study39 addressed massive RCTs, and six studies41,42,46,52–54 addressed RCTs regardless of size or classification. Of the 10 RCT studies, five used tests designed to test specific, individual muscles of the rotator cuff. An example of this methodology was the Kim et al42 study that examined the accuracy of the empty can for supraspinatus pathology, Patte's test for infraspinatus tendinopathy, and the lift-off for subscapularis tendinopathy (and Yergason's test for biceps tendinopathy).

There were some trends observed in categories other than RCTs. In the labral tear group, two studies examined the use of tests to detect any labral tear, while six studies addressed superior labral anterior to posterior (SLAP) lesions and one study37 addressed both labral tears generally and SLAP lesions specifically. Of the three studies in the Instability category,29,37,39 one39 addressed soft tissue-related instability and two29,37 addressed bony instability, a pathology attracting increased attention since our last review. The Stiffness-related group included studies addressing either glenohumeral OA or adhesive capsulitis. Two studies28,39 in this category actually used the same data for the shrug sign and published those data in two separate papers. All three of the stiffness-related papers28,39,48 addressed adhesive capsulitis, another new pathology in the diagnostic literature since our last review. Finally, the Other category consists of two articles38,39 on detecting acromioclavicular (AC) pathology and one addressing bony abnormality.47

The sensitivity and specificity of most ShPE tests examined in all 32 studies and the risk of bias in each study are summarised in table 2. In the interest of efficient reporting, test data were omitted from table 2 if diagnostic accuracy figures were reported for pathologies which the test was never intended to detect. For example, if an author reported values for the lift-off test (subscapularis) in a population with adhesive capsulitis, those data were not reported.

Table 2

Alphabetical list of common shoulder physical examination (ShPE) tests

Quality assessment – risk of bias and concern for applicability

Each of the 32 papers qualifying for final review was scrutinised, via the QUADAS-2 (Q2),14 in the areas of risk of bias and concern for applicability (Appendix). Concern for applicability, for this review, was defined as concern for external validity, the degree to which results of a research study can be applied to practice. The two authors (EH and AW) independently used the Q2,14 blinded from each other's assessments. The number of low risk/concern scores was tallied into a total score for each article and agreement was calculated using a weighted κ statistic. The weighted κ was poor (κ=0.31 with 95% CI 0.10 to 0.52). Summaries of risk of bias and concern for applicability for each pathological group are presented in figure 2. The greatest risk of bias was most often associated with the Q2 items Patient Flow and Reference Standard. The greatest concern in the category of applicability was also the reference standard with the addition of the index test. Patient flow concerns became apparent when there was an indeterminate or excessive time between the administration of the index test and the criterion standard, when patients received different reference standards, or when it was difficult to discern if all patients were included in the analysis. Most of the studies where patient flow was an issue failed to note the length of time between the index test and reference standard, or did not make clear whether all patients were included in the analysis. Often, there was an inability to reconstruct the 2x2 tables accurately from the data reported in the original article. The concern for bias in the reference standard was most often due to a failure to use a double blind design (issuer of the criterion standard was not blinded to index test result) or the failure to use the criterion standard to confirm diagnosis. The obvious gain in popularity of diagnostic ultrasound (n=12 studies in this review) had the deleterious effect of increasing concern for bias since ultrasound is not the criterion standard for shoulder diagnosis.56–58 Lastly, the concern for applicability as it relates to the index test arose because the authors failed to describe the index test.
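For readers unfamiliar with the weighted κ, the sketch below shows one common form, a linearly weighted Cohen's κ applied to ordinal total scores. The weighting scheme used in this review is not stated, so linear weights are an assumption here, and the function name and rater scores are hypothetical. Total scores are assumed to run from 0 to 7, one point for each of the seven Q2 domains rated low risk/concern.

```python
def weighted_kappa(rater_a, rater_b, n_categories):
    """Linearly weighted Cohen's kappa for integer scores 0..n_categories-1."""
    n = len(rater_a)
    k = n_categories
    # Observed joint proportions of (score by rater A, score by rater B).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]
    disagree_obs = 0.0
    disagree_exp = 0.0
    for i in range(k):
        for j in range(k):
            weight = abs(i - j) / (k - 1)          # linear disagreement weight
            disagree_obs += weight * observed[i][j]
            disagree_exp += weight * row[i] * col[j]  # expected by chance
    if disagree_exp == 0:
        raise ValueError("kappa is undefined when both raters use one category")
    return 1.0 - disagree_obs / disagree_exp


# Hypothetical total scores (0-7) from two raters for six papers.
scores_a = [7, 5, 3, 6, 2, 4]
scores_b = [6, 5, 1, 6, 4, 4]
kappa = weighted_kappa(scores_a, scores_b, n_categories=8)
print(round(kappa, 2))
```

With weighting, a one-point difference in total score is penalised less than a five-point difference, which is why a weighted statistic suits ordinal totals better than the unweighted κ.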

Figure 2

Risk of bias and concerns for applicability. Green=low risk/concern; Orange=high risk/concern; Blue=uncertain risk/concern.

Statistical analysis

Overall

Publication bias was not evident in graphical or statistical analysis. However, publication bias cannot be completely ruled out since these tests have decreased statistical power when fewer than 10 studies are analysed.59 No significant negative correlations were found to indicate the influence of threshold effects. Table 3 presents the results of meta-analysis for the individual ShPE tests by diagnosis, number of studies and sample size for the analyses.

Table 3

Summary estimates from meta-analysis

Subacromial impingement

The Neer, Hawkins-Kennedy and painful arc tests for subacromial impingement were summarised for their diagnostic properties and associations. The strongest summary sensitivity was for the Hawkins-Kennedy test (0.80; 0.72, 0.86). However, this value was only at the sensitivity threshold (80%) for assisting in ruling out subacromial impingement and, because of poor specificity, the LR− shows that a negative test has little effect on the post-test probability of subacromial impingement. In fact, none of the three diagnostic tests demonstrated likelihood ratios that would be likely to result in important changes in post-test probability. The pooled DORs indicate that none of these three tests is likely to discriminate well between those with and those without subacromial impingement. Figure 3 (Neer), figure 4 (Hawkins-Kennedy) and figure 5 (painful arc) illustrate the included studies with both the 95% confidence and prediction regions indicating the probable wide variability of the true sensitivity and specificity in future studies.
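The effect of a likelihood ratio on post-test probability can be made concrete with the standard odds conversion. The sketch below is illustrative only: it uses the pooled Hawkins-Kennedy sensitivity and specificity quoted in the abstract (79% and 59%) to form simple likelihood ratios, which will not exactly match the model-based values in table 3, and the 50% pre-test probability is a hypothetical assumption.

```python
def post_test_probability(pre_test_prob, likelihood_ratio):
    """Convert probability to odds, apply the LR, convert back."""
    pre_odds = pre_test_prob / (1.0 - pre_test_prob)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


# Pooled values quoted in the abstract for the Hawkins-Kennedy test.
sn, sp = 0.79, 0.59
lr_pos = sn / (1.0 - sp)
lr_neg = (1.0 - sn) / sp

pre_test = 0.50  # hypothetical pre-test probability of impingement
p_after_positive = post_test_probability(pre_test, lr_pos)
p_after_negative = post_test_probability(pre_test, lr_neg)
print(round(p_after_positive, 2), round(p_after_negative, 2))
```

Under these assumptions a positive test moves the probability from 50% to roughly 66% and a negative test to roughly 26%, which illustrates why neither result settles the diagnosis.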

Figure 3

Hierarchical summary receiver operating characteristic (HSROC) curve composed of studies examining the diagnostic value of the Neer test in cases of subacromial impingement.

Figure 4

Hierarchical summary receiver operating characteristic (HSROC) curve composed of studies examining the diagnostic value of the Hawkins-Kennedy test in cases of subacromial impingement.

Figure 5

Hierarchical summary receiver operating characteristic (HSROC) curve composed of studies examining the diagnostic value of the Painful Arc test in cases of subacromial impingement.

Meta-regression was conducted for both the Neer and Hawkins-Kennedy tests in order to determine if the summary DOR was biased as a result of differing reference standards. For the Neer test, there was a substantially greater DOR among the studies which used the gold standard of surgery for index test confirmation (4.85; 95% CI 3.46 to 6.79) than among those using other reference standards (1.28; 95% CI 0.31 to 5.19). The ratio of DORs was large (3.79; 95% CI 0.87 to 16.14) and the difference between the stratified estimates was statistically significant at the prespecified threshold of p<0.10 (p=0.07). Similarly, the DOR for the Hawkins-Kennedy test was stronger among those studies with the gold standard of surgery (6.41; 95% CI 3.33 to 12.35) than for studies using other than the gold standard (3.14; 95% CI 1.37 to 7.22). However, the stratified estimates were not significantly (p=0.18) different from one another.
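The ratio of DORs for the Neer test can be approximated from the two stratified estimates alone. The sketch below is not the meta-regression used in this review: it assumes the two subgroups are independent, recovers each standard error on the log scale from the reported 95% CI, and combines them. The function names are hypothetical.

```python
import math


def se_log_from_ci(lower, upper, z=1.96):
    """Standard error of a log ratio measure, recovered from its 95% CI."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)


def ratio_of_dors(dor_a, ci_a, dor_b, ci_b, z=1.96):
    """Ratio of two independent DORs with an approximate 95% CI."""
    log_ratio = math.log(dor_a) - math.log(dor_b)
    se = math.sqrt(se_log_from_ci(*ci_a, z) ** 2 + se_log_from_ci(*ci_b, z) ** 2)
    return (
        math.exp(log_ratio),
        math.exp(log_ratio - z * se),
        math.exp(log_ratio + z * se),
    )


# Neer test: surgical reference standard versus other reference standards.
ratio, lower, upper = ratio_of_dors(4.85, (3.46, 6.79), 1.28, (0.31, 5.19))
print(round(ratio, 2), round(lower, 2), round(upper, 2))
```

This approximation gives a ratio of about 3.79 with an interval close to, but not identical to, the reported 0.87 to 16.14; small differences are expected because the published values come from the fitted model and from rounded inputs.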

SLAP lesions

None of the eight ShPE tests for which meta-analysis was possible (table 3) demonstrated sensitivity values that would likely rule out a SLAP lesion with a negative test. Yergason's test had the strongest summary specificity (95.3; 90.6, 98.1), but again, the sensitivity was so poor that the LR+ demonstrates little ability of this test to rule in a SLAP lesion when positive. All eight diagnostic tests for a SLAP lesion had likelihood ratios and DORs that were weak and their CIs contained the null value (table 3).

In the active compression test analysis, the O’Brien et al60 study was found to have a large Cook's D and large standardised residuals, influencing the summary estimates. Cook's D is a measure of the influence that a particular study may have on the model parameters and can be used to check for particularly influential studies. Sensitivity analysis, with removal of the O’Brien et al60 study, resulted in substantial attenuation of the DOR from 3.14 (95% CI 0.42 to 23.40) to 1.19 (95% CI 0.76 to 1.86). As such, this study was not included in summary estimates for the Active Compression test. Figure 6 illustrates the HSROC curves of the Active Compression test both with and without the outlier study.60

Figure 6

Hierarchical summary receiver operating characteristic (HSROC) curve composed of studies examining the diagnostic value of the Active Compression test in cases of a SLAP lesion. The left graph includes the original article reporting on the value of the test and the right graph shows the result of the elimination of this outlier study60.

Anterior instability

Statistical pooling was done individually for three tests for the diagnosis of anterior instability: the apprehension, relocation and surprise tests. The surprise test demonstrated the strongest sensitivity (81.8; 69.1, 90.9) and, therefore, the negative likelihood ratio (0.25; 0.08, 0.78) most likely to rule out anterior instability when negative. All three tests demonstrated the ability to rule in anterior instability due to strong specificity. The apprehension test had the strongest positive likelihood ratio (17.2; 10.02, 29.55) and was the only one of the three in which the CI did not contain the null value. The apprehension test had the strongest DOR (53.6; 24.3, 118.3), indicating some evidence for this test's overall diagnostic discriminative performance.

Significant heterogeneity was found in the properties and associations for the relocation test. Subgroup analysis, accomplished by removing the study by Lo et al61 based upon the non-criterion reference standard used, did not improve the overall heterogeneity.

Labral tear

In pooled analyses, the crank test for labral tear demonstrated limited ability to rule in a labral tear with a +LR of 2.4 and specificity of 76%, indicating a likely small change in post-test probability.

Tendinopathy

In pooled analyses, the Hawkins-Kennedy test for tendinopathy demonstrated no evidence for the ability to rule in or out, change post-test probability or have overall diagnostic discriminative performance.

What this study adds

This is the first meta-analysis to study ShPE tests and use the QUADAS 2 document to assist in the qualitative review and the HSROC/bivariate models for meta-analysis
There is less optimism that the biceps load II is diagnostic for SLAP lesions
The belly-off and modified belly press tests may be helpful in diagnosing subscapularis tendinopathy
The bony apprehension test may help diagnose bony instability
The olecranon-manubrium percussion test may be useful in a traumatic injury for bony abnormality requiring referral for x-ray
The passive compression test may be helpful in diagnosing a SLAP lesion
The modified dynamic labral shear test may be diagnostic of labral tears
The lateral Jobe test may be useful for diagnosing a rotator cuff tear
The shrug sign appears to be a sensitive test for stiffness-related disorders (osteoarthritis and adhesive capsulitis) as well as rotator cuff tendinopathy
The passive distraction test may be able to rule in a SLAP tear if positive

Discussion

To our knowledge, this is the first study on diagnostic accuracy to incorporate HSROC/bivariate models as the criterion standard in a meta-analysis of ShPE tests. We feel that the use of this criterion standard promotes increased attention on and isolation of studies that demonstrate results dramatically outside others of similar context. Of particular interest is the dramatic change in both the 95% CI and 95% prediction region of the active compression test for a SLAP lesion when the original study60 is eliminated (figure 6). Further, this study60 is a good example of the effect of bias on estimates of diagnostic accuracy given that the publication possesses examples of at least seven kinds of bias. Most notable of these biases is partial verification bias, which has been shown to lead to overestimation of the diagnostic accuracy of a test.62

For each diagnostic category, the overall results of this systematic review and meta-analysis indicate that a few tests are helpful to confirm or screen for a given diagnosis. There is a preponderance of evidence about individual physical examination tests that could not be combined for the meta-analysis. For those tests, we have used the diagnostic values and risk of bias from the Q2 to determine which tests are recommended for use as a screen or those recommended as a confirmatory test using the benchmarks of specificity >80%, sensitivity >80%, LR+ ≥ 5.0 and LR− ≤0.20. The list is short, and confidence in the diagnostic accuracy estimates is tenuous.
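The benchmarks above can be expressed as a simple rule. The sketch below is an illustration rather than the procedure used in this review: the function name and the example values are hypothetical, the likelihood ratios are derived from a single sensitivity and specificity pair, and risk of bias, which also informed our recommendations, is not modelled.

```python
def classify_test(sensitivity, specificity):
    """Label a test as confirmatory and/or screening using the benchmarks
    specificity >80% with LR+ >= 5.0, and sensitivity >80% with LR- <= 0.20.
    """
    lr_pos = sensitivity / (1.0 - specificity) if specificity < 1.0 else float("inf")
    lr_neg = (1.0 - sensitivity) / specificity if specificity > 0.0 else float("inf")
    labels = []
    if specificity > 0.80 and lr_pos >= 5.0:
        labels.append("confirmatory")   # positive result helps rule in
    if sensitivity > 0.80 and lr_neg <= 0.20:
        labels.append("screening")      # negative result helps rule out
    return labels


# Hypothetical tests, given as (sensitivity, specificity).
specific_test = classify_test(0.53, 0.94)
sensitive_test = classify_test(0.95, 0.50)
weak_test = classify_test(0.79, 0.59)
print(specific_test, sensitive_test, weak_test)
```

A test can satisfy both rules, one, or neither; a value such as 79% sensitivity with 59% specificity meets neither benchmark.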

From the meta-analysis portion of this review, the Hawkins-Kennedy test initially appears to be of value in ruling out subacromial impingement when negative. However, the LR− is poor and further, a strong argument can be made that subacromial impingement is not a valuable diagnosis but rather a cluster of diagnoses.63 The diagnosis of subacromial impingement encompasses such a broad range of pathologies, from bursitis to a complete rotator cuff tear,64 that a label of subacromial impingement may not help guide treatment.65 Yergason's test, used for detection of a SLAP lesion, has high (95%) pooled specificity. However, the sensitivity is so low that a positive test modifies the post-test probability of detecting a SLAP lesion only a small amount. In a similar vein to subacromial impingement, authors have argued that test results for SLAP may be affected by the percentage of different forms of Snyder classifications present within the sample.50

Therefore, the only tests that appear to have good clinical utility are the apprehension, relocation, and surprise tests to diagnose anterior instability and these tests are primarily a continuum of the apprehension test. When a patient registers apprehension with this test, the relocation manoeuvre should then decrease apprehension, whereupon, the relocation force is removed causing a surprised reaction (surprise test) by the patient as the apprehension reappears.

While the results of the meta-analysis were, perhaps, not inspiring to the clinician searching for diagnostic answers, there are some individual tests that warrant further investigation. The posterior apprehension test for posterior instability demonstrated a higher specificity and positive likelihood ratio but these values came from a high bias study.39 Another highly specific test, but from a low bias study,45 is the passive distraction test for a SLAP lesion. This test may rule in a SLAP lesion when positive. Sensitive tests of note are the shoulder shrug sign, for stiffness-related disorders (osteoarthritis and adhesive capsulitis) as well as rotator cuff tendinopathy, and the Whipple test for massive rotator cuff tears. However, the diagnostic properties of the Whipple test come from a high bias study.39 Other tests of possible value from high bias studies included the AC resisted extension,39 the resisted belly press,38 and coracoid palpation.48 There are six additional tests with higher sensitivities, specificities, or both but caution is urged since all of these tests have been studied only once and more than one ShPE test (eg, active compression, biceps load II) has been introduced with great diagnostic statistics only to have further research fail to replicate the results of the original authors. The belly-off and modified belly press tests for subscapularis tendinopathy, bony apprehension test for bony instability, olecranon-manubrium percussion test for bony abnormality, passive compression for a SLAP lesion, and the lateral Jobe test for rotator cuff tear give reason for optimism since they demonstrated both high sensitivities and specificities reported in low bias studies. Finally, one additional test was studied in two separate papers.35,50 The dynamic labral shear may be sensitive for SLAP lesions but, when modified, be diagnostic of labral tears generally.

Looking back to our initial publication and combining those data with the current review certainly expands the clinician's diagnostic arsenal. The external rotation lag sign continues to be recommended, as it was in our 2008 review,1 to confirm full-thickness rotator cuff tears of the infraspinatus. The hornblower's sign may be diagnostic of severe degeneration or absence of the teres minor muscle, and the active compression test, because of its high specificity, may have value as a confirmatory test for AC joint pathology when positive.

Despite some cause for optimism when looking at some of the individual studies and tests, the more prudent method may be to look at clusters or combinations of tests, since that more closely resembles the way in which most ShPE tests are used in the clinic. Table 4, while not all-inclusive, shows the best test combinations to date for detecting various pathologies.

Table 4. Best* test combinations and reported value for various pathologies

Unfortunately, even many of these test clusters modify the post-test probability by a small to minimal amount. Of note in this group of clustered tests are the combination of age >39, painful arc, and self-report of popping and clicking32 and the combination of the apprehension and relocation tests,68 which produce a large post-test shift toward the diagnoses of supraspinatus tendinopathy and anterior instability, respectively.
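To illustrate why the likelihood ratio of a test or cluster determines how far the post-test probability moves, the sketch below converts a pre-test probability to odds, multiplies by the likelihood ratio, and converts back to a probability. The pre-test probability of 0.30 and the likelihood ratios of 1.5 and 10 are assumptions chosen for illustration; they are not values reported for any cluster in Table 4.

```python
def post_test_probability(pre_test_probability, likelihood_ratio):
    """Update a pre-test probability with a likelihood ratio via the odds form of Bayes' theorem."""
    if not 0.0 < pre_test_probability < 1.0:
        raise ValueError("pre_test_probability must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("likelihood_ratio must be non-negative")
    # Probability -> odds
    pre_test_odds = pre_test_probability / (1.0 - pre_test_probability)
    # Post-test odds = pre-test odds x likelihood ratio
    post_test_odds = pre_test_odds * likelihood_ratio
    # Odds -> probability
    return post_test_odds / (1.0 + post_test_odds)


# Hypothetical inputs for illustration only.
pre_test = 0.30
modest_shift = post_test_probability(pre_test, likelihood_ratio=1.5)
large_shift = post_test_probability(pre_test, likelihood_ratio=10.0)
print(f"LR 1.5: {pre_test:.2f} -> {modest_shift:.2f}")
print(f"LR 10:  {pre_test:.2f} -> {large_shift:.2f}")
```

With the same assumed starting probability, a likelihood ratio near 1 barely changes the clinician's estimate, whereas a likelihood ratio of 10 moves it substantially.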

Limitations

Any review is limited by the quality of the studies contained therein. Many of the studies in this review had issues with the reference standard and with subject flow and timing. There was clearly a rise in the use of diagnostic ultrasound as a criterion standard, and evidence to support its use is currently poor.56–58 Further, we limited our articles to those in the English language, which may make this review more prone to dissemination bias. However, publication bias was not evident in either graphical or statistical analysis. Finally, this is the first meta-analysis on diagnostic accuracy of ShPE tests that was performed using the Q2 document. The original authors piloted the Q2 on five studies and found that reliability varied considerably.14 Our weighted κ (κ=0.31; 0.10, 0.52) was likewise only fair.
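For readers unfamiliar with the agreement statistic reported above, the following sketch shows one common way a weighted κ can be computed from a two-rater cross-tabulation of ordered ratings. It assumes linear disagreement weights, which this review does not specify, and the 3×3 table of counts is entirely hypothetical; it does not reproduce the Q2 ratings or the κ value from this study.

```python
def weighted_kappa(table, weights="linear"):
    """Weighted kappa for a square table of counts (rows: rater A, columns: rater B)."""
    k = len(table)
    if k < 2 or any(len(row) != k for row in table):
        raise ValueError("table must be square with at least two categories")
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        # 0 on the diagonal, 1 for the most distant pair of categories.
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    # Weighted disagreement actually observed between the two raters.
    observed = sum(disagreement(i, j) * table[i][j]
                   for i in range(k) for j in range(k)) / n
    # Weighted disagreement expected by chance from the marginal totals.
    expected = sum(disagreement(i, j) * row_totals[i] * col_totals[j]
                   for i in range(k) for j in range(k)) / (n * n)
    if expected == 0:
        raise ValueError("kappa is undefined when expected disagreement is zero")
    return 1.0 - observed / expected


# Hypothetical counts for two raters scoring 40 items on a 3-level ordered scale.
ratings = [
    [10, 3, 1],
    [4, 8, 3],
    [1, 2, 8],
]
kappa = weighted_kappa(ratings)
print(f"weighted kappa = {kappa:.2f}")
```

Passing any value other than "linear" for `weights` switches this sketch to quadratic weights, which penalise distant disagreements more heavily.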

Conclusions

Based on data from our original review1 and this update, the use of any single ShPE test to make a pathognomonic diagnosis cannot be unequivocally endorsed due to continued quality issues in publications. Some ShPE tests are beginning to stand the tests of scrutiny and time, but there are far more tests that need to be validated in more than one study. Combinations of ShPE tests provide better accuracy, but marginally so. These findings seem to provide support for stressing a comprehensive clinical examination including history and physical examination. However, there is a great need for large, prospective, well-designed studies that examine the diagnostic accuracy of the many aspects of the clinical examination and what combinations of these aspects are useful in differentially diagnosing pathologies of the shoulder.

Acknowledgments

The authors would like to acknowledge Ms Connie Schardt for her invaluable assistance in the search process and the authors from the original paper whose initial work was foundational: S Campbell, A Morin, M Tamaddoni, C T Moorman III.

References

1. Hegedus EJ, Goode A, Campbell S, et al. Physical examination tests of the shoulder: a systematic review with meta-analysis of individual tests. Br J Sports Med 2008;42:80–92.
2. Karlsson J. Physical examination tests are not valid for diagnosing SLAP tears: a review. Clin J Sport Med 2010;20:134–5.
3. Calvert E, Chambers GK, Regan W, et al. Special physical examination tests for superior labrum anterior posterior shoulder tears are clinically limited and invalid: a diagnostic systematic review. J Clin Epidemiol 2009;62:558–63.
4. Meserve BB, Cleland JA, Boucher TR. A meta-analysis examining clinical test utility for assessing superior labral anterior posterior lesions. Am J Sports Med 2009;37:2252–8.
5. Walton DM, Sadi J. Identifying SLAP lesions: a meta-analysis of clinical tests and exercise in clinical reasoning. Phys Ther Sport 2008;9:167–76.
6. Dessaur WA, Magarey ME. Diagnostic accuracy of clinical tests for superior labral anterior posterior lesions: a systematic review. J Orthop Sports Phys Ther 2008;38:341–52.
7. Munro W, Healy R. The validity and accuracy of clinical tests used to detect labral pathology of the shoulder–a systematic review. Man Ther 2009;14:119–30.
8. Moher D, Cook DJ, Eastwood S, et al. Improving the quality of reports of meta-analyses of randomised controlled trials: the QUOROM statement. Quality of Reporting of Meta-analyses. Lancet 1999;354:1896–900.
9. Moher D, Liberati A, Tetzlaff J, et al. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ 2009;339:b2535.
10. Harbord RM, Deeks JJ, Egger M, et al. A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics 2007;8:239–51.
11. Reitsma JB, Glas AS, Rutjes AW, et al. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol 2005;58:982–90.
12. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001;20:2865–84.
13. Whiting P, Rutjes AW, Dinnes J, et al. Development and validation of methods for assessing the quality of diagnostic accuracy studies. Health Technol Assess 2004;8:iii, 1–234.
14. Whiting PF, Rutjes AW, Westwood ME, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011;155:529–36.
15. Sackett D, Haynes R, Guyatt G, et al. Clinical Epidemiology: A Basic Science For Clinical Medicine. Second edition. Boston: Little Brown, 1991.
16. Dinnes J, Deeks J, Kirby J, et al. A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy. Health Technol Assess 2005;9:1–113, iii.
17. Harbord RM, Whiting P. Metandi: Meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata Journal 2009;9:211–229.
18. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin Trials 1986;7:177–88.
19. Rothman KJ. Writing for epidemiology. Epidemiology 1998;9:333–7.
20. Cox D. The analysis of binary data. London: Methuen, 1970.
21. Egger M, Davey Smith G, Schneider M, et al. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997;315:629–34.
22. Zamora J, Abraira V, Muriel A, et al. Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Med Res Methodol 2006;6:31.
23. Rabe-Hesketh S, Skrondol A, Pickles A. GLLAMM Manual: U.C. Berkeley Division of Biostatistics Working Paper Series, 2004:60.
24. Michener LA, Walsworth MK, Doukas WC, et al. Reliability and diagnostic accuracy of 5 physical examination tests and combination of tests for subacromial impingement. Arch Phys Med Rehabil 2009;90:1898–903.
25. Miller CA, Forrester GA, Lewis JS. The validity of the lag signs in diagnosing full-thickness tears of the rotator cuff: a preliminary investigation. Arch Phys Med Rehabil 2008;89:1162–8.
26. Kim YS, Kim JM, Ha KY, et al. The passive compression test: a new clinical test for superior labral tears of the shoulder. Am J Sports Med 2007;35:1489–94.
27. Fodor D, Poanta L, Felea I, et al. Shoulder impingement syndrome: correlations between clinical tests and ultrasonographic findings. Ortop Traumatol Rehabil 2009;11:120–6.
28. Jia X, Ji JH, Petersen SA, et al. Clinical evaluation of the shoulder shrug sign. Clin Orthop Relat Res 2008;466:2813–9.
29. Bushnell BD, Creighton RA, Herring MM. The bony apprehension test for instability of the shoulder: a prospective pilot analysis. Arthroscopy 2008;24:974–82.
30. Castoldi F, Blonna D, Hertel R. External rotation lag sign revisited: accuracy for diagnosis of full thickness supraspinatus tear. J Shoulder Elbow Surg 2009;18:529–34.
31. Silva L, Andréu JL, Muñoz P, et al. Accuracy of physical examination in subacromial impingement syndrome. Rheumatology (Oxford) 2008;47:679–83.
32. Chew K, Pua YH, Chin J, et al. Clinical predictors for the diagnosis of supraspinatus pathology. Physiotherapy Singapore 2010;13:12–17.
33. Bak K, Sørensen AK, Jørgensen U, et al. The value of clinical tests in acute full-thickness tears of the supraspinatus tendon: does a subacromial lidocaine injection help in the clinical diagnosis? A prospective study. Arthroscopy 2010;26:734–42.
34. Bartsch M, Greiner S, Haas NP, et al. Diagnostic values of clinical tests for subscapularis lesions. Knee Surg Sports Traumatol Arthrosc 2010;18:1712–7.
35. Ben Kibler W, Sciascia AD, Hester P, et al. Clinical utility of traditional and new tests in the diagnosis of biceps tendon injuries and superior labrum anterior and posterior lesions in the shoulder. Am J Sports Med 2009;37:1840–7.
36. Chen HS, Lin SH, Hsu YH, et al. A comparison of physical examinations with musculoskeletal ultrasound in the diagnosis of biceps long head tendinitis. Ultrasound Med Biol 2011;37:1392–8.
37. Fowler EM, Horsley IG, Rolf CG. Clinical and arthroscopic findings in recreationally active patients. Sports Med Arthrosc Rehabil Ther Technol 2010;2:2.
38. Goyal P, Hemal U, Kumar R. High resolution sonographic evaluation of painful shoulder. Internet Journal of Radiology 2010;12:22.
39. Jia X, Petersen SA, Khosravi AH, et al. Examination of the shoulder: the past, the present, and the future. J Bone Joint Surg Am 2009;91 Suppl 6:10–8.
40. Kelly SM, Brittle N, Allen GM. The value of physical tests for subacromial impingement syndrome: a study of diagnostic accuracy. Clin Rehabil 2010;24:149–58.
41. Kim HA, Kim SH, Seo YI. Ultrasonographic findings of painful shoulders and correlation between physical examination and ultrasonographic rotator cuff tear. Mod Rheumatol 2007;17:213–9.
42. Kim HA, Kim SH, Seo YI. Ultrasonographic findings of the shoulder in patients with rheumatoid arthritis and comparison with physical examination. J Korean Med Sci 2007;22:660–6.
43. Salaffi F, Ciapetti A, Carotti M, et al. Clinical value of single versus composite provocative clinical tests in the assessment of painful shoulder. J Clin Rheumatol 2010;16:105–8.
44. Walsworth MK, Doukas WC, Murphy KP, et al. Reliability and diagnostic accuracy of history and physical examination for diagnosing glenoid labral tears. Am J Sports Med 2008;36:162–8.
45. Schlechter JA, Summa S, Rubin BD. The passive distraction test: a new diagnostic aid for clinically significant superior labral pathology. Arthroscopy 2009;25:1374–9.
46. Gillooly JJ, Chidambaram R, Mok D. The lateral Jobe test: A more reliable method of diagnosing rotator cuff tears. Int J Shoulder Surg 2010;4:41–3.
47. Adams SL, Yarnold PR, Mathews JJ 4th. Clinical use of the olecranon-manubrium percussion sign in shoulder trauma. Ann Emerg Med 1988;17:484–7.
48. Carbone S, Gumina S, Vestri AR, et al. Coracoid pain test: a new clinical sign of shoulder adhesive capsulitis. Int Orthop 2010;34:385–8.
49. Ebinger N, Magosch P, Lichtenberg S, et al. A new SLAP test: the supine flexion resistance test. Arthroscopy 2008;24:500–5.
50. Cook C, Beaty S, Kissenberth MJ, et al. Diagnostic accuracy of five orthopedic clinical tests for diagnosis of superior labrum anterior posterior (SLAP) lesions. J Shoulder Elbow Surg 2012;21:13–22.
51. Gill HS, El Rassi G, Bahk MS, et al. Physical examination for partial tears of the biceps tendon. Am J Sports Med 2007;35:1334–40.
52. Kim E, Jeong HJ, Lee KW, et al. Interpreting positive signs of the supraspinatus test in screening for torn rotator cuff. Acta Med Okayama 2006;60:223–8.
53. Naredo E, Aguado P, De Miguel E, et al. Painful shoulder: comparison of physical examination and ultrasonographic findings. Ann Rheum Dis 2002;61:132–6.
54. Itoi E, Minagawa H, Yamamoto N, et al. Are pain location and physical examinations useful in locating a tear site of the rotator cuff? Am J Sports Med 2006;34:256–64.
55. Oh JH, Kim JY, Kim WS, et al. The evaluation of various physical examinations for the diagnosis of type II superior labrum anterior and posterior lesion. Am J Sports Med 2008;36:353–9.
56. de Jesus JO, Parker L, Frangos AJ, et al. Accuracy of MRI, MR arthrography, and ultrasound in the diagnosis of rotator cuff tears: a meta-analysis. AJR Am J Roentgenol 2009;192:1701–7.
57. Smith TO, Back T, Toms AP, et al. Diagnostic accuracy of ultrasound for rotator cuff tears in adults: a systematic review and meta-analysis. Clin Radiol 2011;66:1036–48.
58. Read JW, Perko M. Shoulder ultrasound: diagnostic accuracy for impingement syndrome, rotator cuff tear, and biceps tendon pathology. J Shoulder Elbow Surg 1998;7:264–71.
59. Sterne JA, Gavaghan D, Egger M. Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J Clin Epidemiol 2000;53:1119–29.
60. O'Brien SJ, Pagnani MJ, Fealy S, et al. The active compression test: a new and effective test for diagnosing labral tears and acromioclavicular joint abnormality. Am J Sports Med 1998;26:610–3.
61. Lo IK, Nonweiler B, Woolfrey M, et al. An evaluation of the apprehension, relocation, and surprise tests for anterior shoulder instability. Am J Sports Med 2004;32:301–7.
62. Rutjes AW, Reitsma JB, Di Nisio M, et al. Evidence of bias and variation in diagnostic accuracy studies. CMAJ 2006;174:469–76.
63. Neer CS 2nd. Impingement lesions. Clin Orthop Relat Res 1983;10:70–7.
64. Harrison AK, Flatow EL. Subacromial impingement syndrome. J Am Acad Orthop Surg 2011;19:701–8.
65. Dorrestijn O, Stevens M, Winters JC, et al. Conservative or surgical treatment for subacromial impingement syndrome? A systematic review. J Shoulder Elbow Surg 2009;18:652–60.
66. Guanche CA, Jones DC. Clinical testing for tears of the glenoid labrum. Arthroscopy 2003;19:517–23.
67. Litaker D, Pioro M, El Bilbeisi H, et al. Returning to the bedside: using the history and physical examination to identify rotator cuff tears. J Am Geriatr Soc 2000;48:1633–7.
68. Farber AJ, Castillo R, Clough M, et al. Clinical assessment of three common tests for traumatic anterior shoulder instability. J Bone Joint Surg Am 2006;88:1467–74.


Footnotes

Competing interests None.
Provenance and peer review Not commissioned; externally peer reviewed.
Correction notice This paper has been amended since it was published Online First. The complete list of authors was inadvertently omitted and this has now been rectified.

[1] ↵
Hegedus EJ,
Goode A,
Campbell S,
et al
. Physical examination tests of the shoulder: a systematic review with meta-analysis of individual tests. Br J Sports Med 2008;42:80–92.
OpenUrl Abstract/FREE Full Text

[2] Hegedus EJ,

[3] Goode A,

[4] Campbell S,

[5] et al

[6] ↵
Karlsson J
. Physical examination tests are not valid for diagnosing SLAP tears: a review. Clin J Sport Med 2010;20:134–5.

[7] Karlsson J

[8] ↵
Calvert E,
Chambers GK,
Regan W,
et al
. Special physical examination tests for superior labrum anterior posterior shoulder tears are clinically limited and invalid: a diagnostic systematic review. J Clin Epidemiol 2009;62:558–63.
OpenUrl CrossRef PubMed Web of Science

[9] Calvert E,

[10] Chambers GK,

[11] Regan W,

[12] et al

[13] ↵
Meserve BB,
Cleland JA,
Boucher TR
. A meta-analysis examining clinical test utility for assessing superior labral anterior posterior lesions. Am J Sports Med 2009;37:2252–8.
OpenUrl Abstract/FREE Full Text

[14] Meserve BB,

[15] Cleland JA,

[16] Boucher TR

[17] ↵
Walton DM,
Sadi J
. Identifying SLAP lesions: a meta-analysis of clinical tests and exercise in clinical reasoning. Phys Ther Sport 2008;9:167–76.
OpenUrl CrossRef PubMed Web of Science

[18] Walton DM,

[19] Sadi J

[20] ↵
Dessaur WA,
Magarey ME
. Diagnostic accuracy of clinical tests for superior labral anterior posterior lesions: a systematic review. J Orthop Sports Phys Ther 2008;38:341–52.
OpenUrl PubMed Web of Science

[21] Dessaur WA,

[22] Magarey ME

[23] ↵
Munro W,
Healy R
. The validity and accuracy of clinical tests used to detect labral pathology of the shoulder–a systematic review. Man Ther 2009;14:119–30.
OpenUrl CrossRef PubMed

[24] Munro W,

[25] Healy R

[26] ↵
Moher D,
Cook DJ,
Eastwood S,
et al
. Improving the quality of reports of meta-analyses of randomised controlled trials: the QUOROM statement. Quality of Reporting of Meta-analyses. Lancet 1999;354:1896–900.
OpenUrl CrossRef PubMed Web of Science

[27] Moher D,

[28] Cook DJ,

[29] Eastwood S,

[30] et al

[31] ↵
Moher D,
Liberati A,
Tetzlaff J,
et al
. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. BMJ 2009;339:b2535.

[32] Moher D,

[33] Liberati A,

[34] Tetzlaff J,

[35] et al

[36] ↵
Harbord RM,
Deeks JJ,
Egger M,
et al
. A unification of models for meta-analysis of diagnostic accuracy studies. Biostatistics 2007;8:239–51.
OpenUrl Abstract/FREE Full Text

[37] Harbord RM,

[38] Deeks JJ,

[39] Egger M,

[40] et al

[41] ↵
Reitsma JB,
Glas AS,
Rutjes AW,
et al
. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol 2005;58:982–90.
OpenUrl CrossRef PubMed Web of Science

[42] Reitsma JB,

[43] Glas AS,

[44] Rutjes AW,

[45] et al

[46] ↵
Rutter CM,
Gatsonis CA
. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med 2001;20:2865–84.
OpenUrl CrossRef PubMed Web of Science

[47] Rutter CM,

[48] Gatsonis CA

[49] ↵
Whiting P,
Rutjes AW,
Dinnes J,
et al
. Development and validation of methods for assessing the quality of diagnostic accuracy studies. Health Technol Assess 2004;8:iii, 1–234.
OpenUrl PubMed

[50] Whiting P,

[51] Rutjes AW,

[52] Dinnes J,

[53] et al

[54] ↵
Whiting PF,
Rutjes AW,
Westwood ME,
et al
. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 2011;155:529–36.
OpenUrl CrossRef PubMed Web of Science

[55] Whiting PF,

[56] Rutjes AW,

[57] Westwood ME,

[58] et al

[59] ↵
Sackett D,
Haynes R,
Guyatt G,
et al
. Clinical Epidemiology: A Basic Science For Clinical Medicine. Second edition. Boston: Little Brown, 1991.

[60] Sackett D,

[61] Haynes R,

[62] Guyatt G,

[63] et al

[64] ↵
Dinnes J,
Deeks J,
Kirby J,
et al
. A methodological review of how heterogeneity has been examined in systematic reviews of diagnostic test accuracy. Health Technol Assess 2005;9:1–113, iii.
OpenUrl PubMed

[65] Dinnes J,

[66] Deeks J,

[67] Kirby J,

[68] et al

[69] ↵
Harbord RM,
Whiting P
. Metandi: Meta-analysis of diagnostic accuracy using hierarchical logistic regression. Stata Journal 2009;9:211–229.
OpenUrl Web of Science

[70] Harbord RM,

[71] Whiting P

[72] ↵
DerSimonian R,
Laird N
. Meta-analysis in clinical trials. Control Clin Trials 1986;7:177–88.
OpenUrl CrossRef PubMed Web of Science

[73] DerSimonian R,

[74] Laird N

[75] ↵
Rothman KJ
. Writing for epidemiology. Epidemiology 1998;9:333–7.
OpenUrl PubMed

[76] Rothman KJ

[77] ↵
Cox D
. The analysis of binary data. London: Methuen, 1970.

[78] Cox D

[79] ↵
Egger M,
Davey Smith G,
Schneider M,
et al
. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997;315:629–34.
OpenUrl Abstract/FREE Full Text

[80] Egger M,

[81] Davey Smith G,

[82] Schneider M,

[83] et al

[84] ↵
Zamora J,
Abraira V,
Muriel A,
et al
. Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Med Res Methodol 2006;6:31.
OpenUrl CrossRef PubMed

[85] Zamora J,

[86] Abraira V,

[87] Muriel A,

[88] et al

[89] ↵
Rabe-Hesketh S,
Skrondol A,
Pickles A
. GLLAMM Manual: U.C. Berkeley Division of Biostatistics Working Paper Series, 2004:60.

[90] Rabe-Hesketh S,

[91] Skrondol A,

[92] Pickles A

[93] Michener LA,
Walsworth MK,
Doukas WC,
et al
. Reliability and diagnostic accuracy of 5 physical examination tests and combination of tests for subacromial impingement. Arch Phys Med Rehabil 2009;90:1898–903.
OpenUrl CrossRef PubMed Web of Science

[94] Michener LA,

[95] Walsworth MK,

[96] Doukas WC,

[97] et al

[98] ↵
Miller CA,
Forrester GA,
Lewis JS
. The validity of the lag signs in diagnosing full-thickness tears of the rotator cuff: a preliminary investigation. Arch Phys Med Rehabil 2008;89:1162–8.
OpenUrl CrossRef PubMed Web of Science

[99] Miller CA,

[100] Forrester GA,

[101] Lewis JS

[102] ↵
Kim YS,
Kim JM,
Ha KY,
et al
. The passive compression test: a new clinical test for superior labral tears of the shoulder. Am J Sports Med 2007;35:1489–94.
OpenUrl Abstract/FREE Full Text

[103] Kim YS,

[104] Kim JM,

[105] Ha KY,

[106] et al

[107] Fodor D,
Poanta L,
Felea I,
et al
. Shoulder impingement syndrome: correlations between clinical tests and ultrasonographic findings. Ortop Traumatol Rehabil 2009;11:120–6.
OpenUrl PubMed

[108] Fodor D,

[109] Poanta L,

[110] Felea I,

[111] et al

[112] ↵
Jia X,
Ji JH,
Petersen SA,
et al
. Clinical evaluation of the shoulder shrug sign. Clin Orthop Relat Res 2008;466:2813–9.
OpenUrl CrossRef PubMed Web of Science

[113] Jia X,

[114] Ji JH,

[115] Petersen SA,

[116] et al

[117] ↵
Bushnell BD,
Creighton RA,
Herring MM
. The bony apprehension test for instability of the shoulder: a prospective pilot analysis. Arthroscopy 2008;24: 974–82.
OpenUrl CrossRef PubMed

[118] Bushnell BD,

[119] Creighton RA,

[120] Herring MM

[121] ↵
Castoldi F,
Blonna D,
Hertel R
. External rotation lag sign revisited: accuracy for diagnosis of full thickness supraspinatus tear. J Shoulder Elbow Surg 2009;18:529–34.
OpenUrl CrossRef PubMed Web of Science

[122] Castoldi F,

[123] Blonna D,

[124] Hertel R

[125] Silva L,
Andréu JL,
Muñoz P,
et al
. Accuracy of physical examination in subacromial impingement syndrome. Rheumatology (Oxford) 2008;47:679–83.
OpenUrl Abstract/FREE Full Text

[126] Silva L,

[127] Andréu JL,

[128] Muñoz P,

[129] et al

[130] ↵
Chew K,
Pua YH,
Chin J,
et al
. Clinical predictors for the diagnosis of supraspinatus pathology. Physiotherapy Singapore 2010;13:12–17.
OpenUrl

[131] Chew K,

[132] Pua YH,

[133] Chin J,

[134] et al

[135] ↵
Bak K,
Sørensen AK,
Jørgensen U,
et al
. The value of clinical tests in acute full-thickness tears of the supraspinatus tendon: does a subacromial lidocaine injection help in the clinical diagnosis? A prospective study. Arthroscopy 2010; 26:734–42.
OpenUrl CrossRef PubMed

[136] Bak K,

[137] Sørensen AK,

[138] Jørgensen U,

[139] et al

[140] Bartsch M,
Greiner S,
Haas NP,
et al
. Diagnostic values of clinical tests for subscapularis lesions. Knee Surg Sports Traumatol Arthrosc 2010;18:1712–7.
OpenUrl CrossRef PubMed

[141] Bartsch M,

[142] Greiner S,

[143] Haas NP,

[144] et al

[145] ↵
Ben Kibler W,
Sciascia AD,
Hester P,
et al
. Clinical utility of traditional and new tests in the diagnosis of biceps tendon injuries and superior labrum anterior and posterior lesions in the shoulder. Am J Sports Med 2009;37:1840–7.
OpenUrl Abstract/FREE Full Text

[146] Ben Kibler W,

[147] Sciascia AD,

[148] Hester P,

[149] et al

[150] Chen HS,
Lin SH,
Hsu YH,
et al
. A comparison of physical examinations with musculoskeletal ultrasound in the diagnosis of biceps long head tendinitis. Ultrasound Med Biol 2011;37:1392–8.
OpenUrl CrossRef PubMed

[151] Chen HS,

[152] Lin SH,

[153] Hsu YH,

[154] et al

[155] ↵
Fowler EM,
Horsley IG,
Rolf CG
. Clinical and arthroscopic findings in recreationally active patients. Sports Med Arthrosc Rehabil Ther Technol 2010;2:2.
OpenUrl CrossRef PubMed

[156] Fowler EM,

[157] Horsley IG,

[158] Rolf CG

[159] ↵
Goyal P,
Hemal U,
Kumar R
. High resolution sonographic evaluation of painful shoulder. Internet Journal of Radiology 2010;12:22.
OpenUrl

[160] Goyal P,

[161] Hemal U,

[162] Kumar R

[163] ↵
Jia X,
Petersen SA,
Khosravi AH,
et al
. Examination of the shoulder: the past, the present, and the future. J Bone Joint Surg Am 2009;91 Suppl 6:10–8.
OpenUrl

[164] Jia X,

[165] Petersen SA,

[166] Khosravi AH,

[167] et al

[168] Kelly SM,
Brittle N,
Allen GM
. The value of physical tests for subacromial impingement syndrome: a study of diagnostic accuracy. Clin Rehabil 2010; 24:149–58.
OpenUrl Abstract/FREE Full Text

[169] Kelly SM,

[170] Brittle N,

[171] Allen GM

[172] ↵
Kim HA,
Kim SH,
Seo YI
. Ultrasonographic findings of painful shoulders and correlation between physical examination and ultrasonographic rotator cuff tear. Mod Rheumatol 2007;17:213–9.
OpenUrl CrossRef PubMed

[173] Kim HA,

[174] Kim SH,

[175] Seo YI

[176] ↵
Kim HA,
Kim SH,
Seo YI
. Ultrasonographic findings of the shoulder in patients with rheumatoid arthritis and comparison with physical examination. J Korean Med Sci 2007;22:660–6.
OpenUrl CrossRef PubMed

[177] Kim HA,

[178] Kim SH,

[179] Seo YI

[180] Salaffi F,
Ciapetti A,
Carotti M,
et al
. Clinical value of single versus composite provocative clinical tests in the assessment of painful shoulder. J Clin Rheumatol 2010;16:105–8.
OpenUrl CrossRef PubMed

[181] Salaffi F,

[182] Ciapetti A,

[183] Carotti M,

[184] et al

[185] Walsworth MK,
Doukas WC,
Murphy KP,
et al
. Reliability and diagnostic accuracy of history and physical examination for diagnosing glenoid labral tears. Am J Sports Med 2008;36:162–8.
OpenUrl Abstract/FREE Full Text

[186] Walsworth MK,

[187] Doukas WC,

[188] Murphy KP,

[189] et al

[190] ↵
Schlechter JA,
Summa S,
Rubin BD
. The passive distraction test: a new diagnostic aid for clinically significant superior labral pathology. Arthroscopy 2009; 25:1374–9.
OpenUrl PubMed Web of Science

[191] Schlechter JA,

[192] Summa S,

[193] Rubin BD

[194] ↵
Gillooly JJ,
Chidambaram R,
Mok D
. The lateral Jobe test: A more reliable method of diagnosing rotator cuff tears. Int J Shoulder Surg 2010;4:41–3.
OpenUrl CrossRef PubMed

[195] Gillooly JJ,

[196] Chidambaram R,

[197] Mok D

[198] ↵
Adams SL,
Yarnold PR,
Mathews JJ 4th.
. Clinical use of the olecranon-manubrium percussion sign in shoulder trauma. Ann Emerg Med 1988;17:484–7.
OpenUrl CrossRef PubMed

[199] Adams SL,

[200] Yarnold PR,

[201] Mathews JJ 4th.

[202] ↵
Carbone S,
Gumina S,
Vestri AR,
et al
. Coracoid pain test: a new clinical sign of shoulder adhesive capsulitis. Int Orthop 2010;34:385–8.
OpenUrl CrossRef PubMed

[203] Carbone S,

[204] Gumina S,

[205] Vestri AR,

[206] et al

[207] ↵
Ebinger N,
Magosch P,
Lichtenberg S,
et al
. A new SLAP test: the supine flexion resistance test. Arthroscopy 2008;24:500–5.
OpenUrl PubMed Web of Science

[208] Ebinger N,

[209] Magosch P,

[210] Lichtenberg S,

[211] et al

[212] ↵
Cook C,
Beaty S,
Kissenberth MJ,
et al
. Diagnostic accuracy of five orthopedic clinical tests for diagnosis of superior labrum anterior posterior (SLAP) lesions. J Shoulder Elbow Surg 2012;21:13–22.
OpenUrl CrossRef PubMed Web of Science

[213] Cook C,

[214] Beaty S,

[215] Kissenberth MJ,

[216] et al

[217] Gill HS,
El Rassi G,
Bahk MS,
et al
. Physical examination for partial tears of the biceps tendon. Am J Sports Med 2007;35:1334–40.
OpenUrl Abstract/FREE Full Text

[218] Gill HS,

[219] El Rassi G,

[220] Bahk MS,

[221] et al

[222] ↵
Kim E,
Jeong HJ,
Lee KW,
et al
. Interpreting positive signs of the supraspinatus test in screening for torn rotator cuff. Acta Med Okayama 2006;60:223–8.
OpenUrl PubMed Web of Science

[223] Kim E,

[224] Jeong HJ,

[225] Lee KW,

[226] et al

[227] ↵
Naredo E,
Aguado P,
De Miguel E,
et al
. Painful shoulder: comparison of physical examination and ultrasonographic findings. Ann Rheum Dis 2002; 61:132–6.
OpenUrl Abstract/FREE Full Text

[228] Naredo E,

[229] Aguado P,

[230] De Miguel E,

[231] et al

[232] ↵
Itoi E,
Minagawa H,
Yamamoto N,
et al
. Are pain location and physical examinations useful in locating a tear site of the rotator cuff? Am J Sports Med 2006;34:256–64.
OpenUrl Abstract/FREE Full Text

[233] Itoi E,

[234] Minagawa H,

[235] Yamamoto N,

[236] et al

[237] Oh JH,
Kim JY,
Kim WS,
et al
. The evaluation of various physical examinations for the diagnosis of type II superior labrum anterior and posterior lesion. Am J Sports Med 2008;36:353–9.
OpenUrl Abstract/FREE Full Text

[238] Oh JH,

[239] Kim JY,

[240] Kim WS,

[241] et al

[242] ↵
de Jesus JO,
Parker L,
Frangos AJ,
et al
. Accuracy of MRI, MR arthrography, and ultrasound in the diagnosis of rotator cuff tears: a meta-analysis. AJR Am J Roentgenol 2009;192:1701–7.
OpenUrl CrossRef PubMed Web of Science

[243] de Jesus JO,

[244] Parker L,

[245] Frangos AJ,

[246] et al

[247] ↵
Smith TO,
Back T,
Toms AP,
et al
. Diagnostic accuracy of ultrasound for rotator cuff tears in adults: a systematic review and meta-analysis. Clin Radiol 2011;66:1036–48.
OpenUrl CrossRef PubMed

[248] Smith TO,

[249] Back T,

[250] Toms AP,

[251] et al

[252] ↵
Read JW,
Perko M
. Shoulder ultrasound: diagnostic accuracy for impingement syndrome, rotator cuff tear, and biceps tendon pathology. J Shoulder Elbow Surg 1998;7:264–71.
OpenUrl CrossRef PubMed Web of Science

[253] Read JW,

[254] Perko M

[255] ↵
Sterne JA,
Gavaghan D,
Egger M
. Publication and related bias in meta-analysis: power of statistical tests and prevalence in the literature. J Clin Epidemiol 2000;53:1119–29.
OpenUrl CrossRef PubMed Web of Science

[256] Sterne JA,

[257] Gavaghan D,

[258] Egger M

[259] ↵
O'Brien SJ,
Pagnani MJ,
Fealy S,
et al
. The active compression test: a new and effective test for diagnosing labral tears and acromioclavicular joint abnormality. Am J Sports Med 1998;26:610–3.
Lo IK, Nonweiler B, Woolfrey M, et al. An evaluation of the apprehension, relocation, and surprise tests for anterior shoulder instability. Am J Sports Med 2004;32:301–7.
Rutjes AW, Reitsma JB, Di Nisio M, et al. Evidence of bias and variation in diagnostic accuracy studies. CMAJ 2006;174:469–76.
Neer CS 2nd. Impingement lesions. Clin Orthop Relat Res 1983;10:70–7.
Harrison AK, Flatow EL. Subacromial impingement syndrome. J Am Acad Orthop Surg 2011;19:701–8.
Dorrestijn O, Stevens M, Winters JC, et al. Conservative or surgical treatment for subacromial impingement syndrome? A systematic review. J Shoulder Elbow Surg 2009;18:652–60.
Guanche CA, Jones DC. Clinical testing for tears of the glenoid labrum. Arthroscopy 2003;19:517–23.
Litaker D, Pioro M, El Bilbeisi H, et al. Returning to the bedside: using the history and physical examination to identify rotator cuff tears. J Am Geriatr Soc 2000;48:1633–7.
Farber AJ, Castillo R, Clough M, et al. Clinical assessment of three common tests for traumatic anterior shoulder instability. J Bone Joint Surg Am 2006;88:1467–74.