Article Text

Download PDFPDF

Persistent effects of playing football and associated (subconcussive) head trauma on brain structure and function: a systematic review of the literature
  1. A A Tarnutzer1,2,
  2. D Straumann1,2,
  3. P Brugger1,
  4. N Feddermann-Demont1,2
  1. 1Department of Neurology, University Hospital Zurich and University of Zurich, Zurich, Switzerland
  2. 2Swiss Concussion Center, Schulthess Clinic, Zurich, Switzerland
  1. Correspondence to Dr A A Tarnutzer, Department of Neurology, University Hospital Zurich, Frauenklinikstr. 26, Zurich 8091, Switzerland; alexander.tarnutzer{at}access.uzh.ch, atarnutzer{at}gmail.com

Abstract

Aim/objective There is ongoing controversy about persistent neurological deficits in active and former football (soccer) players. We reviewed the literature for associations between football activities (including heading/head injuries) and decline in brain structure/function.

Design Systematic literature review.

Data sources MEDLINE, Embase, PsycINFO, CINAHL, Cochrane-CRCT, SportDiscus, Cochrane-DSR=4 (accessed 2 August 2016).

Eligibility criteria for selecting studies Original studies reporting on football-related persistent effects on brain structure/function. Results from neurocognitive testing, neuroimaging and EEG were compared with controls and/or correlated with heading frequency and/or head injuries. Methodological quality was rated for risk-of-bias, including appropriateness of controls, correction for multiple statistical testing and assessment of heading frequency and head injuries.

Results 30 studies with 1691 players were included. Those 57% (8/14) of case–control studies reporting persistent neurocognitive impairment had higher odds for inappropriate control of type 1 errors (OR=17.35 (95% CI (10.61 to 28.36)) and for inappropriate selection of controls (OR=1.72 (1.22 to 2.43)) than studies observing no impairment. Studies reporting a correlation between heading frequency and neurocognitive deficits (6/17) had lower quality of heading assessment (OR=14.20 (9.01 to 22.39)) than studies reporting no such correlation. In 7 of 13 studies (54%), the number of head injuries correlated with the degree of neurocognitive impairment. Abnormalities on neuroimaging (6/8 studies) were associated with subclinical neurocognitive deficits in 3 of 4 studies.

Summary/conclusions Various methodological shortcomings limit the evidence for persistent effects of football play on brain structure/function. Sources of bias include low-quality assessment of heading frequency, inappropriate control for type 1 errors and inappropriate selection of controls. Combining neuroimaging techniques with neurocognitive testing in prospective studies seems most promising to further clarify on the impact of football on the brain.

  • Football
  • Neurology
  • Sporting injuries
  • Trauma

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Concussions (ie, a subtype of mild traumatic brain injury (mTBI) without structural abnormalities on conventional CT or MRI)1 represent 1–5% of all football-related (soccer-related) injuries.2–5 While most players return to play within 7–10 days, head-trauma-related symptoms may last for weeks to months in 10–15% and even persist in selected cases.6 Neurodegenerative disorders (such as Alzheimer disease) have been reported in retired professional football players and in athletes from other contact sports as rugby and American football.7 ,8 A postulated association between football play and chronic traumatic encephalopathy, however, remains controversial,9 and the effect of football-related concussions is not well understood.

Likewise, the impact of purposeful heading the ball to play and guide its direction—unique to football—on the brain has been debated.10 ,11 On average, players head the ball 1–16 times during a competitive football match,12–16 accumulating over a season to several hundred headings17–19 and to many thousand headings during a professional football career. This has raised concerns that heading may—similar to boxers receiving punches to the head—pose players at increased risk for ‘subconcussive’ trauma,20–24 potentially resulting in neuronal damage similar to that in repetitive concussions but not accompanied by overt symptoms.20–23 ,25 These considerations have led to uncertainty in football players and their (medical) attendants,1 albeit such a link is far from being established and the impact of parameters such as heading technique, player's age and playing position remains unclear. Nonetheless, with raising concerns and facing a concussion litigation, the football federation of the USA issued in November 2015 a ban for heading in children aged 10 years or less and limited heading in children aged 11–13 years.26 Concerns that the maturing brain could be especially vulnerable to subconcussive head injury may have supported this decision.

Research interest in associations between concussion, heading and persistent changes of the human brain has grown substantially. At the end of the last century, a series of case–control studies indicated persistent neurocognitive impairments in Dutch professional16 ,19 and amateur27 football players. These studies were the basis for further investigations addressing functional, structural and metabolic brain changes in football players. While some studies confirmed neurocognitive abnormalities compared with controls,28 ,29 others found no such evidence.14 ,15 ,30 ,31 Likewise, associations between neurocognitive deficits and heading frequency were reported by some,16 ,17 ,19 ,28 ,29 but not by others.14 ,15 ,30–34

In accordance with neuroimaging for TBI,35 different protocols were applied to study structural (diffusion tensor imaging (DTI),36 voxel-based MR morphometry (VBM)) and metabolic (functional MRI (fMRI), magnetic resonance spectroscopy (MRS)) brain changes in football players and to correlate with neurocognitive function. In two small case–control studies, memory impairment was linked to cortical thinning in former professionals37 and to diffuse white matter abnormalities in amateur players,17 while recently in a prospective case–control study over 5 years in professional players, no such link could be drawn.38

In summary, whether or not football play is linked to persistent changes in brain function/structure remains controversial. Against this background, we aimed to systematically review the literature on associations between football play and persistent changes in brain function/structure and the impact of heading frequency and concussive head injuries. Assessing study quality and identifying methodological limitations using standardised tools for reporting risk-of-bias was a special focus.10

Materials and methods

Data sources and searches

A literature search (MEDLINE, Embase, PsycINFO, CINAHL, Cochrane CRCT, SportDiscus, Cochrane DSR=4) was performed (2 August 2016) to identify articles reporting on associations between football play and especially heading and football-related head injuries and persistent structural/functional changes of the brain. The MEDLINE (OVID) search strategy was translated for each database, and is reported in online supplementary file 1.

‘Persistent’ changes were defined as changes that were still recognisable >6 months after a potential impact or linked to exposure to football play for >6 months or a full season. We adhered to the time-frame usually applied for the persistent postconcussive syndrome,39 albeit no consensus-based definition of this term exists.

We also performed a manual search of reference lists from eligible articles. We did not seek to identify research abstracts from meeting proceedings or unpublished studies, nor non-English language studies. Retrospective or prospective studies with five or more participants were eligible. This review complies with PRISMA guidelines.40

Study selection

We identified 2191 citations for screening and included 30 studies for quantitative synthesis (figure 1) based on abstract and for selected studies full-text review by two experienced neurologists (NF-D, AAT). Articles were selected using predetermined criteria (see online supplementary file 1).

Figure 1

Flow chart depicting the selection process of identified articles. *One study was excluded because of duplicity of data;46 another study was excluded because of a ‘poor’ risk-of-bias rating (Newcastle–Ottawa Scale).45

Data extraction and quality assessment

Reports on neurocognitive testing, neuroimaging, postural control and EEG were considered. Data extraction was performed by AAT and confirmed by NF-D. When extracting data from selected studies, we assessed the type of study, the type of diagnostic tests performed, the frequency of heading and head injuries and the level of play, distinguishing between youth, high school/college (including interscholar), university, amateur, active professional and former professional players. In studies reporting on neurocognitive testing, all tests applied were retrieved and assigned to the category that best described the domain of neurocognitive function evaluated. Categories were: abstract reasoning, attention, (verbal) creativity and divergent thinking, decision-making, executive functions, intelligence, language and language-associated functions, memory/learning, mood, motor skills and visuospatial skills.

A standardised risk-of-bias assessment was performed using the Newcastle–Ottawa Scale (NOS).41 Its use for non-randomised case–control studies and observational studies has been promoted by the Cochrane Collaboration.42 The NOS requires rating the selection, comparability and exposure/outcome for a total of nine items. Study quality was rated as ‘good’, ‘fair’ or ‘poor’ (see ref. 43 and online supplementary file 2). Studies rated as ‘poor’ were excluded. The NOS included an assessment of the response rate when asked to participate. Studies with a low (<50%) participation rate, with participation rates differing >10% between football players and controls or studies that did not report response rates were rated as high risk for selection bias. Whenever non-football playing controls were available (n=21 studies), their suitability was rated. Only control groups that were age-matched and gender-matched and that participated in non-contact sports with a comparable physical activity profile (eg, swimming, track, tennis) were considered ‘appropriate’ or low risk for bias. A distinct (ie, lower/higher) physical activity profile may introduce a bias and observed differences may be attributed falsely to effects of football play. Controls falling short of these criteria were considered ‘inappropriate’ or high risk of bias.

Based on previously described methodological limitations, we further assessed the quality of included studies regarding assessment of heading frequency and history of head injuries and control for type 1 errors.10 Rutherford et al10 identified insufficient control for type 1 errors as a potential source for false-positive statistical differences. To follow-up on this limitation, we assessed methods for avoiding type 1 errors. Only studies that reported sufficient controlling for multiple testing (eg, by applying the Bonferroni correction) were considered ‘appropriate’ or low risk for type 1 errors, while studies falling short of these criteria were considered ‘inappropriate’ or high risk. Heading frequency and head injuries may be overstated or understated by players, posing them at risk for recall bias. Therefore, only studies that prospectively collected data on heading frequency (eg, by an independent observer) were considered low risk of bias, while studies falling short of these requirements (eg, relied on self-reported numbers, a heading exposure index44) were considered high risk. We did not require loss of consciousness for making the diagnosis of concussion and relied on the original study authors' assessment.

Data synthesis and analysis

Excel 2011 (Microsoft Corp, Redmont, USA) and Matlab V.7.0 (The MathWorks, Nantuck, USA) were used for data analyses. Statistical analyses were performed using two-sample t-tests (with the Bonferroni correction) and ORs including 95% CIs.

Results

From the 32 studies included for qualitative synthesis (figure 1), one was excluded because of ‘poor’ quality on NOS45 and one was removed because of duplicity of data.46 Among the 30 studies included for quantitative synthesis (n=1691, 22.4% females), only 6 were prospective. Twenty-three studies (76.7%, n=1518) reported on results of neurocognitive testing, while data on neuroimaging were provided in eight studies (26.7%, n=143). Information on EEG (6.7%, n=106) was available from two; postural stability (3.3%, n=15) was provided in one study (tables 1 and 2). Four studies reported on more than one modality (see online supplementary file 3). NOS ratings were ‘good’ and ‘fair’, respectively, in 15 studies each (see online supplementary file 2). Key domains for neurocognitive testing of (sub)concussive brain injury (attention, executive functions, memory) were assessed by 18 of 23 studies, while in the remaining 5 studies, 1 (n=3) or 2 (n=2) key domains were missing.

Table 1

Metadata of included studies (n=30)

Table 2

Summary information about included studies (n=30)

Case–control studies reporting on neurocognitive testing

Fourteen studies compared neurocognitive test results in football players (n=581) with those from controls (n=348). On average, 8.7±5.8 tests covering 4.4±1.9 categories were administered. Eight studies (57.1%) reported significantly lower results for the football players than for the controls in at least one test (2.8±2.7 tests, average±1 SD). Most frequently, deficits of attention, executive function and memory were noted (table 3 and figure 2).

Table 3

Studies reporting on neurocognitive testing (NCT) in football players*

Figure 2

Spider plots illustrating in how many studies the different neurocognitive categories were evaluated (in red) and how often an abnormal test result was retrieved in this category (in blue). The number of studies is provided along the intersection of the web. Separate plots are provided for all case–control studies (A), for all studies investigating the impact of heading on neurocognitive tests (B) and for all studies reporting on the impact of head injuries on cognition (C). Irrespective of the study categorisation, attention, executive functions, memory and visuospatial functions were the cognitive domains most frequently tested and also most frequently impaired.

Studies reporting neurocognitive deficits had a higher rate of inappropriate control of type 1 errors (OR=17.35 (95% CI (10.61 to 28.36)) and higher odds for inappropriate controls (OR=1.72 (1.22 to 2.43)) than studies not reporting any differences. The fraction of female players was about the same for both groups (OR=0.82 (0.47 to 1.47)). The fraction of younger players (youth/high school/college) compared with more elderly players was larger in studies with negative findings than in those with significant deficits (OR=1.92 (1.38 to 2.68)) (table 3).

Impact of heading on neurocognitive functions

A potential link between heading exposure and performance on neurocognitive testing was analysed in 17 studies (n=1173, 26.4% females). On average, 8.5±5.1 neurocognitive tests covering 4.4±1.8 categories were obtained (table 3). Two-thirds of the studies did not find any relation between heading frequency and neurocognitive test performance, while six studies (35%) reported a correlation in 2.0±1.1 tests. Deficits of attention, memory and intelligence were most frequent (table 4). The quality of heading frequency assessment was lower for those studies reporting a link than for those studies without such a link (OR=14.2 (9.0 to 22.4)). The rate of inappropriate control of type 1 errors was similar in studies confirming or discarding such a link (OR=0.74 (0.53 to 1.04)). The fraction of players with more extensive exposure (amateurs, university, (former) professionals) was significantly higher among those studies reporting neurocognitive deficits than for those studies that did not observe deficits (OR=2.15 (1.63 to 2.85)).

Table 4

Distribution of neurocognitive testing categories (in alphabetical order) and percentage of abnormal tests*

Impact of head injuries on neurocognitive functions

A potential association between previous head injuries and neurocognitive deficits was investigated in 13 studies (n=1103, 26.6% females) (table 3). On average, 11.3±4.9 neurocognitive tests covering 5.2±1.4 categories were obtained. Seven studies (54%) reported a correlation, with abnormalities noted in 2.4±1.8 tests. Deficits of visuospatial functions, decision-making, attention and executive function were most frequent (table 4). The rate of inappropriate control of type 1 errors was lower in studies with positive findings compared with studies with negative findings (OR=0.20 (0.15 to 0.26)).

Data on previous head injuries were available in 10 studies with average numbers of concussions ranging between 1.0 and 2.1, with the most recent event between 6 and 8 months and several years ago. The assessment of previous head injuries was based on players' reports in all but two studies.13 ,49 Only football-related concussions were considered in five of seven studies with positive findings and in three of six studies with negative findings, while the remaining five studies included other, non-football-related concussions as well or did not further specify.

Neuroimaging studies

We identified eight studies (n=143, 15.4% females) using imaging modalities focusing on brain structure (conventional MRI, VBM, DTI) or brain metabolism (fMRI, MRS). DTI (2 studies, n=49), conventional MRI (2 studies, n=44) and VBM (2 studies, n=25) were most frequently applied (table 5). All studies used a case–control design with selection of controls rated as ‘appropriate’ in six (75%). Most players were professionals (active=56; former=26) or amateurs (n=37). Only two studies were prospective.38 ,54 In one prospective study, no conventional MRI changes could be depicted in professional players (observation period=5 years).38 Prospectively observing female high-school players over one season using fMRI, significant reductions in frontotemporal cerebrovascular reactivity persisting up to 4–5 months after the season had ended were reported,54 resembling the pattern described in mTBI.60 ,61 Retrospectively, in former professionals, VBM demonstrated cortical thinning in the right inferolateral parietal, temporal and occipital cortex37 and MRS showed higher choline and myo-inositol levels in the posterior cingulate gyrus.51 In professional players, DTI indicated widespread white matter abnormalities (albeit no changes in fractional anisotropy),50 but conventional MRI did not demonstrate changes related to the years of football participation.44 In college-football players, VBM showed decreased grey matter density and volume within the anterior–temporal cortex.48

Table 5

Persistent effects of football on the brain: neuroimaging studies (n=8)

Four studies linked neuroimaging with neurocognitive data.17 ,37 ,38 ,51 Cortical thinning was associated with worse performance on 1 of 6 tests (Rey-Osterrieth complex-figure long-delay recall),37 Glutathion levels were linked to inferior results in 1 of 4 tests (trail making test B)51 and lower levels of fractional anisotropy in parieto-occipital areas were associated with 1 of 6 tests (poorer memory).17 In the only prospective study, neither changes in neurocognitive performance nor in conventional MRI could be depicted over an observation period of 5 years.38

A link between heading exposure and structural/metabolic neuroimaging changes was investigated in five studies.17 ,37 ,44 ,51 ,54 Lifetime estimates of heading numbers were inversely correlated with cortical thickness in the right parietal/occipital lobes37 and with myoinositol and glutathione levels.51 Fractional anisotropy levels in temporoparietal white matter were inversely correlated with the annual number of headings.17 A high cumulative head acceleration exposure was linked to more profound reductions in cerebrovascular reactivity, outlasting the end of the season by 4–5 months before returning to baseline by month 8.54 No correlation between career heading exposure and abnormalities on conventional MRI were reported in another study.44 The potential impact of remote head injuries on brain structure was examined in two studies, both demonstrating no association.17 ,44

EEG studies

Two studies used EEG in active (n=69)55 and former (n=37)56 professional male players. In both studies, standard EEG recordings were examined by a clinical neurophysiologist and EEGs were classified as ‘normal’, ‘slightly abnormal’ or ‘abnormal’ based on background activity and α-activity. EEG ratings in the players were compared with those in age-matched men of ‘various occupations’. With information on matched physical activities lacking in the controls, their quality was rated ‘inappropriate’. The rate of EEGs considered normal was lower in active and former players compared with controls. Among the active players, all abnormal EEGs were observed in players who considered themselves as non-headers.55 Among former players, there were no EEG differences between headers and non-headers.56

Postural stability

One study (n=15) reported on balance, using the balance error scoring system.37 This study described no significant differences between players and controls.

Discussion

With the recently issued ban for heading in child-football players in the USA,26 the ongoing debate about potential persistent effects of football and football-related (subconcussive) trauma on brain function received increased attention and caused uncertainty among football players, medical staff and media. Given the worldwide popularity of football,62 football-related health issues may have far-reaching implications that have to be balanced and compared with benefits due to regular activity. This emphasises the need to intensify hypothesis-driven research and the study of associations between football play and persistent structural/functional changes of the brain.

Under-representation of female players

For most aspects evaluated, female players were in a minority, consistent with reportedly lower numbers of active female football players.62 While no conclusions could be drawn on football-related changes in neuroimaging and EEG, women were under-represented in studies that reported neurocognitive impairment compared with those not observing such deficits. This observation was unexpected since the rate of football-related head injuries was reportedly higher in women.63–66 Of note, none of the studies reported on (former) professional female players. Also, for studies reporting on neurocognitive testing, female players were over-represented in lower levels of play (youth, high school/college) compared with higher levels (university, amateur, (former) professional) (OR=28.57 (19.25 to 42.41)). These observations suggest that cumulative exposure to football play or cumulative intensity has been lower in female players, not reaching levels that may be necessary to result in brain abnormalities. Future studies should pay special attention on functional/structural brain changes in female players with more extensive football exposure.

Neurocognitive testing in (former) football players

Applied in 77% of studies, neurocognitive testing remains the most common approach to investigate potential associations between football play and changes in brain function. Most studies dealt with effects of heading (74%) and head injuries (57%). Over 60 different neurocognitive tests were used, most of them only in few studies. With all three key domains (attention, executive function and memory) assessed by 78% of studies, risk for false-negative results due to inappropriate selection of neurocognitive test domains seems low. Even for the most frequently used tests in those domains considered most important in patients with (sub)concussive brain injury (see online supplementary file 4), the fraction of abnormal test results was low (0.29±0.18). This suggests that reported neurocognitive impairments were rather subtle and their detection may have depended on study-specific parameters as age, gender, level of play and selection of controls. Also, among all tests administered in a given study, those with abnormal outcome were infrequent (fraction=0.21±0.27). In 19 of 23 studies, more than one neurocognitive test was applied to evaluate a single category (eg, TMT-B and Stroop for executive functions). Noteworthy, in 42% of these studies, discrepant test results in a given category were noted. This affected 37% (17/46) of all categories in studies that received multiple testing. These results suggest that changes are subtle and may get identified only by some tests. Moreover, these inconsistencies underline the importance of standardised neurocognitive testing in football.

Persistent neurocognitive changes in (former) players compared with controls

Fifty-seven per cent of case–control studies reported persistent associations between football and neurocognitive impairment focusing on attention, executive function and memory. These categories are primarily mediated by the frontal and temporal lobes and are typically involved in mTBI.67 Based on the quality assessment performed, several confounders must be considered to put the significance of these observations in context. Probably of the most far-reaching implication is the finding that the rate of appropriate control for type 1 errors was smaller among studies with abnormal test results (OR=17.35 (10.61 to 28.36)). These studies thus bear an increased risk for false-positive test results. Choosing the right controls is essential. Including control participants without matching the profile of physical activity might point to global effects of physical activity rather than to football-related changes.14 While in our review, controls were judged as ‘appropriate’ in 67% of case–control studies, the odds for inappropriate controls were higher for studies reporting neurocognitive deficits (OR=1.72 (1.22 to 2.43)). This indicates that inappropriate selection of control participants may represent a serious source of bias. As a consequence, caution is warranted when interpreting impairment in neurocognitive testing based on the existing literature. Noteworthy, with the fraction of younger players being larger in studies with negative findings than in those with significant deficits, this might suggest that the exposure duration was simply not long or intense enough to cause a significant effect, further limiting conclusions.

No clear evidence for heading-related persistent impairment of neurocognitive function

Six of 17 studies that correlated heading frequency with neurocognitive deficits reported a link (mostly for attention, executive functions and memory), but these studies also contained more methodological limitations than those reporting no link. Most importantly, the assessment quality of heading frequency was lower in studies with positive findings (OR=14.2 (9.0 to 22.4)). Self-reported heading frequencies tend to be higher than those obtained by more reliable approaches,68 indicating potential risk of reporting bias. This emphasises the need for prospective observer-based assessments of heading frequency in future studies. Furthermore, studies so far remained incomplete in providing an accurate estimate of heading exposure, since heading during practice sessions was not considered and other variables such as heading technique and ball properties and ball velocity were not available. Studies focusing on former players with heading exposure decades ago45 ,56 ,57 may also be biased by differences in the properties of the ball—heavy leather balls, in common use until the mid-1970s and even heavier on wet undergrounds, should not be compared with the more lightweight balls used thereafter.

The ratio of more senior to more junior players was higher in studies reporting significant impact of heading than in those with negative findings (OR=2.15 (1.63 to 2.85)). For more senior players, the lifetime heading number is higher and the duration of exposure is longer. Current studies therefore cannot exclude that in more senior players, neurocognitive deficits may eventually arise due to a decreased cerebral reserve capacity in view of accumulated (sub)clinical head trauma.14 ,69 Of note, in retired professional UK-football players, no signs for accelerated cognitive decline were found.34 Furthermore, female players were under-represented in studies that noted a link between heading frequency and neurocognitive impairments. A lower heading frequency in female players18 and the skewed distribution of female players towards lower levels of play may explain this seemingly ‘protective effect’ of female gender on heading related neurocognitive impairments.

Based on our review, no firm conclusions on an association between heading frequency and accelerated neurocognitive decline can be drawn. Methodological limitations identified more often in studies with positive findings emphasise caution in linking heading to persistent neurocognitive deficits. This extends conclusions of a previous review on acute and persistent effects of concussions and heading in football.70 Whether or not significant differences in neurocognition arise with increasing heading exposure awaits further clarification, especially in former professional players with extensive exposure and better control for head injury.

Repetitive concussions may be linked to persistent cognitive impairment

A negative impact of (repetitive) head injuries to brain function has been reported for other contact sports as American football,71 ,72 rugby52 or boxing.73–75 For football (soccer) play, Barnes et al76 estimated a 50% risk that a professional male player will suffer a concussion within a 10-year period, while the corresponding figure for female players was 22%. Neuropathological changes associated with mTBI are axonal injuries with a focus on the orbitofrontal and temporal polar zones.67 Cognitive functions affected most are delayed memory, executive functions, language and attention.77 ,78 Based on our review, 54% of studies addressing the influence of head injuries on neurocognitive test performance found a link in one or more categories. Seventy-one per cent of studies with positive findings restricted their analysis to football-related concussions, which indicates a low risk that football-unrelated concussions biased the results. Furthermore, the rate of studies with inappropriate control of type 1 errors was even lower among studies with positive findings compared with those with negative findings.

Distinguishing between effects secondary to heading and head trauma,11 ,32 however, may be difficult or even impossible for several reasons: first, up to 50% of concussions are not reported by players or team physicians.63 ,79 ,80 Second, 89% of studies controlled only for recent (3–6 months) head trauma or did not take head trauma into account at all. Most studies relied on self-reporting of head injuries, introducing risk of recall bias. Even more importantly, lack of control for head injuries bears the risk that effects of heading and head trauma are mixed as players with higher heading frequencies tend to experience head injuries more frequently.14 ,19 ,32 Third, definitions for football-related head injuries have been applied differently,52 potentially resulting in inconsistencies between studies. Assuming that effects of heading and head injury may have been intermingled, one would expect overestimating the link between heading frequency and neurocognitive deficits. In fact, the opposite was true; refuting the assumption that head-injury-related effects have significantly biased results in the assessment of heading-related persistent effects. In summary, the link between head injuries and persistent neurocognitive impairment was moderate only and its impact may have been overestimated due to several methodological shortcomings.

Structural and metabolic changes on neuroimaging

Neuroimaging in football players was driven by the hypothesis that repetitive subconcussive head trauma result in structural/metabolic changes similar to those known from mTBI.81 ,82 Along with the anterior–posterior gradient in brain vulnerability,83 anterior regions may also be linked to executive and neurocognitive deficits in mTBI.84

With a limited number of study participants (n=143) and studies (n=8), different neuroimaging modalities and levels of play, all but two studies reported significant brain changes in football players. These changes were localised preferentially to frontal and anterior–temporal regions.45 ,48 ,50 ,51 ,54 Structural abnormalities located in parieto-occipital areas,17 ,37 that is, in areas opposite to the presumed point of heading impact, may be explained in analogy to the principle of contre-coup injury.17

In 3 studies with 63 players, changes depicted on neuroimaging could be linked to subclinical neurocognitive deficits, suggesting that these changes may be functionally relevant.17 ,37 ,51 While these studies used advanced neuroimaging, no changes could be observed on conventional MRI in the only prospective study (5-year observation period). Whereas there is extensive experience in the interpretation of standard MR sequences, changes in advanced imaging as DTI, VBM and MR spectroscopy are much more difficult to put into clinical context due to their relatively recent use.

A correlation between heading exposure and persistent changes on neuroimaging was observed in four out of five studies,17 ,37 ,44 ,51 ,54 albeit reversible within 8 months after season end in the only prospective study.54 While these findings suggest a possible link between heading frequency and neurodegeneration, the heading exposure assessment was low quality in four out of five studies, posing them at increased risk for recall bias. Correction for multiple statistical testing was reported by Svaldi et al54 and in one study by Koerte et al,37 but not in another,51 indicating possible increased risk of false-positive correlations. In the study by Lipton et al,17 a link was observed only for players with more than 885–1500 headings per year, suggesting that heading below this threshold may be safe. Overall, taking the methodological limitations and the small sample sizes into consideration, support for a link between heading frequency and persistent structural brain changes seems weak. Our review did not find any evidence for an association between (repetitive) concussions and structural brain changes on neuroimaging, albeit small sample sizes limit conclusions.17 ,44

Low quality of EEG-based studies

Both studies included reported higher incidences of EEG abnormalities in male professional players than in controls.55 ,56 With only 106 players analysed, conclusions on persistent effects of football on EEG can be considered preliminary only. Caution is warranted as significant limitations were identified: control groups were rated ‘inappropriate’ and the authors did not control for remote head injury. The clinical implications of the higher incidence of EEG abnormalities remain unclear, especially since no information on the location of EEG abnormalities was provided. Both studies did not provide any evidence for a link between heading and EEG changes as in the group of active football players, all abnormal EEGs were observed in players who considered themselves as non-headers.55 In the group of former players, no EEG differences were detected comparing headers and non-headers.56 Nonetheless, the authors concluded that the reason for the EEG abnormalities was most likely repeated minor head trauma. Considering the limitations listed above, evidence in support of this conclusion is unconvincing.

Limitations

Studies varied in the criteria for reporting previous concussions and we were lacking details to validate the ratings. Self-reporting was standard in most studies, posing them at high risk for recall bias. Therefore, the impact of previous concussions on brain structure/function may have been overestimated or underestimated in our review. The impact of heading depends on many parameters. However, only scarce information about player's position, circumstances of heading (purposeful, passively hit) and heading skills could be retrieved. This limits the assessment of a possible association between heading and structural/functional brain changes. Lack of systematically controlling for timing and time lags between exposure and testing potentially weakens reported associations. Likewise, failure to assess and/or correct for potential confounders as concussions unrelated to football, different physical activity profiles, medical conditions (hypertension, overweight, diabetes) and lifestyle (eg, alcohol consumption, smoking) may have biased associations between (former) football play and structural/functional brain changes.

A pooled analysis of individual study data (ie, a meta-analysis) was not possible due to the heterogeneity in study design, data collection and data analysis. This may have resulted in the incorrect detection or blinding of high-quality studies due to a higher number of low-quality studies.

We did not identify any studies that compared the diagnostic accuracy of different neurocognitive testing procedures that assessed the same neurocognitive domains. Moreover, single studies obtaining more than one neurocognitive test for a specific domain often demonstrated discrepancies. This limits any recommendation on specific neurocognitive tests and emphasises the need for prospective, controlled studies comparing the diagnostic accuracy of neurocognitive tests.

Conclusions

There is weak to non-existent evidence from the medical literature for football-related persistent functional and structural brain deficits and a putative role of (repetitive) head trauma in the development of neurocognitive impairment. Comparison of included studies was limited by various methodological shortcomings. Case–control studies reporting neurocognitive deficits in football players significantly more often included inappropriate controls and control of type 1 errors than studies reporting no deficits. Furthermore, no clear link between heading frequency and neurocognitive deficits could be established and low-quality assessment of heading frequency was identified as the major confounder. Of special interest were studies that combined different modalities: while in four out of five neuroimaging studies, structural and metabolic deficits could be correlated with heading exposure, the clinical and preventive implications of these findings remain inconclusive, as most studies used a low-quality assessment of heading frequency. In three out of four small case–control studies, a link between neuroimaging abnormalities and subclinical neurocognitive deficits could be established, suggesting that these morphological and metabolic changes might be functionally relevant.

Further studies combining functional and structural modalities in larger numbers of football players with long-lasting football exposure appear most promising to shed more light on a potential link between football play and brain structure/function. Such studies should include neurocognitive testing of attention, executive functions and memory as well of objective tests of cervical, vestibular and ocular motor function. They should also be prospective to appropriately control for confounders as history of head injuries, heading frequency and medical conditions. Further validation and head-to-head comparison is required to provide the basis of more standardised testing batteries to improve the quality of studies and to allow for better comparability between studies. A longitudinal and cross-sectional study design will help to determine whether identified subclinical structural and functional abnormalities eventually progress to clinically relevant, symptomatic deficits or rather resolve again, especially after finishing exposure to football play.

What is already known?

  • Repetitive head injuries and heading the ball were suggested to be linked to persistent neurocognitive impairments and structural brain abnormalities in advanced neuroimaging in professional and amateur football players.

  • However, there is ongoing controversy to what extent these findings are real or rather result from methodological limitations.

  • A systematic assessment of existing studies using prospectively defined criteria is therefore needed to improve our understanding of persistent effects of football on the brain and to better estimate the role of methodological limitations in previous studies.

What are the findings?

  • While the majority of studies, addressing the effect of football play and football-related injuries on neurocognitive functions, reported significant impairment in at least one domain, methodological shortcomings were found to be more frequent in studies with reportedly significant findings.

  • Evidence for a correlation between heading frequency and neurocognitive deficits was weak and likely biased by inaccurate heading frequency estimates.

  • Although the rate of football-related head injuries was reportedly higher in women than men, women were under-represented in studies that reported neurocognitive impairment compared with those studies that did not identify deficits, which may be related to the fact that none of these studies included (retired) female professional football players.

  • Combining neuroimaging and neurocognitive testing in prospective longitudinal and cross-sectional studies in male and female players to link structural and functional deficits seems most promising to further clarify associations between football play and brain abnormalities.

Acknowledgments

The authors thank Dr K Alix Hayden (Libraries & Cultural Resources University of Calgary, Calgary, Canada) for optimising the search strategy and performing the literature search.

References

Footnotes

  • Contributors AAT conceived of the study, designed the search strategy, selected suitable articles, extracted and analysed the data, drafted the manuscript and approved the final version of the manuscript. PB was involved in the analysis of the data, critically edited the manuscript and approved the final version of the manuscript. DS was involved in conceiving the study design, critically edited the manuscript and approved the final version of the manuscript. NF-D conceived of the study, was involved in designing the search strategy, selected suitable articles, critically edited the manuscript and approved the final version of the manuscript.

  • Competing interests None declared.

  • Provenance and peer review Not commissioned; externally peer reviewed.