Background Type XI collagen, which is expressed in developing tendons and is encoded by the COL11A1, COL11A2 and COL2A1 genes, shares structural and functional homology with type V collagen, which plays an important role in collagen fibril assembly. We investigated the association of these three polymorphisms with Achilles tendinopathy (AT) and whether these polymorphisms interact with COL5A1 to modulate the risk of AT.
Methods 184 participants diagnosed with chronic AT (TEN) and 338 appropriately matched asymptomatic controls (CON) were genotyped for the three polymorphisms.
Results Although there were no independent associations with AT, the TCT pseudohaplotype constructed from rs3753841 (T/C), rs1676486 (C/T) and rs1799907 (T/A) was significantly over-represented (p=0.006) in the TEN (25.9%) compared with the CON (17.1%) group. The TCT(AGGG) pseudohaplotypes constructed using these type XI collagen polymorphisms and the functional COL5A1 rs71746744 (-/AGGG) polymorphism were also significantly over-represented (p<0.001) in the TEN (25.2%) compared with the CON (9.1%) group.
Discussion The genes encoding structural and functionally related type XI (COL11A1 and COL11A2) and type V (COL5A1) collagens interact with one another to collectively modulate the risk for AT. Although there are no immediate clinical applications, the results of this study provide additional evidence that interindividual variations in collagen fibril assembly might be an important molecular mechanism in the aetiology of chronic AT.
- Sporting injuries
- Achilles tendon
- Genetics/sex testing
Statistics from Altmetric.com
Injury to the midsubstance of the Achilles tendon is a multifactorial condition, resulting from the poorly understood interactions of several extrinsic and intrinsic risk factors.1 The genetic profile of an individual is an important intrinsic risk factor.2 Although DNA sequence variants within several genes have been independently associated with the risk of chronic Achilles tendinoapthy,2 ,3 one of these most extensively studied genes to date is COL5A1.4–7 COL5A1 encodes the α1 chain of type V collagen, which plays an important role in collagen fibril nucleation and the regulation of fibril diameter (fibrillogenesis).8 Several variants within the 3′-untranslated region (3′-UTR) of COL5A1 have been associated with Achilles tendinopathy.5 ,7 These variants are believed to be functional, altering COL5A1 messenger RNA (mRNA) stability within the cytoplasm of the tenocyte.6 Increased mRNA stability has been associated with tendinopathy, which is believed to result in increased α1(V) chain and type V collagen production, decreasing the fibril diameter and packing density, and potentially altering the mechanical properties of the tendon.9
Type XI collagen, a heterotrimer consisting of α1(XI), α2(XI) and α3(XI) chains encoded by the COL11A1 (chromosome 1p21), COL11A2 (chromosome 6p21.3) and COL2A1 (chromosome 12q13.11) genes, respectively,10 shares structural and functional homology with type V collagen.11 Although predominately expressed in cartilage,11 type XI collagen is also expressed in developing tendons.12
Mutations in all the three genes have been implicated in various inherited Mendelian connective tissue disorders,13 and polymorphisms within these genes have also been associated with multifactorial musculoskeletal injuries and connective tissue disorders.14–21
Single nucleotide polymorphisms (SNPs), namely COL11A1 rs3753841 (T/C), COL11A1 rs1676486 (C/T) and/or COL11A2 rs1799907 (T/A), have been associated with lumbar disc herniation,18 limbus vertebra in gymnasts,21 lumbar spine stenosis,17 ossification of the posterior longitudinal ligament of the spine14–16 and rheumatoid arthritis.19
Therefore, we investigated the association of the COL11A1 polymorphisms rs3753841 and rs1676486, as well as the COL11A2 rs1799907 polymorphism with Achilles tendinopathy in a South African and an Australian population. A secondary aim of this study was to investigate whether the COL11A1 and COL11A2 polymorphisms interact with the previously investigated rs71746744 (-/AGGGG) polymorphism within the COL5A1 3′-UTR7 to modulate the risk of Achilles tendinopathy.
Participants and DNA extraction
A total of 267 unrelated, physically active participants from South Africa, consisting of 161 asymptomatic controls (SA CON) and 106 individuals diagnosed with chronic Achilles tendinopathy (SA TEN), were recruited as previously described.4 ,22 The SA TEN group was recruited from Sports Medicine clinical practices in Cape Town and Johannesburg, South Africa. The diagnosis was confirmed by a sports physician, using the previously described inclusion and exclusion criteria.22 The SA CON participants reported no history of tendon pathology and were recruited from various recreational sporting clubs. A total of 255 participants from Australia, consisting of 177 asymptomatic controls (AUS CON) and 78 patients with chronic Achilles tendinopathy (AUS TEN) were recruited as previously described.5 The diagnosis of the tendinopathy participants, who were recruited from the Musculoskeletal Research Centre at La Trobe University in Melbourne, Australia, was confirmed using the same clinical criteria by a sports physiotherapist.5
All participants were of self-reported Caucasian European ancestry. They gave written informed consent and completed questionnaires concerning their medical history and involvement in physical activity. This study was approved by the Human Research Ethics Committee at the University of Cape Town, South Africa, and the Human Ethics Committees of La Trobe and Deakin Universities in Melbourne, Australia. DNA was extracted from approximately 4.5 ml of venous blood as previously described.5
COL11A1 and COL11A2 genotyping
The COL11A1 rs3753841 (T/C) and rs1676486 (C/T) non-synonymous SNPs, as well as the COL11A2 rs1799907 (IVS6–4, T/A) SNP, were genotyped using fluorescence-based Taqman PCR assays (Applied Biosystems, Foster City, California, USA). Inventoried allele-specific probes and flanking primer sets were used along with a premade PCR mastermix containing ampliTaq DNA polymerase Gold (Applied Biosystems, Foster City, California, USA) in a final reaction volume of 8 µl. The PCR reactions were conducted on an Applied Biosystems StepOnePlus Real-Time PCR system (Applied Biosystems) using the Applied Biosystems StepOnePlus Real-Time PCR software V.2.2.2 (Applied Biosystems) following the manufacturers recommended cycling conditions.
Quanto V.1.2.4 was used to determine the statistical power for a given sample size and minor allele frequency.23 Assuming a minor allele frequency of 0.36, 0.20 and 0.26 for rs3753841, rs1676486 and rs1799907, respectively, a sample size of approximately 180 cases and 360 controls, would be adequate to detect a genetic effect size of at least 2.25, 1.85 and 1.95, respectively, at a power of 80% and a significance level of 5%.
Data were analysed using STATISTICA V.10.0 (StatSoft, Tulsa, Oklahoma, USA) and GraphPad Prism V.5.0d for Mac OS X (GraphPad Software, San Diego, California, USA) programs. A one-way analysis of variance was used to determine any significant differences between the characteristics of the TEN and CON groups within the AUS, SA and combined cohorts. A χ2-analysis or Fisher's exact test was used to analyse any differences in the genotype frequencies and other categorical data between the groups. Hardy-Weinberg equilibrium (HWE) was established using the program Genepop web V.3.4 (http://genepop.curtin.edu.au/). The linkage disequilibrium (LD) between the polymorphisms within COL11A1 was calculated using CubeX: cubic exact solution (http://www.oege.org/software/cubex/).24 The Chaplin case–control haplotype inference package was used to infer haplotypes and pseudohaplotypes.25 ,26 Statistically significant differences were accepted when p<0.05.
The SA CON and SA TEN groups were matched for height, sex and country of birth (table 1). The SA TEN group was, however, significantly older (p=0.007) and heavier (weight, p=0.002; body mass index (BMI), p=0.003) than the SA CON group. The mean difference between age of injury and age of recruitment of the SA TEN group was 7.1±8.9 years. There was therefore also a significant difference between the age at recruitment of the SA TEN (48.2±11.6 years, n=99) and the SA CON (36.4±10.8 years, n=154; p<0.001) groups. When adjusted for age of recruitment, weight and BMI were no longer significantly different.
The AUS CON and AUS TEN groups were similarly matched for age and country of birth (table 1). There was significantly more men in the AUS CON (40.3%) compared with the AUS TEN (71.8%, p<0.001) groups. After adjusting for sex, the groups were matched for height. The mean difference between the age of injury and the age of recruitment of the AUS TEN group was 9.0±10.1 years. There was therefore a significant difference between the age at recruitment of the AUS TEN (49.6±12.8 years, n=78) and the AUS CON (39.4±12.3 years, n=174; p<0.001) groups. After adjusting for both age at recruitment and sex, the AUS CON and AUS TEN groups were also matched for weight and BMI (table 1).
When the SA and AUS groups were combined, there were significantly more men in the TEN group (69.4%) compared with the CON group (51.5%, p<0.001). The TEN group (age of initial injury 40.8±14.7 years, weight 78.3±13.4 kg) was also significantly older and heavier than the CON group (age of recruitment 38.0±11.6 years, p=0.022; weight 72.5±13.2 kg, adjusted p=0.049). The groups were, however, matched for height (TEN 175±10 cm, CON 173±9 cm; adjusted p=0.419) and BMI (TEN 25.5±3.5 kg/cm2, CON 24.1±3.5 kg/cm2; adjusted p=0.214).
There were no significant COL11A1 or COL11A2 genotype effects on any of the combined SA and AUS participant characteristics (data not shown).
COL11A1 and COL11A2 genotype and allele frequency distributions
There were no significant differences in any of the genotype or allele frequency distributions between the SA CON and SA TEN or the AUS CON and AUS TEN groups for COL11A1 rs3753841 (T/C), COL11A1 rs1676486 (C/T) and COL11A2 rs1799907 (T/A; see online supplementary table S1). In addition, there were no significant differences in the genotype distribution of these three polymorphisms between the participants from South Africa and Australia (see online supplementary table S1). For this reason, the two populations were combined. As illustrated in figure 1, there were no significant differences (p>0.100) in the genotype frequency distributions between CON and TEN groups of the combined Australian and South African cohorts when any of the three polymorphisms were analysed. There was also no significant difference in the COL11A1 rs3753841 (p=0.121, figure 1C) and rs1676486 (p=0.221, figure 1D) allele frequency distributions between combined CON and TEN groups. However, a tendency for the minor T allele of COL11A2 rs1799907 (p=0.095) to be over-represented in the combined TEN group was noted (figure 1F). Although there was a tendency for some to deviate, all the groups were in HWE (figure 1).
COL11A1 and COL11A2 interactions
Polymorphisms COL11A1 rs3753841 and rs1676486 were in high LD for the CON (SA D′=0.862; AUS D′=0.911) and TEN (SA D′=1.000; AUS D′=0.898) groups, as well as the combined group (D′=0.932). Since the three polymorphisms investigated in this study are located within type XI collagen genes, inferred pseudohaplotypes were constructed. Seven of the possible eight pseudohaplotypes were inferred with a frequency greater than 0.4% from the two COL11A1 and one COL11A2 polymorphisms (figure 2). The TCT pseudohaplotype was significantly over-represented (p=0.006) in the TEN group (25.9%, n=47) compared with the CON group (17.1%, n=57) when the combined populations were investigated (figure 2). Similar pseudohaplotype distributions were obtained when the SA and AUS cohorts were analysed separately, with the TCT pseudohaplotype being significantly over-represented in the TEN group of both the SA (CON 19.6% vs TEN 29.7%, p=0.017) and AUS (CON 12.9% vs TEN 21.3%, p=0.041) cohorts (see online supplementary figure S1).
Interactions between COL11A1, COL11A2 and COL5A1
The type V and type XI collagens both regulate fibrillogenesis, therefore inferred pseudohaplotypes were constructed between the COL11A1 and COL11A2 polymorphisms investigated in this study and the previously investigated COL5A1 rs71746744 (-/AGGG) polymorphism.7 Eleven of the possible 16 pseudohaplotypes were inferred with a frequency greater than 2% from the COL11A1, COL11A2 and COL5A1 polymorphisms (figure 3). The TCT(AGGG) pseudohaplotype was significantly over-represented (p=0.003) in the TEN group (25.2%, n=46) compared with the CON group (9.1%, n=30; figure 3). Similar pseudohaplotype distributions were obtained when the SA and AUS cohorts were analysed separately, with the TCT(AGGG) pseudohaplotype being significantly over-represented in the TEN group of both the SA (CON 10.0% vs TEN 29.4%, p=0.003) and AUS (CON 7.2% vs TEN 20.3%, p=0.008) cohorts (see online supplementary figure S2).
Polymorphisms in several genes, including COL5A1, have been identified as intrinsic risk factors for Achilles tendinopathy.2 COL5A1 encodes the α1 chain of type V collagen, which plays an important role in collagen fibril assembly, the basic building block of tendons.8 Type XI collagen, which is encoded by the COL11A1, COL11A2 and COL2A1 genes,10 shares structural and functional homology with type V collagen.11
None of the three investigated polymorphisms within the COL11A1 (rs3753841 and rs1676486) and COL11A2 (rs1799907) genes were independently associated with chronic Achilles tendinopathy in the Australian, South African or combined cohorts. The genotype and allele frequencies calculated for these three SNPs were similar to the reported frequencies presented in the NCBI SNP database (http://www.ncbi.nlm.nih.gov/snp/).
The main finding of this study was the association of the TCT-inferred pseudohaplotype, constructed from the three polymorphisms within COL11A1 (rs3753841 T/C and rs1676486 C/T) and COL11A2 (rs17999079 T/A), with an increased risk of chronic Achilles tendinopathy. The non-synonymous rs3753841 polymorphism within exon 52 of COL11A1, results in a predicted deleterious substitution of a leucine for a proline at amino acid position 1323 of the α1(XI) chain.27 The rs1676486 polymorphism within exon 62 also results in an amino acid substitution, proline to serine at position 1535, which could potentially cause a conformational change in type XI collagen.18 In addition, the T allele of rs1676486 also appears to be associated with increased mRNA degradation.18
Furthermore, the T allele of COL11A2 rs1799907 (IVS6–4, T/A), which was also implicated in the pseudohaplotypes, produces a distinct isoform of the α2(XI) chain in which several amino acids in the amino terminal acidic domain are deleted. This acidic domain provides potential sites for the interaction of type XI collagen with other molecules and may prevent further deposition of collagen molecules in the fibril.15 ,16 It is therefore reasonable to hypothesise that the risk-associated pseudohaplotype has a biological consequence. Although expressed in the developing tendon, further research is, however, required to determine the functional significance of the type XI collagen gene polymorphisms in tendinopathy.
Intriguingly, while the minor C allele of COL11A1 rs3753841 and the minor T allele of COL11A1 of rs1676486 were over-represented in patients with lumber disc herniation, the current study, however, implicated the alternate alleles in the risk-associated pseudohaplotype. Similarly, the functional rare TT genotype of the COL1A1 Sp1 binding site polymorphism was reported to be associated with risk for several multifactorial disorders, including lumber disc disease,28–31 but was associated with protection in anterior cruciate ligament ruptures in three independent populations.32–34 These findings suggest that both the alternative alleles within a gene can be associated with an increased risk of different multifactorial disorders.
An additional finding of this study was the additive contribution of the AGGG allele of the COL5A1 -/AGGG polymorphism together with the TCT-inferred pseudohaplotype of the COL11A1 and COL11A2 genes’ increased risk of chronic Achilles tendinopathy. This gene–gene association is perhaps not surprising taking into account the mounting evidence that the structural and functional homologies between the α1(XI) and α1(V) chains encoded by COL11A1 and COL5A1, respectively, facilitate an interchangeability between the two polypeptides resulting in heterotrimer (α1(XI)2α2(V)) formation.35–38 It is possible that the reported interaction in this study between the COL5A1 and COL11A1 polymorphisms indicate the role of a minor fibrillar collagen consisting of type V/XI α-chains in tendinopathy. Alternatively, the traditional type XI collagen heterotrimer may functionally replace or compensate for type V collagen in tendons. The possible role of type XI collagen in pathology of the mature tendon needs to be investigated.
Although we are mindful of the theoretical biological effects of the amino acid substitutions implicated by the risk-associated pseudohaplotype within the α1(XI) and α2(XI) chains, it is nevertheless tempting to speculate that the association between the COL11A1, COL11A2 and COL5A1 genes and increased risk of chronic Achilles tendinopathy is due to their collective effect on (1) mRNA stability, (2) potential effects on types V and XI collagen production, (3) regulation of collagen fibril diameter and (4) altered biomechanical properties of the collagen fibril that is reflected at the tissue level (figure 4).9 It is therefore reasonable to conceive that individuals who have these functional alleles as reflected from the pseudohaplotypes could possibly be producing a functionally altered types XI and V collagen which collectively are responsible for the altered biomechanical property of the tendon collagen fibrils. As previously mentioned, although type XI collagen is classified as a cartilage protein, it is produced in the developing tendon.11 ,12 To our knowledge, there is no evidence that the protein is produced in mature tendons. It is therefore possible that the proposed biological consequences of the reported association in this study could be due to altered protein profiling: (1) the expression of type XI collagen in the mature, diseased and/or healing tendon, (2) its expression only in the fibrocartilaginous regions of the mature tendon and (3) the result of its expression and function during tendon development. Irrespective of the mechanism(s), further research is required to replicate these findings in larger independent populations and to explore the functional mechanisms underlying the complex genetic associations with the type XI collagen-encoding genes and their interactions with the type V collagen-encoding gene.
In conclusion, the functional variants within the type XI collagen genes investigated in this study were not independently associated with chronic Achilles tendinopathy. This study does, however, provide evidence suggesting that the genes that encode for the structurally and functionally related type XI (COL11A1 and COL11A2) and type V (COL5A1) collagens interact with one another to collectively modulate the risk for Achilles tendinopathy. Although expressed in the developing tendon, the role of type XI collagen in the pathology of the mature tendon requires future investigation.
What are the new findings?
Functional polymorphisms within the type XI collagen genes (COL11A1 and COL11A2) were not independently associated with chronic Achilles tendinopathy.
These functional type XI collagen polymorphisms interact with a functional polymorphism within the type V collagen (COL5A1) gene to modulate the risk for Achilles tendinopathy in two independent populations.
Since types XI and V collagen are structurally and functionally related, we propose that these polymorphisms interact to regulate collagen fibril assembly.
How might it impact on clinical practice in the near future?
Although there are no immediate clinical applications, the results of this study provide additional evidence that interindividual variations in collagen fibril assembly, the basic building blocks of tendons, might be an important molecular mechanism in the aetiology of chronic Achilles tendinopathy.
Genetic risk factors could some day be included into multifactorial models to determine an individual’s risk for Achilles tendinopathy.
Genetic risk factors could, however, never to be used in isolation to diagnose or predict these injuries.
The authors would like to thank Dr GG Mokone for recruiting the South African (SA) participants, Professor MP Schwellnus for the clinical diagnoses of the SA chronic Achilles tendinopathy (AT) participants, as well as N Kariem for his laboratory assistance. MP was supported by the Thembakazi Trust. MC and AVS have filed patents on the application of specific sequence variations (not included in this manuscript) related to risk assessment of AT and anterior cruciate ligament injuries.
Contributors MH contributed to laboratory work, genotyping, analysis and manuscript preparation. JP, RC, AB and JC contributed to subject recruitment and diagnosis and manuscript editing. CJH contributed to laboratory work and manuscript editing. MP and MC was responsible for project development, analysis and manuscript preparation. AVS was responsible for project development, genotyping QC and manuscript preparation.
Funding This study was supported in part by funds from the National Research Foundation of South Africa (grant number: CPR20110712000020673), University of Cape Town, and the South African Medical Research Council.
Competing interests Authors MC and AVS have filed patents on the application of specific sequence variations (type XI collagen genes are not included) related to risk assessment of AT and anterior cruciate ligament injuries.
Patient consent Obtained.
Ethics approval Human Research Ethics Committee, University of Cape Town, South Africa and La Trobe and Deakin Universities Melbourne, Australia.
Provenance and peer review Not commissioned; externally peer reviewed.
▸ References to this paper are available online at http://bjsm.bmj.com
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.