Background—Up to 15% of colorectal cancers are characterised by DNA microsatellite instability (MIN), shown by the presence of DNA replication errors (RERs).
Aims—To identify pathological features that are discriminating for colorectal cancer (CRC) showing extensive MIN.
Subjects—A prospective series of 303 patients with CRC and no family history of either familial adenomatous polyposis or hereditary non-polyposis colorectal cancer.
Methods—DNA was extracted from fresh tissue samples and the presence of MIN was studied at nine loci that included TGFβRII, IGFIIR, and BAX. The 61 cases showing RERs were compared with 63 RER negative cases with respect to a comprehensive set of clinical and pathological variables. Predictive utility of the variables was tested by decision tree analysis.
Results—Twenty seven patients with CRC showed extensive RERs (three loci or more) (RER+) and 34 had limited RERs only (28 = one locus; 6 = two loci) (RER+/−), yielding a bimodal distribution. RER+ cancers differed from RER− and RER+/− cases. Tumour type (adenocarcinoma, mucinous carcinoma, and undifferentiated carcinoma) (p=0.001), tumour infiltrating lymphocytes (p=0.001), and anatomical site (p=0.001) were the most significant of the discriminating variables. Algorithms developed by decision tree analysis allowed cases to be assigned to RER+ versus RER− and +/− status with a global sensitivity of 81.5%, specificity of 96%, and overall accuracy of 93%.
Conclusion—Pathological examination of CRC allows assignment of RER+ status; assignment is specific and relatively sensitive. Conversely RER− and RER+/− CRC are indistinguishable.
Approximately 15% of sporadic large bowel cancers show DNA replication errors (RERs) or DNA microsatellite instability.1 2 Strand et al3 reported that mutations of DNA mismatch repair genes produced destabilisation of tracts of simple repetitive DNA in yeast and linked this observation to similar findings in both sporadic colorectal cancer and hereditary non-polyposis colorectal cancer (HNPCC).4 This led to the cloning of human DNA mismatch repair genes, of which hMSH2 and hMLH1 were shown to be implicated in most cases of HNPCC.5-8
Although sporadic RER positive colorectal cancers are viewed as non-familial counterparts of HNPCC, only a subset of sporadic RER positive cancers shows somatic mutation of hMSH2, hMLH1, or other DNA mismatch repair genes such as hPMS1 and hPMS2.9 10 This suggests that a proportion of RER positive CRC is not necessarily an exact sporadic counterpart of HNPCC. Either different DNA mismatch repair genes are implicated or the genes are switched off by altered methylation or the DNA microsatellite instability may be mild and epiphenomenal. For example, mild DNA microsatellite instability (one locus only) was not associated with certain features characterising HNPCC including poor tumour differentiation, location in proximal colon, and diploid DNA content.11 Conversely these features were seen when two or more loci were implicated.11 Further features of strongly RER positive sporadic cancers have been recognised including mucinous differentiation,12 a Crohn’s like peritumoural lymphocytic reaction,13 14 and tumour multiplicity.15 16
RER positive cancers, whether sporadic or associated with HNPCC, appear to show a molecular genetic spectrum that is distinct from common bowel cancer or cancer developing in familial adenomatous polyposis. APC, p53, and K-ras mutations occur with reduced frequency in HNPCC.17 In sporadic RER positive cancers there is a reduced frequency of p53 immunostaining13 14 whereas mutations within small repeated sequences are found in three genes implicated in tumour progression: TGFβRII,18 IGFIIR,19 and BAX.20 Mutations in TGFβRII are also found in adenomas and carcinomas from patients with HNPCC.17 21
The preceding data indicate that sporadic RER positive cancers differ in clinical, pathological, and molecular respects from sporadic RER negative cancers, but at least partly overlap with HNPCC. Conflicting data are explained by less stringent definitions of RER positivity.22 Given the fact that RER positive cancers will include some that are hereditary, may have a relatively favourable prognosis,11 and may conceivably respond differently to adjuvant therapy,23 it is desirable that they are recognised. The aim of this study was to assess the independent value of pathological criteria in the diagnosis of RER positive cancers presenting in the absence of a family history of bowel cancer. Cribriform differentiation and tumour infiltrating lymphocytes were included in addition to the features reviewed earlier, as these were shown recently to be relatively sensitive for RER positive cancers.24 In this study RER positivity was classified as mild (up to two out of six loci) or extensive (three or more positive loci). Three additional loci within genes directly implicated in RER positive tumorigenesis—TGFβRII,18 IGFIIR,19 and BAX20—were also studied.
Materials and methods
The material was derived from a prospectively collected series of 303 fresh specimens of colorectal cancer obtained through the Royal Brisbane and Greenslopes Hospitals. The study was approved by the Royal Brisbane Hospital Ethics Committee and all patients gave written informed consent.
DNA REPLICATION ERROR ASSAYS
DNA was extracted from the fresh specimens of cancer and from matching germline control samples (normal mucosa or, when not available, blood lymphocytes). In order to screen for RERs, these samples were amplified by polymerase chain reaction (PCR) at microsatellite loci MYCL, AT3, D2S123, F13B,25 BAT-26, and BAT-40.26 PCR reactions were performed in a final volume of 20 μl, containing 100 ng of genomic DNA, 20 pmol of each oligonucleotide primer, 1.25 μM of dATP, 10 μM of dCTP, dTTP, and dGTP, 6μCi [α-35S]dATP, and 1 unit of Taq polymerase (Boehringer Mannheim, Mannheim, Germany). Reaction conditions consisted of three minutes at 92°C, followed by 31 cycles (45 seconds at 94°C, one and a half minutes at 55°C, one and a half minutes at 72°C), and a final extension for five minutes at 72°C. PCR products were electrophoresed on denaturing 5% polyacrylamide (19:1) gels and visualised by autoradiography. Of the 303 cancers analysed for RERs, cases showing electrophoretic shifts at one or more loci were submitted for histopathological review (n=61) together with a randomly selected group of RER negative cancers (n=63). In addition, cancers with genetic alterations at one or more loci were analysed for the presence of TGFβRII (polyA), IGFIIR (polyG), and BAX mutations as previously described.19 20 27
Cases were reviewed by the first author without knowledge of the DNA RER status. Potential predictive variables for DNA microsatellite instability included: location (proximal colon up to and including splenic flexure, distal colon, and rectum); type of cancer (adenocarcinoma, mucinous carcinoma, undifferentiated carcinoma)28; differentiation (well, moderate, poor)28; architecture (simple acini, cribriform structures, no lumina)29; invasive margin (expanding, infiltrating)30; peritumoural chronic inflammation, with or without Crohn’s like lymphocytic nodules (present, absent)30; tumour infiltrating lymphocytes (present, absent)24; contiguous adenoma (present, absent); nodal status (involved, not involved); local spread (within wall, beyond wall); and modified Dukes’ stage31 (A, B, C, and D for distant spread). For the purposes of this study, cancers were termed undifferentiated when they were composed primarily of large cells in solid sheets or broad trabeculae and had a well circumscribed margin. Unlike the series described by Gibbs32 these cases included a minor component with glandular spaces and were strictly a subset of poorly differentiated adenocarcinomas. A single signet ring cell carcinoma was classified as a poorly differentiated mucinous carcinoma.
For univariate analyses, all key independent variables were investigated for possible associations with RER positive cancers by χ2 analyses of contingency tables (SAS PROC FREQ) (SAS Inst Inc. Cary, North Carolina, USA). All variables were subsequently included for multivariate analysis using classification tree methodology. The program CART (classification and regression trees) was used to construct a decision tree that can be used to predict a response variable (California Statistical Software Inc., Lafayette, California, USA, 1992).33 CART is a computer intensive non-parametric tool (in contrast to its parametric competitors such as Fisher’s linear discriminant analysis or logistic regression) since it does not depend on any underlying distributional assumptions.34 CART allows for non-linear relations between predictive factors and outcomes and for mixed data types (numerical and categorical), isolates outliers, and incorporates a pruning process using cross validation as an alternative to testing for unbiasedness with a second data set.
The tree is derived by recursive partitioning, beginning with the total sample population and all variables. At each step the program determines for each possible predictor variable a cutpoint which optimally splits the population into prespecified subgroups, and then selects the variable which performs best (according to some criterion based on impurities of the subgroups). If an observation has no data for this variable, the program uses the next most important variable (the surrogate) to sort that observation. It then takes the resulting subpopulations and repeats the process until no further partitioning is warranted, either because a subpopulation contains only one class of the observed response variable or because the subpopulation is too small to subdivide further. A “pruning” procedure then recombines subgroups if classification error is not significantly increased. During the partitioning process, the program keeps track of how well each predictor performs on each split and thus can evaluate its overall discriminating ability relative to the other factors in the analysis, as measured by variable importance rankings. The final result is a decision tree. Assessment of the predictive power of each variable is given by the variable importance ranking, rather than the level where it appears on the tree.
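The recursive partitioning described above can be illustrated with a minimal sketch. This is not the CART program used in the study: the Gini index is assumed as the impurity criterion (the text specifies only "some criterion based on impurities"), surrogate splits, pruning, and cross validation are omitted, and the five example cases and the function names `gini`, `best_split`, and `grow` are hypothetical.

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels (0.0 for a pure node)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels, variables):
    """Find the binary split 'variable == category' with the lowest
    size-weighted impurity; returns (variable, category, impurity) or None."""
    best = None
    n = len(rows)
    for var in variables:
        for cat in sorted({r[var] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[var] == cat]
            right = [y for r, y in zip(rows, labels) if r[var] != cat]
            if not left or not right:
                continue  # not a real partition
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (var, cat, score)
    return best

def grow(rows, labels, variables, min_size=2):
    """Recursively partition until a node is pure or too small to split."""
    majority = Counter(labels).most_common(1)[0][0]
    if len(set(labels)) == 1 or len(rows) < min_size:
        return majority
    split = best_split(rows, labels, variables)
    if split is None:
        return majority
    var, cat, _ = split
    yes = [(r, y) for r, y in zip(rows, labels) if r[var] == cat]
    no = [(r, y) for r, y in zip(rows, labels) if r[var] != cat]
    return {
        "test": (var, cat),
        "yes": grow([r for r, _ in yes], [y for _, y in yes], variables, min_size),
        "no": grow([r for r, _ in no], [y for _, y in no], variables, min_size),
    }

# Hypothetical toy cases (not study data): tumour infiltrating lymphocytes and site.
rows = [
    {"til": "present", "site": "proximal"},
    {"til": "present", "site": "distal"},
    {"til": "absent", "site": "proximal"},
    {"til": "absent", "site": "distal"},
    {"til": "absent", "site": "rectum"},
]
labels = ["RER+", "RER+", "RER-", "RER-", "RER-"]
tree = grow(rows, labels, ["til", "site"])
```

In this toy example a single split on tumour infiltrating lymphocytes separates the two classes, so the tree has one decision node and two pure leaves.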
Results
The RER positive cancers included 34 with instability at two loci or less (RER+/−) and 27 with instability at three loci or more (RER+) (fig 1). Of the RER+/− cases only six showed instability at two loci and none had mutations of TGFβRII, IGFIIR, or BAX. By contrast, all but four of the RER+ cancers showed at least one mutation implicating TGFβRII, IGFIIR, or BAX (fig 2). The distribution of RER positive cancers (+ and +/−) was bimodal (fig 2).
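The grouping rule used throughout can be written as a short function. This is a sketch only: the function name `classify_rer` is hypothetical, and because the exact number of unstable loci is not reported for each RER+ case, the value 3 is used below as a placeholder for "three or more".

```python
from collections import Counter

def classify_rer(unstable_loci: int) -> str:
    """Assign RER status from the number of unstable microsatellite loci."""
    if unstable_loci < 0:
        raise ValueError("number of unstable loci cannot be negative")
    if unstable_loci == 0:
        return "RER-"
    if unstable_loci <= 2:
        return "RER+/-"   # limited instability (one or two loci)
    return "RER+"         # extensive instability (three loci or more)

# Reviewed cases: 63 RER negative, 28 with one locus, 6 with two loci,
# and 27 with three or more (placeholder value 3, an assumption).
loci_counts = [0] * 63 + [1] * 28 + [2] * 6 + [3] * 27
groups = Counter(classify_rer(n) for n in loci_counts)
```

Applied to the 124 reviewed cases, this reproduces the group sizes given in the text: 63 RER−, 34 RER+/−, and 27 RER+.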
Variables showed a similar distribution in RER− and RER+/− cancers with the single exception of peritumoural lymphocytic infiltration, though the trend for this feature fell just short of significance (χ2=3.2, p=0.07) (table 1). RER− and RER+/− cancers were therefore combined. No differences between RER+ versus RER− and RER+/− cancers were seen for nodal spread, direct spread, or contiguous adenomas. Differences were seen for the remaining variables, with type, tumour infiltrating lymphocytes, and location showing the largest χ2 values (table 1). Mucinous (fig 3) and undifferentiated cancer (fig 4) were over represented among RER+ cancers. Tumour infiltrating lymphocytes were most notable in undifferentiated cancers (fig 5), but were also observed in four RER+ adenocarcinomas (fig 6) and two RER+ mucinous carcinomas. The designation “undifferentiated” is not strictly correct, but serves to distinguish this large cell variant of poorly differentiated adenocarcinoma showing good circumscription and tumour infiltrating lymphocytes.
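As a check on the arithmetic, the p value quoted for peritumoural lymphocytic infiltration can be recovered from the χ2 statistic. The sketch below assumes one degree of freedom (a 2×2 table of present or absent against RER− or RER+/−), which is not stated explicitly in the text; the function name `chi2_p_value_1df` is hypothetical.

```python
import math

def chi2_p_value_1df(chi2: float) -> float:
    """Upper-tail p value of a chi-square statistic with 1 degree of freedom.

    With 1 df the statistic is the square of a standard normal deviate,
    so P(X > chi2) = erfc(sqrt(chi2 / 2)).
    """
    if chi2 < 0:
        raise ValueError("chi-square statistic cannot be negative")
    return math.erfc(math.sqrt(chi2 / 2.0))

p = chi2_p_value_1df(3.2)  # value reported in the text: p = 0.07
```

Under this assumption the result rounds to 0.07, in agreement with the reported value.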
Decision tree analysis was performed for adenocarcinoma (fig 7) and mucinous carcinoma (fig 8). There were only four undifferentiated cancers and three of these were RER+. The decision tree analysis for adenocarcinoma yielded a test with an overall accuracy of 95% (overall correct assignment), a sensitivity of 64%, and a specificity of 99%. The decision tree analysis for mucinous carcinoma yielded a test with an overall accuracy of 85.7%, a sensitivity of 92%, and a specificity of 75%. The global assignment accuracy for all types of carcinoma (adenocarcinoma, mucinous carcinoma, and undifferentiated carcinoma) was 93%.
Discussion
In order to study morphological characteristics of RER positive colorectal cancer that might be diagnostically discriminating, a subset of RER+ cancers was recognised in which microsatellite instability was present in at least three of six loci. Only four of 27 cancers with microsatellite instability at three or more loci lacked a mutation of TGFβRII,17 IGFIIR,18 or BAX,19 genes directly implicated in the tumorigenesis of RER positive cancers. Conversely, of the 34 cancers showing mild RER positivity (RER+/−), all cases lacked mutation in TGFβRII, IGFIIR, or BAX (p<0.001). MYCL was the most frequently mutated locus among the RER+/− cases. Tetranucleotide (MYCL) instability may be less specific as a marker of extensive and significant RER positivity. Figure 2 indicates a bimodal distribution of RER+ and RER+/− cancers.
None of the variables was distributed differently between the RER− and RER+/− cancers, with the possible exception of peritumoural lymphocytes (table 1). RER− and RER+/− cancers were therefore grouped together for the purposes of further analysis. Certain variables were distributed differently between RER+ and the remaining combined group (RER− and +/−). The excess of poorly differentiated adenocarcinomas in the RER+ group included a distinct subset in which cells were grouped in irregular trabeculae or extensive sheets and the tumour was well circumscribed with a pushing margin. Three of the four cancers with this morphology were characterised by large numbers of tumour infiltrating lymphocytes, these cells being evident within the malignant epithelium as well as in the surrounding stromal elements (fig 5). Although these cancers were recognisably adenocarcinomas, they were predominantly undifferentiated. Gibbs32 described a series of eight undifferentiated large bowel cancers which showed a pushing tumour margin. He stressed the importance of recognising this subgroup because of the excellent prognosis belying the lack of glandular differentiation. It is likely that some of these cancers were RER+. Furthermore, since two of the eight subjects in the study by Gibbs were under 50 years at the time of diagnosis (a woman aged 31 and a man aged 39) it is likely that these were HNPCC family members. Such undifferentiated or poorly differentiated cancers are known to be over represented in HNPCC35 as well as among sporadic RER+ cancers.11 13 36
Mucinous carcinomas also occurred more frequently in the RER+ group. Indeed, they were the commonest subtype. The over representation of mucinous carcinoma has been recognised previously in both sporadic RER+ cancers12 13 14 and HNPCC.35 Since typing into adenocarcinoma, mucinous carcinoma, and undifferentiated carcinoma is fundamental to the classification of colorectal cancer and was the most significant discriminant, we commenced decision tree analysis with this variable. The additional useful discriminants for adenocarcinoma were the presence of tumour infiltrating lymphocytes, site, and differentiation. The decision tree for adenocarcinoma allowed RER+ cancers to be recognised with 64% sensitivity and 99% specificity. There were only four undifferentiated cancers and further subdivision of this group was inappropriate. Three were RER+. Interestingly, the single RER− case lacked tumour infiltrating lymphocytes.
The large mucinous group required a decision tree analysis of greater complexity, but this yielded a test that was 92% sensitive and 75% specific. Although tumoural lymphocytes have been noted to be prominent in subsets of HNPCC34 and in sporadic RER+ cancers,13 14 this study has shown a more selective association with tumour infiltrating lymphocytes (figs 5 and 6), an observation that should be credited to Krishna et al.24 There is a negative correlation between peritumoural lymphocytic infiltration and notable mucin production.37 As a high proportion of RER+ cancers are mucinous (sporadic and HNPCC associated) this fact must confound any association with peritumoural lymphocytic infiltration. Nevertheless, tumour infiltrating lymphocytes were conspicuous in two (15%) of 13 RER+ mucinous carcinomas. Of the 27 RER+ cancers, nine (33%) showed tumour infiltrating lymphocytes.
The biological significance of tumour infiltrating lymphocytes is unclear. It has been suggested that RER+ cancers would be prime targets for cytotoxic T cells because they must express neoantigens translated from multiply mutated genes. At the same time, RER+ cancers will be under severe selective pressure to escape T cell cytotoxicity through loss of HLA expression and this has been shown to be the case.38 Tumour infiltrating lymphocytes have low proliferative capacity, even when exposed to interleukin 2.39 40 However when cultured with interleukin 2, these cells show non-MHC restricted cytotoxicity to autologous tumour cell targets.41 Although such cytotoxicity is occurring within an artificial environment, it is conceivable that loss of HLA expression may not render RER+ cancers immune from T cell destruction. Indeed apoptotic bodies may be numerous in RER+ cancers (unpublished observation), requiring careful distinction from tumour infiltrating lymphocytes. The apoptotic effect could be triggered by the lymphocytes, but mediated by BAX.20 Among RER+ cancers, tumour infiltrating lymphocytes were as frequent in specimens with a BAX mutation (40%) as those without (31%). We are currently examining the possibility that apoptosis may be reduced in specimens with BAX mutations, despite the presence of tumour infiltrating lymphocytes.
When RER positivity was diagnosed using stringent molecular criteria, morphological features combined with site and distant spread were found to be specific and sensitive for the recognition of this subgroup of CRC. Using the algorithms, 93% of cancers were correctly assigned to the RER+ versus RER− and +/− groups (global accuracy), 81.5% of RER+ cancers were correctly assigned (global sensitivity), and 96% of RER− and +/− were correctly assigned (global specificity). Recognition of RER+ cancers is of practical value because of the association with HNPCC. In the present series, none of the RER+ cancers was from an individual with a family history of HNPCC and the advanced age of these subjects (mean 71.1 years) makes it likely that most, if not all, were sporadic cases. Twenty seven of 303 cancers (8.9%) were RER+. Recognition of sporadic RER+ CRC is of importance because of the different prognosis,11 possible increased likelihood of multiplicity,15 16 different profile of molecular tumorigenesis, and possibly different response to chemotherapy.23 In this study, RER+ cancers were less likely to be associated with distant spread (1/27 cases) but the difference fell short of significance (p=0.08). Multiple colorectal cancer occurred in 11.1% (3/27) of RER+ but only 2.1% (2/97) of RER− and +/− cases in the present study (Fisher’s exact test, p=0.046).
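The global figures can be related to a two by two classification table. The sketch below is illustrative only: the cell counts (22 true positives among 27 RER+ cases and 93 true negatives among 97 RER− and RER+/− cases) are not given in the text but are back-calculated from the reported percentages, and the function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp: int, fn: int, tn: int, fp: int) -> dict:
    """Sensitivity, specificity, and overall accuracy from a 2x2 table."""
    positives = tp + fn          # cases that are truly RER+
    negatives = tn + fp          # cases that are truly RER- or RER+/-
    if positives == 0 or negatives == 0:
        raise ValueError("both classes must be represented")
    return {
        "sensitivity": tp / positives,
        "specificity": tn / negatives,
        "accuracy": (tp + tn) / (positives + negatives),
    }

# Assumed counts, inferred from the reported 81.5% and 96% (not study tables).
metrics = diagnostic_metrics(tp=22, fn=5, tn=93, fp=4)
```

With these assumed counts the sensitivity is 22/27 (81.5%), the specificity 93/97 (96%), and the accuracy 115/124 (93%), matching the global values reported.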
We conclude that pathological examination of colorectal cancer allows identification of cancers showing extensive DNA microsatellite instability (RER+) and that assignment is both specific and sensitive. Similar algorithms would assist in the morphological identification of CRC in HNPCC, although the right sided predilection may be more pronounced in the case of sporadic RER+ cancers. Weakly RER positive (RER+/−) cancers are indistinguishable from RER negative CRC. Weak RER positivity may be epiphenomenal and lacking in clinical significance or biological significance apart from an association with peritumoural lymphocytic infiltration. Although this study used decision tree modelling with multiple cross validation steps (using subsets of the data), the relatively small sample size and the subjectivity of some of the variables may limit the general applicability of the algorithms. On the other hand, the possibility of studying two or more cancers from members of a suspected HNPCC family would enhance diagnostic recognition considerably. It is unlikely that RER testing will be routinely available for the foreseeable future and DNA obtained from old formalin fixed material often fails to amplify adequately. Histopathological and clinical features that distinguish RER+ cancers are therefore likely to assume diagnostic importance and should be recorded as a routine.
We thank Mrs B Mason for typing the manuscript and Mrs L Reid and Mr C Winterford for photographic assistance. CW and SPP thank the Sir Edward Dunlop Medical Research Foundation for financial assistance. During this study LAS was supported by the Queensland Cancer Fund.