Article Text
Abstract
Background and aims: Nodal metastases are indisputable determinants of prognosis for colon and rectal cancer. Using classical histological criteria, many attempts to predict nodal metastasis have failed, preventing the adequate management of stage I (pT1) cancer. We investigated the role of tumour matrilysin in predicting metastatic potential, and discuss its potential use in individualising treatment of pT1 colon and rectal cancer.
Methods: The gene signature associated with nodal metastasis was investigated by cDNA array in 24 colon and rectal cancers. We studied 494 colon and rectal cancer patients to identify risk factors for nodal metastasis and evaluated the potential to predict nodal metastasis by either the logistic regression model or the Bayesian neural network model with builtin matrilysin. We then inferred possible causality of nodal metastasis from structural equation modelling.
Results: cDNA array revealed that matrilysin was maximally upregulated in the metastasis signature identified. Tumour matrilysin expression emerged as a stage independent risk factor for nodal metastasis, resulting in a similar predictive performance in receiver operating characteristic curve analysis in the two models. A Bayesian approach called automatic relevance determination identified matrilysin as one of the most relevant predictors examined. Structural equation modelling suggested possible direct causality between matrilysin and nodal metastasis.
Conclusions: We have provided evidence that tumour matrilysin expression is a promising biomarker predicting nodal metastasis of colon and rectal cancer. Analysis of tumour matrilysin expression would help clinicians achieve the goal of individualised cancer treatment based on the metastatic potential of pT1 colon and rectal cancer.
 ARD, automatic relevance determination
 ANN, artificial neural network
 CRC, colon and rectal cancer
 EMR, endoscopic mucosal resection
 MMP, matrix metalloproteinase
 MMP7, matrilysin
 SAM, significance analysis of microarray
 SEM, structural equation modelling
 matrilysin
 Bayesian neural network
 structural equation modelling
 nodal metastasis
 colon and rectal cancer
Statistics from Altmetric.com
 ARD, automatic relevance determination
 ANN, artificial neural network
 CRC, colon and rectal cancer
 EMR, endoscopic mucosal resection
 MMP, matrix metalloproteinase
 MMP7, matrilysin
 SAM, significance analysis of microarray
 SEM, structural equation modelling
 matrilysin
 Bayesian neural network
 structural equation modelling
 nodal metastasis
 colon and rectal cancer
Combined minimally invasive and target oriented treatment, individually tailored in accordance with metastatic potential, is an ideal but still remote goal of cancer treatment, which would improve outcome and reduce medical costs for cancer patients. In the postgenomic era, cancer treatment has become increasingly target oriented^{1} and the many molecular targets identified by high throughput technologies encourage this trend and provide attractive candidates for clinical use.^{2–}^{4} Moreover, individualised cancer treatment based on pharmacogenomics is now a real clinical possibility, while minimally invasive surgery is the current preferred approach to many cancers.^{5} Despite many histological studies, we still cannot assess the metastatic potential of cancer.^{6} Our primary purpose in this study was to assess and predict the metastatic potential of cancer, which is difficult to evaluate directly. Fortunately, locoregional nodes are almost invariably the initial site of metastatic spread of stage I (pT1) colon and rectal cancers (CRC), so the development of nodal metastasis should exactly reflect the metastatic potential of the primary tumour of stage I (pT1) CRC. Our secondary aim was to propose individualised treatment against stage I (pT1N0M0) CRC based on the metastatic potential in the clinical setting.
Matrix metalloproteinases (MMP) are promising targets for cancer therapy based on their massive upregulation in malignant tissues and their unique ability to degrade all components of the extracellular matrix.^{7} We and other authors reported that matrilysin (MMP7) is exclusively produced by cancer cells and has critical implications for tumour growth, invasion, metastasis, and prognosis of gastrointestinal cancers.^{8,}^{9,}^{10,}^{11,}^{12,}^{13} Although matrilysin is one of the targets transactivated by βcatenin, a key mediator in the Wnt pathway, interacting with the family of Tcf/Lef1 transcriptional factors,^{13–}^{15} and is implicated directly in early tumorigenesis,^{9} the precise role of matrilysin in the potential for nodal metastasis is still unclear.
Neural computing and graphical modelling has recently emerged as a practical technique, with successful applications in many unrelated fields. Artificial neural networks (ANNs) are algorithms applicable to complicated nonlinear statistical modelling that provide a new alternative to logistic regression which has been the most commonly used method for developing predictive models for dichotomous outcomes in medicine.^{16,}^{17} Structural equation modelling (SEM) is a statistical approach to infer causality between latent factors impossible to observe directly, and selected measurable variables in social and physical phenomenon.^{18} However, discriminatory use of logistic regression, ANN, or SEM to produce the best possible results in the clinical decision making process is still not fully understood.
In this study, we identified that matrilysin was maximally upregulated in the metastatic gene signature in cDNA array analysis. Using a large number of CRC patients, we determined that tumour matrilysin expression is a stage independent risk factor for nodal metastasis. We then developed and evaluated two novel predictive models based on logistic regression and Bayesian ANN with builtin matrilysin. Moreover, possible causality of matrilysin in nodal metastasis was inferred from SEM. Based on these findings, we propose that tumour matrilysin measurements would refine evaluation of the metastatic potential of CRC, and endoscopic intervention combined with matrilysin measurement would be the most rational strategy against pT1 CRC, enabling individualised cancer treatment adjusted to the risk of metastasis.
MATERIALS AND METHODS
Design and participants
We obtained fresh tissues from 24 patients with CRC. All had undergone definitive surgery according to the International Union Against Cancer (UICC) guidelines. To compare gene expression profiles in CRC with nodal metastasis (n = 16) with those without nodal metastasis (n = 8), each specimen containing more than 90% tumour cells was subjected to cDNA array analysis. All tumours were adenocarcinomas, and their clinicopathological features were classified according to the UICC’s TNM classification.
We divided 494 Japanese patients with pT1–T4 CRC into two study populations to assess the risk of nodal metastasis. We conducted a cross sectional survey in one group of 180 consecutive patients with pT1, defined as when the tumour invades the submucosa (tables 1, 2). These were subdivided into two subgroups: a surgical subgroup (n = 94) who underwent radical surgery alone, and an endoscopic mucosal resection (EMR) subgroup (n = 86) who were treated by initial endoscopic local excision followed by additional radical surgery. The other main group comprised 244 pT1–T4 patients consisting of 122 matched pairs for age (within five years) and sex, analysed according to a case (node positive), control (node negative) design (tables 3, 4). To compare the new biomarker (tumour matrilysin) with current standards, we deliberately introduced “an unfavourable score”, which was defined as an additive sum of the standard risk factors for nodal metastasis in pT1 CRC. These factors include: depth of invasion (scored scanty 0 or massive 1); differentiation grade (well 1, moderate 2, and poor 3); lymphatic invasion (negative 0 or positive 1); and vascular invasion (negative 0 or positive 1).^{19} Treatment of all patients was carried out between 1990 and 2001 in hospitals affiliated with our university, and all patients gave full informed consent. We excluded patients with multiple primary cancers, familial cancer syndromes, and idiopathic inflammatory bowel disease. The institutional review board of Sapporo Medical University approved the study. We retrieved clinicopathological data from the medical and pathological records (tables 1–4).
The 494 patients were then randomly divided into two groups: one was a training data set of 402 patients and the other was a test data set of 92 patients. Using the training data set, we independently developed two predictive models, a logistic model and Bayesian ANN model. We then evaluated the predictive performance of these two models using the same test data set. Furthermore, we developed a causal model of nodal metastasis with SEM using the training data set (see appendix for details).
cDNA array analysis
We performed cDNA array analysis according to the manufacturer’s protocol, using the filter (human cancer; Toyobo) consisting of 550 cancer related genes and 11 housekeeping genes in duplicate. A complete list is available on the URL http://www.toyobo.co.jp. Gene expression profiles of nodal metastasis were compared using significance analysis of microarray (SAM; http://wwwstat.stanford.edu/~tibs/SAM/)^{20} (see appendix for details).
Immunohistochemistry and its evaluation
Archival formalin fixed paraffin embedded sections were immunostained using a standard avidinbiotin peroxidase technique, as recommended by the manufacturer (Dako, Glostrup, Denmark). The mouse monoclonal antibodies used were those against human MMP7 (1417B2; Daiichi Fine Chemical, Toyama, Japan), cytokeratin (56 kDa) (KL1; Serotec, Oxford, UK), and βcatenin (Transduction Laboratories, Lexington, Kentucky, UK). The simplest criterion to evaluate immunoreactivity was adopted according to an allornothing principle. Matrilysin immunostaining was judged as positive provided basilar and/or cytoplasmic staining was observed at the invasive edge (fig 1). Cytokeratin was used to enhance visualisation of the small numbers of budding cancer cells. Tumour budding was defined as microscopic clusters of undifferentiated cancer cells just ahead of the invasive front of the tumour and has also been referred to as “focal dedifferentiation”.^{21} In this analysis, evaluation of positive βcatenin expression was restricted to nuclear accumulation at the invasive front of the tumour. Two independent researchers (YA and TS) evaluated all immunostaining in a blind manner.
Statistical analysis
Categorical comparison of the clinicopathological characteristics was carried out using the χ^{2} test, Fisher’s exact test, the exact p values based on Pearson’s statistic or the Monte Carlo method. Continuous variables were tested by the Student’s t test. Logistic regression analysis for nodal metastasis was performed using a backward stepwise method (likelihood ratio) (software, SPSS for Windows, version 11; Chicago, Illinois, USA). Clinicopathological characteristics for the matched data were examined using conditional logistic regression analysis. A p value of less than 0.05 was considered statistically significant. All tests were two tailed.
Artificial neural networks (ANNs)
The most popular three layer feed forward perceptron under the learning algorithm, the error backpropagation, was adopted.^{22} It was composed of nine input units, 20 hidden units, and one output unit. The nine input nodes included age, sex, tumour size, location of tumour, depth of invasion, differentiation grade, lymphatic and venous invasion, and expression of matrilysin. The one output node represented nodal metastasis. We used a Bayesian implementation of learning in neural networks using Markov Chain Monte Carlo sampling developed by Neal.^{23} Automatic relevance determination (ARD), proposed by MacKay and Neal,^{24,}^{25} is a hierarchical Bayesian approach where there are hyperparameters explicitly representing the relevance of different input features (fig 2). All of the analyses on Bayesian neural networks employed Software for Flexible Bayesian Modelling (copyright 1995–2003 by Radford M Neal, version 20030629; the software and its documentations are available from the following URL http://www.cs.utoronto.ca/~radford/). Moreover, we performed receiver operating characteristic (ROC) curve analysis of two predictive models developed with ROCKIT (developed by Charles E Metz; ROC software including ROCKIT and the user manual are available at the following URL: http://home.uchicago.edu/~junji/KRL_HP/rocold.htm) (fig 3)^{26} (see appendix for details).
Structural equation modelling (SEM)
We tried to infer the causality of nodal metastasis from SEM using the identical training data set of ANN analysis. A path diagram as a causal model for nodal metastasis, consisting of nine measurable and two latent variables, was developed. The model was statistically evaluated by t test for all paths, and r^{2} values or squared multiple correlation were calculated. The SEM analysis used EQS6.1 for Windows (Multivariate Software, Inc., California, USA) (fig 4) (see appendix for details).
RESULTS
We identified three genes that were upregulated by at least fourfold (matrilysin, laminin β2, and PMS1) and one that was downregulated by at least fourfold (cadherin6) in CRC tissues with nodal metastasis (see appendix for details). These differentially expressed four genes were statistically significant in SAM analysis, and semiquantitative reverse transcriptionpolymerase chain reaction analysis gave results consistent with those by cDNA array analysis (data not shown).
Matrilysin and βcatenin were preferentially expressed, but in different locations at the invasive front of the tumour (fig 1A). Cancer cells that still formed well to moderately differentiated tubular glandular structures expressed matrilysin polarised to the apical cell surface of the glands while many cases of focal dedifferentiated or budding cancer cells exhibited matrilysin depolarised in the cytoplasmic and/or on the basilar cell surface (fig 1B).
Twenty of the 180 pT1 patients (11.1%) were node positive, as shown in table 1. The retrieved lymph nodes averaged at 14.8, which was not dependent on the status of nodal metastasis. Fifty six of 180 patients (31.1%) were matrilysin positive. Tumour budding was demonstrated in 66.0% (95/144) of the specimens, and aberrant nuclear expression of βcatenin was detected in 70.9% (100/141). Nodal metastasis and tumour matrilysin were statistically associated with identical factors: depth of tumour invasion (p = 0.016 v 0.011), presence of tumour budding (p = 0.0025 v <0.0001), positive matrilysin expression (p<0.001), lymphatic invasion (p = 0.039 v 0.0188), venous invasion (p = 0.024 v 0.01), and nodal metastasis (p<0.0001), respectively. βCatenin did not correlate with any of the factors examined. Although positivity of tumour matrilysin and lymphatic invasion in the surgical subgroup were significantly higher than in the EMR group, other factors were not (table 1). Multiple logistic analysis showed that tumour matrilysin expression (odds ratio 10.4, 6.7 (95% confidence intervals (CI) 3.18–33.98, 1.10–41.48)) was an independent risk factor for nodal metastasis in pT1 CRC and in the EMR subgroup, respectively, and venous invasion (odds ratio 2.8 (95% CI 1.39–5.61)) was an independent risk factor for nodal metastasis in pT1 CRC (table 2, upper part). In the analysis using the unfavourable score, tumour matrilysin (odds ratio 9.2, 6.1 (95% CI 2.83–30.08, 1.08–34.91)) was an independent risk factor in pT1 CRC and in the EMR subgroup, respectively, and unfavourable score (odds ratio 1.5 (95% CI 1.11–2.12)) was an independent risk factor in pT1 CRC (table 2, lower part).
The clinicopathological characteristics of the 244 pT1–T4 CRC patients of the other study population are summarised in table 3. The retrieved lymph nodes averaged at 29.4, which was independent of the status of nodal metastasis. Matrilysin expression was positive in 176 patients (72.1%) and all parameters examined except tumour location were statistically significant for both nodal metastasis and positivity of matrilysin. Conditional logistic analysis for nodal metastasis identified that lymphatic invasion (adjusted odds ratio 5.7 (95% CI 2.66–12.37)) and tumour matrilysin expression (adjusted odds ratio 5.27 (95%CI 1.75–15.84)) were stage independent risk factors for nodal metastasis (table 4, upper). Using the unfavourable score, tumour matrilysin expression (adjusted odds ratio 4.3 (95%CI 1.71–10.64)) and the unfavourable score (adjusted odds ratio 1.9 (95%CI 1.46–2.58)) were independent risk factors (table 4, lower).
The logistic model and Bayesian ANN model were developed from the same 402 training data set and evaluated by the same 92 test data set. ROC curve analysis showed that the predictive accuracy of the test data set with our trained ANN model had a maximum sensitivity of 88.0% and a specificity of 86.6% when accepting a cut off value of 0.37. However, those values for the logistic model were maximally 88.0% and 82.1%, respectively, at a cut off value of 0.46 (fig 3). The areas under the curve were 0.904 and 0.874 for the neural network and logistic regression analysis, respectively; the difference between the two ROC curves did not reach statistical significance.
The Bayesian technique of ARD model determined the significance of the input parameters in the test data set. The input parameters were divided into two classes and ordered from the most relevant to the least relevant in each class as follows. Relevant parameters: tumour matrilysin expression, lymphatic invasion, depth of invasion, and venous invasion. Nonrelevant parameters: differentiation grade, sex, tumour size, age, and location of tumour (fig 2).
We developed a multiple indicator model of SEM introducing two latent factors designated as tumour growth and metastatic potential. The explored model had good values for goodness of fit: χ^{2} = 8.735 (p = 0.810), χ^{2}/df = 0.582, NFI = 0.986, NNFI = 1.020, CFI = 1.000, and RMSEA = 0.000. This suggested that matrilysin (path coefficient = 0.42) and lymphatic invasion (path coefficient = 0.35) were the causal parents of nodal metastasis. We could trace the three paths on the causal graph from a causal ancestor (tumour growth) to a descendant (nodal metastasis): tumour growth → metastatic potential → ly → nodal metastasis (0.35×0.77×0.35 = 0.09); tumour growth → metastatic potential → MMP7 → nodal metastasis (0.35×0.56×0.42 = 0.08); tumour growth → MMP7 → nodal metastasis (0.25×0.42 = 0.11). Overall association between them was the sum of these effects so that the effect mediated by ly is 0.09 while those mediated by MMP7 is 0.19. The relative contribution of causal effect of matrilysin was approximately twofold (0.19/0.09) greater than that of lymphatic invasion in this causal model (fig 4).
DISCUSSION
Our goal was to assess the metastatic potential of CRC in order to develop clinically effective individualised treatment based on that potential against stage I of CRC (pT1N0M0). Using cDNA array, we first identified that of the many possible candidates, matrilysin was a major biomarker involved in nodal metastasis. We reported previously that matrilysin expression was confined to cancer cells, which not only correlated with tumour metastatic potential in a mouse model but was also implicated in the poor prognosis of gastrointestinal cancer patients. Moreover, matrilysin is an established target of the Wnt signal pathway. For these reasons, we specifically focused on matrilysin for further analysis.
MMPs have long been a potential target for cancer therapy. However, clinical trials of synthetic metalloproteinase inhibitors have been disappointing.^{26} One major reason for this lack of success may result from the apparent uncertainty regarding the precise spectrum of inhibitory activity required. In other words, it is necessary to elucidate the role of individual MMPs in cancer progression by stage and type of cancer. To address this and to illustrate intuitive clinical applications, we focused on matrilysin and nodal metastasis of pT1 in stage I CRC. If targets can be identified earlier in the disease, a target oriented approach is likely to be more successful. Intriguingly, there were no matrilysin positive carcinomas in situ or adenomas examined using our criterion for immunoreactivity for tumour matrilysin as if it reflected their metastatic potential. pT1 CRC is the earliest stage of disease and nodal metastasis in pT1 CRC is the initial step in tumour spread and must precisely reflect the metastatic potential of the primary tumour. Therefore, analysis of nodal metastasis in pT1 CRC should create new therapeutic avenues involving metastatic potential individualised or target orientated approaches.
To date, therapy for pT1 CRC has varied from endoscopic to definitive surgery. The overall frequency of nodal metastasis in pT1 CRC may be as low as 10% (table 1),^{27} raising the possibility that endoscopic local excision is a valid approach promising a cure for most pT1 patients without metastasis (pT1N0M0). Despite extensive histological studies, we still cannot predict the metastatic potential of pT1 CRC.^{28} This inability is the crux of the problem of how to improve treatment of pT1 CRC. In our series of 180 pT1 patients, even in the EMR subgroup (n = 86) who were treated by initial endoscopic local excision followed by additional radical surgery, approximately 90% of patients underwent unnecessarily extensive surgery. Tumour matrilysin expression emerged as a new biomarker superior to conventional risk factors for nodal metastasis when tumour matrilysin was compared with either individual risk factors (tables 2 and 4, upper part) or the unfavourable score defined as the whole sum of risk factors (tables 2 and 4, lower part) in this study. Using our ANN model, the incidence of over surgery in the EMR subgroup (in other words, false positive rate) should be reduced by approximately 12% (11/86), while it resulted in about 92% (79/86) under the current standard assessment of nodal metastasis. Based on the concept of individualised cancer treatment adjusted to the risk of metastasis, endoscopic local excision combined with an analysis of tumour matrilysin expression seems to be the most rational strategy for basing therapeutic decisions in patients with pT1 CRC. This needs confirmation in further clinical studies and a broader consensus before its widespread application.
For further causal inference and to develop a robust prediction model for nodal positivity, the other arm of the 244 pT1–T4 CRC, matched individually for age and sex, was studied according to a case control design.^{29} Conditional logistic regression analysis of these data also revealed that tumour matrilysin expression and lymphatic invasion were stage independent risk factors for nodal metastasis. This part of our study could not be strictly categorised as a longitudinal study because a conclusion that matrilysin expression preceded nodal metastasis was only warranted in the 86 pT1 patients in the EMR subgroup who were treated by initial endoscopic local excision followed by additional radical surgery. However, the following findings imply positive causality for matrilysin in nodal metastasis.^{30} The presence of matrilysin expression was consistent with the biological plausibility of “no invasion no metastasis” and the consistency of this association was sustained by the corresponding results from the two different study populations (tables 1–4) and the two different statistical models (ANN and logistic model). The strength of the association is supported by previous studies.^{12,}^{13} SEM analysis also implied direct causality between tumour matrilysin and nodal metastasis (fig 4).
We hypothesised that depolarised matrilysin expression contributed directly to the mechanism of tumour budding formation, allowing cancer cells to acquire the most aggressive phenotype to overwhelm the host’s defensive machinery in the invasive frontline. Although this is only a hypothesis and the precise mechanisms underlying the process need clarifying, aberrant nuclear accumulation of oncogenic βcatenin is thought to be crucial in reprogramming cancer cells.^{31} However, our data suggested that aberrant nuclear βcatenin accumulation at the invasive front did not correlate significantly with any of the factors examined, including matrilysin expression. One possible explanation for this is that nuclear βcatenin alone was not sufficient to induce intrinsic matrilysin, as reported by Crawford et al and ourselves.^{32,}^{33}
Overfitting is a critical issue in developing a generalised ANN as it may lead to an extremely close fitting of a training data set, whereas in contrast, external test data sets always result in poor performance. There are several approaches to avoiding overfitting in limited training data,^{22} all of which relate to the complexity of the network employed. Here, we adopted a “Bayesian framework for neural network” advocated by Neal, so that we could develop the novel predictive ANN model for nodal metastasis, avoiding overfitting without the need for any validation set,^{23} whose diagnostic ability was equivalent to that of the logistic model in ROC analysis. ARD prior to input to hidden weight was used to this to extract relevancy among the inputs of ANN.^{23,}^{24} As depicted in fig 2, it revealed that matrilysin expression was one of the most relevant inputs in the ANN model, reinforcing the results of logistic regression and SEM, and establishing possible causality between matrilysin expression and nodal metastasis.
While SEM will provide a novel insight into medical care in a causality based manner, an ANN model as well as a logistic model can complement this by offering a precise prediction. The results of this study should improve the clinical decision as to who should receive definitive surgery in stage I (pT1N0M0) or may help resolve the controversy as to who adjuvant chemotherapy should be given in stage II (pT3/4N0M0) CRC. Whether metastatic potential of the primary tumour, through tumour matrilysin measurement, could predict recurrence or survival for stage II CRC warrants further clinical studies.
In conclusion, we propose that analysis of tumour matrilysin expression can refine predictions of individual metastatic potential for CRC, and help clinicians move closer towards the goal of individualised treatment based on tumour metastatic potential of pT1 CRC.
APPENDIX
MATERIALS AND METHODS
cDNA array analysis
Total RNA isolated from frozen tissues using Trizol reagent (Life Technologies, Rockville, Maryland, USA) was used for mRNA extraction using a MagExptractor mRNA isolation kit (Toyobo, Tokyo, Japan) according to the manufacturer’s protocol. Biotin labelled cDNA targets were made from 0.5 μg mRNA using the Gene Navigator cDNA amplification system (Toyobo). The Gene Navigator cDNA array filter (human cancer; Toyobo) consisted of 550 cancer related genes and 11 housekeeping genes in duplicate. A complete list is available on the URL http://www.toyobo.co.jp. Hybridisation was performed overnight at 68°C in PerfectHyb (Toyobo). Filters were washed three times in 2×SSC/0.1% sodium dodecyl sulphate followed by three washes in 0.1×SSC/0.1% sodium dodecyl sulphate at 68°C for 10 minutes each. Specific signals on the filters were detected by a chemiluminescence detection kit (Imaging high; Toyobo). Quantitative assessment of the signals on the filters was performed by scanning on a FluorS MultiImager System (BioRad, Richmond, California, USA) followed by image analysis using ImaGene software (BioDiscovery, Los Angeles, California, USA). Those of the duplicated spots within an array were averaged and normalised according to the median signal intensity for a given array.
Artificial neural networks (ANNs)
The most popular three layer feed forward perceptron under the learning algorithm, the error backpropagation, was adopted.^{22} It is composed of nine input units, 20 hidden units, and one output unit. The nine input nodes included age, sex, tumour size, location of tumour, depth of invasion, differentiation grade, lymphatic and venous invasion, and the expression of matrilysin. The one output node represented nodal metastasis. We used a Bayesian implementation of learning in neural networks using Markov Chain Monte Carlo sampling, developed by Neal.^{23} A brief description of the algorithm along with the heuristics employed follows. The priors for the network parameters (the weight and biases) were defined hierarchically, using hyperparameters that control the standard deviations for parameters in various groups. The priors were different for each group of weights, in which higher level hyperparameters were shared between all the weights in the group, thereby introducing dependencies between weights. This could be achieved using Gaussian prior distribution with mean zero and standard deviation σ, for weight given hyperparameter α. We set a vague prior p(α) to Gamma distribution given by
p(α) ~ Gamma (μ, a) ∝ α^{α/2−1} exp (−αa/2μ)
where μ is the mean and a is a shape parameter. These priors were specified using the following syntax in the software used.
[x] Width [: [Alphagroup] [: [Alphasubgroup] [: [Alphaparameter]]]]
The “Width” part of the specification was used to specify the mean of the precision, defined as τ = σ^{−2}. We used μ = 400 (corresponding to Width = 0.05). The Alpha part gave the shape parameter, α = 0.5 used in this study. Alphagroup was used when selecting the common precision for all parameters in the group. Alphasubgroup used for one of the subgroups, and Alphaparameter for a single parameter. The prefix “x” caused the width of this prior to be automatically scaled. Predictions were made using the entire posterior distribution obtained by Gibbs sampler updating for the hyperparameters, and hybrid Monte Carlo sampler updating for the network parameters. A hybrid Monte Carlo update with a trajectory 2000 leapfrog steps long, a window of 10, and a stepsize adjustment factor of 0.4, and the 600 iterations of sampling phase was used in this study. The initial one third of these nets (200 iterations) was discarded as the algorithm may need time to reach regions of high posterior probability. Networks sampled during the remainder of the run (400 iterations) were saved for making predictions. Random selection data of the 402 were designated as the training data set, and the remaining 92 served as a validation test data set.
ARD proposed by MacKay and Neal^{23,}^{24} is a hierarchical Bayesian approach where there are hyperparameters explicitly representing the relevance of different input features by modelling the width of a zero mean Gaussian prior on the parameters (fig 2). The syntax of ARD prior, “×0.2:0.5:0.5,” is meant to have two “alpha” values, including high level and lower level hyperparameters for each input. All of the analyses on Bayesian neural networks were performed using Software for Flexible Bayesian Modelling (copyright 1995–2003 by Radford M Neal, version 20030629; the software and its documentations are available from the following URL: http://www.cs.utoronto.ca/~radford/). Moreover, we performed ROC curve analysis of two predictive models developed with ROCKIT (developed by Charles E Metz, ROC software including ROCKIT and the user manual are available at the following URL: http://home.uchicago.edu/~junji/KRL_HP/rocold.htm) (fig 3).^{25}
Structural equation modelling (SEM)
We tried to infer the causality of nodal metastasis from SEM using the identical training data set of ANN analysis. Firstly, in an exploratory factor analysis, two factors were extracted with direct oblimin rotation, as one of oblique solution. Next, confirmatory factor analytic model was tested for these data. Finally, considering goodness of fit using several indices of fit (χ^{2}, χ^{2}/df, NFI, NNFI, CFI, and RMSA etc), the path diagram as a causal model for nodal metastasis, consisting of nine measurable and two latent variables, was developed. The model was statistically evaluated by t test of all paths, and r^{2} values or squared multiple correlations were calculated. Moreover, the model was modified by multivariate Largrange Multiplier and Wald test. Detailed handling of binary data in ordinal or categorical variables, such as sex, MMP7, and nodal metastasis, in this study was not explained here but depends on modification of the “LeePoonBentler” approach using polychoric and polyserial correlation coefficients.^{34} Statistical parameter estimation used the maximally likelihood method with the “ROBUST” option implemented in the software.^{35} Analysis of SEM was performed by EQS6.1 for Windows (Multivariate Software Inc.) (fig 4).
Acknowledgments
This study was supported by a grant in aid from the Ministry of Education and Science, Sports and Culture (KI), in Japan. We would like to thank Dr Peter M Olley, English Instructor, for linguistic assistance, and Dr Sonoda T, Department of Public Health at Sapporo Medical University, for statistical advice.
REFERENCES
Footnotes

Conflict of interest: None declared.
Request Permissions
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.
Copyright information:
Linked Articles
 Digest