Table 4

Assessment of linearity and non-linearity on subsite–molecular relationship in colorectal cancer by multivariate logistic or linear regression model

Multivariate regression modelOutcome variable (molecular feature)Bowel subsite variable (from rectum to ascending colon)Squared subsite variableCubic subsite variableLRT
p Value (Wald test)Includedp Value (Wald test)Includedp Value (Wald test)Degrees of freedomp Value*
LogisticCIMP<0.0001NoNoReferent
0.0017Yes0.17No10.17
0.68Yes0.15Yes0.09620.098
LogisticMSI<0.0001NoNoReferent
0.020Yes0.56No10.56
0.93Yes0.48Yes0.4220.61
LogisticBRAF mutation<0.0001NoNoReferent
0.0041Yes0.26No10.26
0.56Yes0.76Yes0.6320.47
LogisticKRAS mutation0.66NoNoReferent
0.48Yes0.42No10.42
0.16Yes0.19Yes0.2320.35
LogisticPIK3CA mutation0.0034NoNoReferent
0.070Yes0.20No10.20
0.096Yes0.23Yes0.3020.26
LinearLINE-1 methylation level0.020NoNoReferent
0.0070Yes0.0006No10.0036
0.036Yes0.0022Yes<0.00012<0.0001
  • A multivariate regression model included age, sex, year of diagnosis, family history of colorectal cancer, body mass index, physical activity, smoking, alcohol consumption and the bowel subsite variable with or without the squared and cubic subsite variables, as indicated in the table.

  • * A significant p value by the LRT indicates a non-linearity, and a combination of insignificant p values by LRT and a significant p value by the Wald test on the subsite variable in the model without the squared or cubic location variable indicates a linear relationship.

  • CIMP, CpG island methylator phenotype; LRT, likelihood ratio test; MSI, microsatellite instability.