Sequential model selection‐based segmentation to detect DNA copy number variation

Publisher: John Wiley & Sons Inc

E-ISSN: 1541-0420|72|3|815-826

ISSN: 0006-341x

Source: BIOMETRICS, Vol.72, Iss.3, 2016-09, pp. : 815-826

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract

Summary

Array‐based CGH experiments are designed to detect genomic aberrations or regions of DNA copy‐number variation that are associated with an outcome, typically a state of disease. Most of the existing statistical methods target on detecting DNA copy number variations in a single sample or array. We focus on the detection of group effect variation, through simultaneous study of multiple samples from multiple groups. Rather than using direct segmentation or smoothing techniques, as commonly seen in existing detection methods, we develop a sequential model selection procedure that is guided by a modified Bayesian information criterion. This approach improves detection accuracy by accumulatively utilizing information across contiguous clones, and has computational advantage over the existing popular detection methods. Our empirical investigation suggests that the performance of the proposed method is superior to that of the existing detection methods, in particular, in detecting small segments or separating neighboring segments with differential degrees of copy‐number variation.