For citing, please use (manuscript submitted).

Name: Smeets et al. (24 arrays)Group A: HPV+ (12)X1Cy3_042204_1015.1Cy5_042204_1015, X1Cy3_042904_2138.1Cy5_042904_2138, X2Cy3_042204_1035.2Cy5_042204_1035, X2Cy3_042904_2156.2Cy5_042904_2156, X2Cy3_050604_2159.2Cy5_050604_2159, X3Cy3_042904_2211.3Cy5_042904_2211, X3Cy3_050604_2214.3Cy5_050604_2214, X4Cy3_042904_2225.4Cy5_042904_2225, X4Cy3_050604_2229.4Cy5_050604_2229, X4Cy3_073004_1644.4Cy5_073004_1644, X5Cy3_050604_2242.5Cy5_050604_2242, X6Cy3_052604_1102.6Cy5_052604_1102Group B: HPV- (12)X1Cy3_052604_0942.1Cy5_052604_0942, X1Cy3_052804_1314.1Cy5_052804_1314, X1Cy3_073004_1601.1Cy5_073004_1601, X2Cy3_052604_0956.2Cy5_052604_0956, X2Cy3_052804_1328.2Cy5_052804_1328, X2Cy3_073004_1615.2Cy5_073004_1615, X3Cy3_052604_1021.3Cy5_052604_1021, X3Cy3_052804_1347.3Cy5_052804_1347, X3Cy3_073004_1628.3Cy5_073004_1628, X4Cy3_052604_1035.4Cy5_052604_1035, X4Cy3_052804_1401.4Cy5_052804_1401, X5Cy3_052604_1047.5Cy5_052604_1047Unused: (0)

Average power plotted as a function of sample size.

Copy number profiles produced with CGHcall. The mean probability of losses is shown in red, and the values can be red from the Y axis. Mean probability of gains is in green, and the values are 1 - the value from the Y axis. Output from CGHregions. Chromosomes are plotted individually, and each bump represents a breakpoint between regions. The loss/gain frequencies are shown in red/green.

To help interpreting the these plots, comparison can be made to the evaluation data sets. The two estimators of G should not be in too much of a disagreement compared to each other. If the difference is severe, the quality of parameter estimation is questionable, and so is the reliability of the power calculations. The density of the p values should increase for small values and the function should be convex. Gamma is the proportion of non-differentially behaving regions. The plot shows the distance between the two estimators of G, and the value of gamma is chosen so that this difference is minimized. The function should have a minimum somewhere in the middle. A minimum very close to 0 is a sign of problems in parameter estimation. Effect size is the difference between the groups. The function can have one ore more peaks, depending on the particular data set. Skewness of the RWLRs is plotted and that of a normal distribution is superimposed in red. RWLRs are assumed to be approximately normally distributed, so a large deviation might negatively affect the performance of the method. Kurtosis of the RWLRs is plotted and that of a normal distribution is superimposed in red. RWLRs are assumed to be approximately normally distributed, so a large deviation might negatively affect the performance of the method.