Skip to main content

Table 2 Summary of simulation studies with hsegHMM-N and hsegHMM-T models based on 500 simulated datasets

From: hsegHMM: hidden Markov model-based allele-specific copy number alteration analysis accounting for hypersegmentation

  

hsegHMM-N

hsegHMM-T

Simulation 1

 

True

Est

SEs

SE\(^{\ast }_{\mathrm {H}}\)

Est

SEs

SE H

ψ

1.6

1.61

0.018 â–³

0.009

1.60

0.014

0.007

α

0.9

0.90

0.004

0.003

0.90

0.003

0.003

κ 2

0.3

N/A

0.30

0.007

0.007

V(W)

0.6

0.55

0.039

0.008

0.61 a

0.017

0.020 b

Ï„ 2

0.5

0.50

0.033

0.025

0.50

0.028

0.024

v

4

N/A

3.91

0.150

0.159

Simulation 2

 

True

Est

SEs

SE H

Est

SEs

SE H

ψ

1.6

1.62

0.028

0.015

1.60

0.013

0.011

α

0.9

0.90

0.003

0.003

0.90

0.003

0.003

κ 2

N/A

N/A

0.64

0.017

0.018

V(W)

0.65

1.46

0.055

0.023

2.28 a

0.133

0.159 b

Ï„ 2

0.5

0.48

0.026

0.024

0.49

0.025

0.024

v

N/A

N/A

2.79

0.076

0.093

  1. Simulation 1 and Simulation 2 are the t-distribution-based and the normal-mixture-based studies. Each dataset consists of 10,000 observations of logR and logOR. Est is average estimates from 500 datasets ;ψ is the ploidy, α is the tumor purity; κ2 is the variance component of logR in hsegHMM-T; V(W) and τ2 are the variance of logR and logOR in both models, respectively; SEs indicates the Monte-Carlo standard errors calculated from 500 datasets; SEH indicates the average asymptotic standard errors of estimates based on the Hessian matrices
  2. ∗ the average asymptotic standard errors based on the hsegHMM-N model are reported based on 486 datasets where 2.8% of 500 datasets cannot produce invertable Hessian matrices due to numerical problems
  3. aV(W)=E(V(W|u))+V(E(W|u))\(=\kappa ^{2}\times \frac {v}{v-2}\)
  4. bthe asymptotic standard error of V(W) with the hsegHMM-T is calculated by using the Delta method
  5. â–³ The distribution of the ploidy estimates is skewed so the SE s of the ploidy appears to be larger than SE H. Using the scaled MAD (median absolute deviation) gives a closer value (0.008) to SE H; \(MAD=1.4826\times med(|\widehat {\theta }_{m}-\widehat {\theta }_{med}|)\), where \(\widehat {\theta }_{m}\) is the estimate for the mth dataset and \(\widehat {\theta }_{med}\) is the median calculated from 500 simulated datasets