Skip to main content

Table 4 Variable number of insertions: the experiment shows the effect of increasing the number of insertions on choice of the reference sequence

From: Optimal reference sequence selection for genome assembly using minimum description length principle

Ref. Seq.

SNPs

No. of inversions

No. of insertions

No. of deletions

Code-length using proposed scheme (Kb)

1

0

0

136 / 196

0

1200.3

2

0

0

132 / 203

0

1228.25

  1. The location and length of these insertions was chosen randomly. 136 196 shows that out of 196 insertions in SR 1 only 136 were removed. The remaining insertions were not recovered due to the choice of τ1 and τ2. SR 2 has higher number of insertions as opposed to SR 1. The code-length suggests that SR 1is the model of choice as it has a smaller code-length.