Table 3 Variable number of SNPs: the experiment shows the effect of increasing the number of SNPs on the choice of the reference sequence

From: Optimal reference sequence selection for genome assembly using minimum description length principle

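| Ref. Seq. | SNPs | No. of inversions | No. of insertions | No. of deletions | Code-length using proposed scheme (Kb) |
|-----------|------|-------------------|-------------------|------------------|----------------------------------------|
| 1         | 183  | 52 / 52           | 62 / 59           | 62               | 1815.14                                |
| 2         | 224  | 50 / 51           | 66 / 58           | 63               | 1843.35                                |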

  1. SR 2 has more SNPs than SR 1 (224 vs. 183). SR 1 has the smaller code-length (1815.14 Kb vs. 1843.35 Kb), so it is the model of choice. The results show that the MDL scheme handles a variable number of SNPs successfully, choosing the reference sequence with fewer SNPs.
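
The selection rule described in the note can be illustrated with a minimal Python sketch. It only applies the "smallest code-length wins" comparison to the two values reported in the table; it does not compute the code-lengths, and the names `table3` and `select_reference` are hypothetical, introduced here for illustration.

```python
# Values copied from Table 3: SNP count and reported code-length (Kb)
# for each candidate reference sequence. The dictionary layout is an
# assumption made for this sketch, not the authors' data structure.
table3 = {
    "SR 1": {"snps": 183, "code_length_kb": 1815.14},
    "SR 2": {"snps": 224, "code_length_kb": 1843.35},
}


def select_reference(candidates):
    """Return the name of the candidate with the smallest code-length."""
    return min(candidates, key=lambda name: candidates[name]["code_length_kb"])


best = select_reference(table3)

# Difference in code-length between the two candidates, in Kb.
gap_kb = table3["SR 2"]["code_length_kb"] - table3["SR 1"]["code_length_kb"]

print(f"Selected reference: {best}")
print(f"Code-length gap: {gap_kb:.2f} Kb")
```

With the table's values, the sketch selects SR 1, the candidate with both fewer SNPs and the shorter code-length.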