Table 1. Distribution of sequences across enzyme classes in the combined training and test data

From: A top-down approach to classify enzyme functional classes and sub-classes using random forest

| Class | Sub-classes | Number of sequences |
| --- | --- | --- |
| 1 Oxidoreductases | 1.1, 1.2, 1.3, 1.4, 1.5, 1.10, 1.16 | 986 |
| 2 Transferases | 2.1, 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8 | 734 |
| 3 Hydrolases | 3.1, 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8, 3.11 | 674 |
| 4 Lyases | 4.1, 4.2, 4.3, 4.4, 4.6, 4.99 | 828 |
| 5 Isomerases | 5.1, 5.2, 5.3, 5.4, 5.5 | 664 |
| 6 Ligases | 6.1, 6.2, 6.3, 6.4 | 845 |

  1. The sequences extracted from the SWISS-PROT enzyme database are spread over a total of 40 sub-classes. Sequences were extracted from the sub-classes with the largest banks of sequences. The numbers of sequences shown are for sequences with 100% reduced identity.