A set of 648 cleavage sites and 648 parts of mature proteins shortly after cleavage sites derived from human proteome.
A data frame with 1296 observations on the following 10 variables. Columns from
P1
to P9
describes positions in an extracted peptide. tar
is a target vector. It
has value 1 if a peptide is a cleavage site and 0 if not.
Each peptide in the data set is nine amino acid residues long. In case of cleavage sites, the clevage is located between fifth and sixth peptide. The non-cleavage sites are parts of mature proteins starting five positions after cleavage site.
Amino acid residues were recoded as integers.
#> #> 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 #> 136 36 57 64 23 120 29 32 56 97 16 21 160 71 66 128 86 64 17 17