
Table 1 Names and short descriptions of the 27 features in the dataset for the classification of individual residues

From: Application of an interpretable classification model on Early Folding Residues during protein folding

| Feature | Description |
| --- | --- |
| e | Computed energy values |
| ePred | Predicted energy values |
| SecSize | Size of the surrounding secondary structure elements |
| LF | Fraction of surrounding unordered secondary structure elements |
| Rasa | Relative accessible surface area |
| PlipLC | Absolute count of local PLIP contacts |
| PlipHbLC | Absolute count of local PLIP hydrogen bonds |
| PlipHpLC | Absolute count of local PLIP hydrophobic interactions |
| PlipBbLC | Absolute count of local PLIP backbone contacts |
| PlipLR | Absolute count of long-range PLIP contacts |
| PlipHbLR | Absolute count of long-range PLIP hydrogen bonds |
| PlipHpLR | Absolute count of long-range PLIP hydrophobic interactions |
| PlipBbLR | Absolute count of long-range PLIP backbone contacts |
| PlipBN | Betweenness using all PLIP contacts |
| PlipCL | Closeness using all PLIP contacts |
| PlipCC | Clustering coefficient using all PLIP contacts |
| PlipHbBN | Betweenness using PLIP hydrogen bonds |
| PlipHbCL | Closeness using PLIP hydrogen bonds |
| PlipHbCC | Clustering coefficient using PLIP hydrogen bonds |
| PlipHpBN | Betweenness using PLIP hydrophobic interactions |
| PlipHpCL | Closeness using PLIP hydrophobic interactions |
| PlipHpCC | Clustering coefficient using PLIP hydrophobic interactions |
| ConvBN | Betweenness using the distance-based contact definition |
| ConvCL | Closeness using the distance-based contact definition |
| ConvCC | Clustering coefficient using the distance-based contact definition |
| PlipNC | Distinct neighborhood count using all PLIP contacts |
| ConvNC | Distinct neighborhood count using the distance-based contact definition |

References to these features are given in italic font.
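
Several features in the table are graph measures on a residue contact network: local and long-range contact counts, betweenness, closeness, and clustering coefficient. The following is a minimal sketch of how such quantities could be computed, not the authors' implementation. The toy contact list, the sequence-separation cutoff `LONG_RANGE_MIN_SEPARATION`, all function names, and the unnormalized forms of closeness and betweenness are assumptions made for illustration; the table itself does not state these definitions.

```python
from collections import deque

# Hypothetical toy contact list: pairs of residue indices in one chain.
CONTACTS = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 9)]
# Assumed cutoff: a contact is long-range if the residues are at least
# this many positions apart in sequence. The table does not give a value.
LONG_RANGE_MIN_SEPARATION = 6


def build_graph(contacts):
    """Build undirected adjacency sets keyed by residue index."""
    graph = {}
    for i, j in contacts:
        graph.setdefault(i, set()).add(j)
        graph.setdefault(j, set()).add(i)
    return graph


def contact_counts(graph, node, min_sep=LONG_RANGE_MIN_SEPARATION):
    """Return (local, long_range) contact counts for one residue."""
    long_range = sum(1 for n in graph[node] if abs(n - node) >= min_sep)
    return len(graph[node]) - long_range, long_range


def clustering_coefficient(graph, node):
    """Fraction of neighbor pairs that are themselves in contact."""
    nbrs = sorted(graph[node])
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(
        1 for a in range(k) for b in range(a + 1, k) if nbrs[b] in graph[nbrs[a]]
    )
    return 2.0 * links / (k * (k - 1))


def closeness(graph, node):
    """Number of reachable residues divided by the summed path lengths."""
    dist = {node: 0}
    queue = deque([node])
    while queue:  # breadth-first search for shortest path lengths
        v = queue.popleft()
        for w in graph[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    total = sum(dist.values())
    return (len(dist) - 1) / total if total else 0.0


def betweenness(graph):
    """Brandes' algorithm for an unweighted, undirected graph (unnormalized)."""
    score = {v: 0.0 for v in graph}
    for s in graph:
        stack = []
        preds = {v: [] for v in graph}
        sigma = {v: 0 for v in graph}  # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            stack.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in graph}
        while stack:  # accumulate dependencies in reverse BFS order
            w = stack.pop()
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was visited from both endpoints.
    return {v: b / 2 for v, b in score.items()}


graph = build_graph(CONTACTS)
print(contact_counts(graph, 3), clustering_coefficient(graph, 2))
print(closeness(graph, 2), betweenness(graph))
```

In this sketch the same functions would be applied to differently filtered contact lists (all contacts, hydrogen bonds only, hydrophobic interactions only, or distance-based contacts) to obtain the corresponding feature variants. Other normalization conventions for closeness and betweenness exist and may be the ones used in the source.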