Figure 2 | BioData Mining

Figure 2

From: Graph representation of high-dimensional alpha-helical membrane protein data

Motif extraction. The pseudocode describes a naive procedure for extracting variable-length words from any string. We applied this procedure to extract motifs from protein sequences. For each sequence position i, the algorithm returns the starting amino acid X at position i and the ending amino acid Y at position i + n of the extracted word; together these represent the motif. The resulting list consists of motifs written in the form XYn, with n ∈ {4, 5, 6, 7}. The list contains no duplicate entries: it is updated only if the current motif is not already present.
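The procedure described in the caption can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the function name `extract_motifs`, the `distances` parameter, and the choice to write each motif as the concatenated string "XYn" (for example `AL7`) are assumptions made for this sketch. It also assumes the outer loop runs over positions i and the inner loop over distances n, and that the list keeps motifs in the order they are first seen.

```python
from typing import Iterable, List


def extract_motifs(sequence: str, distances: Iterable[int] = range(4, 8)) -> List[str]:
    """Return the unique XYn motifs of a sequence, in first-seen order.

    X is the residue at position i, Y is the residue at position i + n,
    and n is the distance between them (4 to 7 by default).
    The "XYn" string layout is an assumption made for illustration.
    """
    distances = tuple(distances)  # allow any iterable to be reused per position
    motifs: List[str] = []        # ordered result without duplicates
    seen = set()                  # fast membership test for the duplicate check

    for i, start in enumerate(sequence):
        for n in distances:
            if i + n >= len(sequence):
                continue  # the word would run past the end of the sequence
            motif = f"{start}{sequence[i + n]}{n}"
            if motif not in seen:  # update the list only for new motifs
                seen.add(motif)
                motifs.append(motif)
    return motifs


if __name__ == "__main__":
    print(extract_motifs("ABCDEFGH"))
```

For the toy string `"ABCDEFGH"` this yields ten motifs, starting with `AE4`, `AF5`, `AG6`, `AH7`. A string of nine identical letters yields only four (`AA4` to `AA7`), because repeated motifs are not added again.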
