Relative accessible surface area

Relative accessible surface area or relative solvent accessibility (RSA) of a protein residue is a measure of residue solvent exposure. It can be calculated by formula:

\text{RSA} = \text{ASA} / \text{MaxASA}

{{cite journal

| last1 = Tien | first1 = M. Z.

| last2 = Meyer | first2 = A. G.

| last3 = Sydykova | first3 = D. K.

| last4 = Spielman | first4 = S. J.

| last5 = Wilke | first5 = C. O.

| date = 2013

| title = Maximum allowed solvent accessibilites of residues in proteins

| journal = PLOS ONE

| volume = 8

| number = 11

| pages = e80635

| doi = 10.1371/journal.pone.0080635| pmid = 24278298 | pmc=3836772

| bibcode = 2013PLoSO...880635T

| arxiv = 1211.4251

| doi-access = free

}}

where ASA is the solvent accessible surface area and MaxASA is the maximum possible solvent accessible surface area for the residue. Both ASA and MaxASA are commonly measured in {\AA}^2 .

To measure the relative solvent accessibility of the residue side-chain only, one usually takes MaxASA values that have been obtained from Gly-X-Gly tripeptides, where X is the residue of interest. Several MaxASA scales have been published

{{cite journal

| last1 = Miller | first1 = S.

| last2 = Janin | first2 = J.

| last3 = Lesk | first3 = A. M.

| last4 = Chothia | first4 = C.

| date = 1987

| title = Interior and surface of monomeric proteins

| journal = J. Mol. Biol.

| volume = 196

| issue = 3

| pages = 641–656 | doi=10.1016/0022-2836(87)90038-6| pmid = 3681970

}}

{{ cite journal

| last1 = Rose | first1 = G. D.

| last2 = Geselowitz | first2 = A. R.

| last3 = Lesser | first3 = G. J.

| last4 = Lee | first4 = R. H.

| last5 = Zehfus | first5 = M. H.

| date = 1985

| title = Hydrophobicity of amino acid residues in globular proteins

| journal = Science

| volume = 229

| issue = 4716

| pages = 834–838 | doi=10.1126/science.4023714| pmid = 4023714

| bibcode = 1985Sci...229..834R

| s2cid = 22227053

}}

and are commonly used (see Table).

class="wikitable sortable"
ResidueTien et al. 2013 (theor.)Tien et al. 2013 (emp.)Miller et al. 1987Rose et al. 1985
Alanine129.0121.0113.0118.1
Arginine274.0265.0241.0256.0
Asparagine195.0187.0158.0165.5
Aspartate193.0187.0151.0158.7
Cysteine167.0148.0140.0146.1
Glutamate223.0214.0183.0186.2
Glutamine225.0214.0189.0193.2
Glycine104.097.085.088.1
Histidine224.0216.0194.0202.5
Isoleucine197.0195.0182.0181.0
Leucine201.0191.0180.0193.1
Lysine236.0230.0211.0225.8
Methionine224.0203.0204.0203.4
Phenylalanine240.0228.0218.0222.8
Proline159.0154.0143.0146.8
Serine155.0143.0122.0129.8
Threonine172.0163.0146.0152.5
Tryptophan285.0264.0259.0266.3
Tyrosine263.0255.0229.0236.8
Valine174.0165.0160.0164.5

In this table, the more recently published MaxASA values (from Tien et al. 2013) are systematically larger than the older values (from Miller et al. 1987 or Rose et al. 1985). This discrepancy can be traced back to the conformation in which the Gly-X-Gly tripeptides are evaluated to calculate MaxASA. The earlier works used the extended conformation, with backbone angles of \phi=-120^\circ and \psi=140^\circ. However, Tien et al. 2013 demonstrated that tripeptides in extended conformation fall among the least-exposed conformations. The largest ASA values are consistently observed in alpha helices, with backbone angles around \phi=-50^\circ and \psi=-45^\circ. Tien et al. 2013 recommend to use their theoretical MaxASA values (2nd column in Table), as they were obtained from a systematic enumeration of all possible conformations and likely represent a true upper bound to observable ASA.

ASA and hence RSA values are generally calculated from a protein structure, for example with the software DSSP.

{{cite journal

| last1 = Kabsch | first1 = W.

| last2 = Sander | first2 = C.

| date = 1983

| title = Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features

| journal = Biopolymers

| volume = 22

| issue = 12

| pages = 2577–2637

| doi=10.1002/bip.360221211

| pmid=6667333| s2cid = 29185760

}}

However, there is also an extensive literature attempting to predict RSA values from sequence data, using machine-learning approaches.

{{cite web

| url = http://www.cc.gatech.edu/~hpark/papers/surface.pdf

| title = Prediction of Protein Relative Solvent Accessibility with Support Vector Machines and Long-range Interaction 3D Local Descriptor

| date = 2003

| accessdate = 10 April 2015

| last1 = Hyunsoo | first1 = Kim

| last2 = Haesun | first2 = Park | author2-link = Haesun Park}}

{{cite journal

| last1 = Rost | first1 = Burkhard

| last2 = Sander | first2 = Chris

| title = Conservation and prediction of solvent accessibility in protein families

| journal = Proteins

| date = 1994

| volume = 20

| issue = 3

| pages = 216–26

| pmid = 7892171

| url = https://www.researchgate.net/publication/15307400

| accessdate = 10 April 2015

| doi = 10.1002/prot.340200303| s2cid = 19285647

}}

Prediction tools

Experimentally predicting RSA is an expensive and time-consuming task. In recent decades, several computational methods have been introduced for RSA prediction.{{Cite journal|last1=Kaleel|first1=Manaz|last2=Torrisi|first2=Mirko|last3=Mooney|first3=Catherine|last4=Pollastri|first4=Gianluca|date=2019-09-01|title=PaleAle 5.0: prediction of protein relative solvent accessibility by deep learning|journal=Amino Acids|language=en|volume=51|issue=9|pages=1289–1296|doi=10.1007/s00726-019-02767-6|pmid=31388850|issn=1438-2199|hdl=10197/11324|s2cid=199469523|hdl-access=free}}{{Cite journal|last1=Wang|first1=Sheng|last2=Li|first2=Wei|last3=Liu|first3=Shiwang|last4=Xu|first4=Jinbo|date=2016-07-08|title=RaptorX-Property: a web server for protein structure property prediction|url= |journal=Nucleic Acids Research|language=en|volume=44|issue=W1|pages=W430–W435|doi=10.1093/nar/gkw306|pmid=27112573|pmc=4987890|issn=0305-1048}}{{Cite journal|last1=Magnan|first1=Christophe N.|last2=Baldi|first2=Pierre|date=2014-09-15|title=SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity|url= |journal=Bioinformatics|language=en|volume=30|issue=18|pages=2592–2597|doi=10.1093/bioinformatics/btu352|pmid=24860169|pmc=4215083|issn=1367-4803}}

References