Online Inquiry

  • *
  •   
  • *
  •   
  •   
  • *
  • Please input "lifetech" as verification code

Ordering Information

The pocketome of G protein coupled receptors reveals previously untargeted allosteric sites

Abstract

G-protein-coupled receptors do not only feature the orthosteric pockets, where most endogenous agonists bind, but also a multitude of other allosteric pockets that have come into the focus as potential binding sites for synthetic modulators. Here, to better characterise such pockets, we investigate 557 GPCR structures by exhaustively docking small molecular probes in silico and converting the ensemble of binding locations to pocket-defining volumes. Our analysis confirms all previously identified pockets and reveals nine previously untargeted sites. In order to test for the feasibility of functional modulation of receptors through binding of a ligand to such sites, we mutate residues in two sites, in two model receptors, the muscarinic acetylcholine receptor M3 and β2-adrenergic receptor. Moreover, we analyse the correlation of inter-residue contacts with the activation states of receptors and show that contact patterns closely correlating with activation indeed coincide with these sites.

Introduction

G-protein-coupled receptors (GPCRs) have evolved to transduce signals from the outside of a cell to the inside, thereby allowing the cell to respond to changes in its environment1. As a consequence of their role as transducers, GPCRs feature at least two interaction sites: one on the extracellular side, sensing the signalling agents (from photons to peptides), the other on the intracellular side, providing a place for the effector proteins to bind2. As the repertoires of extracellular signalling agents and intracellular effector proteins are quite limited, these sites are oftentimes conserved within a receptor subclass. This can pose a challenge to ligand and drug discovery efforts when the treatment of an ailment requires the selective targeting of a particular receptor subtype. An example of such a challenge are the β1- and β2-adrenergic receptors (β1- and β2AR), which differ only by a Phe/Tyr substitution in their orthosteric sites. Blockade of the β1AR in heart by beta-blockers (such as bisoprolol) is desired for cardiovascular disease, but antagonising the β2AR in lung tissue is detrimental for chronic obstructive pulmonary disease or asthma. Conversely, stimulation of the β2AR (by e.g. salmeterol) helps asthma patients but potentially damages their heart through concomitant agonism of the β1AR3.

As a possible way of circumventing this challenge of highly similar pockets, the targeting of allosteric pockets is billed as a sensible alternative4. Due to the nature of GPCRs as bundles of seven transmembrane helices that are only relatively loosely coupled5, one could indeed expect that a ligand binding to one of these pockets is able to modulate the response of a receptor. Moreover, it is generally claimed—but has never been shown—that these alternative pockets share lower sequence homology4. There are examples of individual ligands binding to non-orthosteric sites on a few receptors (e.g. refs. 6,7,8,9), but it is currently unknown to what extent such binding sites exist across the receptorome and how different or similar they are in shape and sequence.

In this work, we therefore identify and analyse the ensemble of all discernible pockets—the pocketome—of 557 GPCR structures of 113 different receptors. We discover potential pockets by exhaustive docking of small molecular probes, taking into account the different electrostatics of the solvent-exposed and transmembrane parts of the receptors, and compare these data across all receptors. Based on class A and B1 structures in active and inactive conformations, we compute residue contacts including both backbone and side chain atoms. In doing so, we identify interhelical residue contacts crucial for an active or inactive state of both class A and class B1 GPCRs (we follow the nomenclature in IUPHAR’s “Guide to Pharmacology” and refer to classes of GPCRs rather than families). We are then able to show that known and as-of-yet-untargeted (orphan) allosteric sites (abbreviated as KS and OS, respectively, in the following) contain such contacts of importance, speaking to the likelihood of their functional relevance. These computational investigations are strengthened with experimental studies of two model class A receptors, the muscarinic acetylcholine receptor M3 (M3R) and the β2AR. Through mutations of two pockets that have not been targeted by a synthetic ligand before, we demonstrate that the residues forming these pockets are indeed involved in receptor activation after stimulation with an orthosteric agonist. Last, but not least, we compare the sequence similarity of the most frequently occurring pockets, thereby providing a quantitative assessment of their overall selectivity potential. This therefore represents the currently most exhaustive analysis of the GPCR pocketome, spanning receptors from classes A, B1, B2, C, D1, and F.

Results

Probe docking & conversion to volumes

Our definition of a pocket is based on the computational docking of small molecules (probes; while the probes we used are probably too small to bind strongly to a receptor by themselves, they represent chemical moieties that are typical for GPCR ligands and are thus suited to investigate the details of cavities on receptors) to the surface of each GPCR structure individually. We therefore first show the results of our docking calculations and the conversion to volumes before turning to the identified hotspots (the pockets) themselves. Please note that, for our approach, we did not consider dimerisation of the 7TM bundle (as has been described for class C GPCRs), but rather docked to the individual monomers. Moreover, we treated each receptor structure as rigid. Exhaustively docking the 40 small, chemically diverse molecular probes (see Methods and Supplementary Table 1) into 557 structures from 113 distinct receptors, we obtained 1621367 poses in total (a more detailed description of the statistics is provided in the Supplementary Notes and Supplementary Fig. 1). We provide a list of all analysed structures together with the docking files as Supplementary Data 110.

To analyse the vast number of docked molecules in a statistical manner, we used our volumetric averaging algorithm (see Methods) in order to transform the poses of each docking into visualisable probe density maps. These maps are divided into equal volume elements, each of them giving information about how often a probe atom occupied a particular region. On average, each of the obtained maps consisted of 1000000 up to 3500000 volume elements. Since we wanted to investigate the density maps for trends across the different receptor classes, maps of individual receptors were added up for each class to yield a single map with higher populations overall.

General distribution of pockets

The class-specific density maps provided with this work can be visualised using Pymol (see Supplementary Data 210 for the grid files, template, and README) and might aid a reader with the following description. Said density maps reveal multiple contiguous regions that represent common cavities on the surface of all GPCRs analysed in this study (Fig. 1). Particularly for class A GPCRs, these pockets are distributed in a notably symmetric manner: both at the intra- and extracellular end of the 7TM bundle, pockets can be seen between each pair of adjacent helices. The density maps for the other classes are somewhat less well-defined and more scattered overall. This is owed to the lower numbers of structures and therefore poorer statistics, as individual structures—and possible deviations in them—carry a relatively higher weight than for the more numerous class A structures.

Fig. 1: Representative depiction of the GPCR pocketome.

Cumulative densities for all class A GPCRs (orange volumes) are shown projected on the structure with PDB 1F88 ([https://doi.org/10.2210/pdb1F88/pdb]; ice blue ribbon). The surface is indicated white transparent. Visible hotspots (pockets) located at the lipid-facing receptor portion around the 7TM bundle are labelled either as OS (Orphan Site) or KS (Known Site). A more detailed description of their location is provided in the text and Table 1. We note that OS3 was described in the most recent X-ray structure PDB 7M3J64 [https://doi.org/10.2210/pdb7M3J/pdb] during writing of this manuscript. We therefore re-labelled this site to KS12. Furthermore, OS4 was not found in the class A densities. Three known and three orphan pockets (red labels) are discussed in more detail in the text. The red sphere indicates the tip of HVIII and has been included for ease of orientation. Source data are provided as a source_data.xlsx file.

Here, we present only those pockets that we will discuss and examine in depth, whereas the rest of them is described in the Supplementary Notes. We chose to focus on three of the largest and—by our analysis—best-defined orphan sites and contrast them with an equal number of known sites, which we picked because they are clearly defined and because they host synthetic ligands. While the vast majority of sites defined by the densities is located at the outward-facing receptor portion (i.e. receptor residues in contact with the membrane), we also were able to identify regions of density inside the 7TM bundle. In each class, a large interhelical site (Interhelical Binding Site 1, IBS1) and adjacent secondary binding pockets (IBS2 and IBS3) can clearly be discerned. Whereas IBS1 represents the classic orthosteric site in class A GPCRs, it forms—together with the extra-cellular domain (ECD)—the peptide binding site in class B GPCRs. Furthermore, IBS2 and IBS3 are two known exosites in class A GPCRs. Since the orthosteric site of class C receptors is located in the extracellular Venus flytrap (VFT) domain, IBS1 is commonly referred to as an allosteric site in class C receptors. Our methodology was able to correctly depict the size and shape of these known pockets for different classes, and we therefore hypothesized that the other pockets identified in this work can indeed also host ligands. By aligning our density maps with each other, one can see that the average IBS1 for class C receptors protrudes significantly deeper than the one of class B1, which again goes slightly deeper than the one in class A. This is perfectly consistent with experimental evidence11. Due to the overall higher flexibility and thus often worse resolution of extra- and intracellular loops, pockets found within these regions will not be further analysed or discussed. Comparing the densities on the outward-facing receptor portion for all analysed GPCR classes, we assigned pocket identifiers to several volumes that appeared well-defined and clearly distinct from their neighbouring densities. This facilitated later analysis and provided the means for a common orientation and discussion. However, since not only the GPCR structures themselves but also the density map shapes differ across the classes, the reader’s view on whether a particular region is an individual pocket might differ from ours. That being said, our general conclusions are independent of any such small differences in definitions. The full list of pockets is presented in Table 1 and Supplementary Table 2. Going around the 7TM bundle, one can observe regions of density at the upper and lower ends between helices V and VI. These sites are referred to as KS12 and OS5, respectively. For some classes, another separated hotspot resides right between these two sites (OS4). At the lower end of the 7TM bundle, OS5 shows a large spot for classes A and F. When directly compared to class A, the density of class B1 is subdivided into multiple regions. While for classes B2 and C a small hotspot is visible, class D1 only shows some fragmented density in front of helix V.

Another larger spot is visible between helices I and VII above helix VIII for classes A, B1, C, and F (OS9). The classes B2 and D1 maps only show a small spot in this region, which might be due to the lack of helix VIII in the available structures.

Encouragingly, we identified density near the region of the sodium binding pocket (SODIUM) for some classes. While classes A, B1, and F show somewhat weaker densities, the class C IBS1 extends down into this region, which makes it clearly defined.

Lastly, two regions of density were found at the intracellular portion of the 7TM bundle for all classes. Here, one spot could be identified as the G-Protein binding site between helices II, III, V and VI (GPROT). Adjacent to it, density for KS11 resides between helices I, II, VII, and VIII.

Despite the fact that we only considered monomeric subunits of the 7TM bundle in our calculations, our methodology was able to also reveal all dimerisation interfaces, which have predominantly been described for class C GPCRs. The conserved helix VI-helix VI dimerisation interface in active-state class C receptors encompasses KS7, KS8, KS9, and partially OS5 and is known to bind positive allosteric modulators (PAMs)12,13. Two other dimerisation interfaces can be found between helix III-helix IV (mGlu2) or helix III-helix V (GABAB) in inactive-state class C GPCRs14,15. While the former is mainly formed by residues at the extracellular end of the helices and is thus represented by KS2, the latter dimerisation interface is located in the region of KS5.

1.Hilger, D., Masureel, M. & Kobilka, B. K. Structure and dynamics of GPCR signaling complexes. Nat. Struct. Mol. Biol. 25, 4–12 (2018).
2.Weis, W. I. & Kobilka, B. K. The molecular basis of G protein-coupled receptor activation. Annu. Rev. Biochem. 87, 897–919 (2018).
3.Baker, J. G. & Wilcox, R. G. β-blockers, heart disease and COPD: current controversies and uncertainties. Thorax 72, 271–276 (2017).
4.Christopoulos, A. Advances in G protein-coupled receptor allostery: From function to structure. Mol. Phys. 86, 463–478 (2014).
 5. Hilger, D. The role of structural dynamics in GPCR-mediated signaling. FEBS J. 288, 2461–2489 (2021).
6. Oswald, C. et al. Intracellular allosteric antagonism of the CCR9 receptor. Nature 540, 462–465 (2016).
7. Zheng, Y. et al. Structure of CC chemokine receptor 2 with orthosteric and allosteric antagonists. Nature 540, 458–461 (2016).
8. Liu, X. et al. Mechanism of intracellular allosteric β2AR antagonist revealed by X-ray crystal structure. Nature 548, 480–484 (2017).
9. Song, G. et al. Human GLP-1 receptor transmembrane domain structure in complex with allosteric modulators. Nature 546, 312–315 (2017).
10. Hedderich, J. B. & Kolb, P. The pocketome of G-protein-coupled receptors reveals previously untargeted allosteric sites. (2022).
11. Munk, C., Harpsøe, K., Hauser, A. S., Isberg, V. & Gloriam, D. E. Integrating structural and mutagenesis data to elucidate GPCR ligand binding. Curr. Opin. Pharmacol. 30, 51–58 (2016).
12. Koehl, A. et al. Structural insights into the activation of metabotropic glutamate receptors. Nature 566, 79–84 (2019).
13. Shaye, H. et al. Structural basis of the activation of a metabotropic GABA receptor. Nature 584, 298–303 (2020).
14. Du, J. et al. Structures of human mGlu2 and mGlu7 homo-and heterodimers. Nature 594, 589–593 (2021).
15. Papasergi-Scott, M. M. et al. Structures of metabotropic GABA B receptor. Nature 584, 310–314 (2020).