HLA-C*06:02 binding "ARFNDLRFV" at 2.80Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-C*06:02
ARFNDLRFV
Species
Locus / Allele group
The molecular basis for peptide repertoire selection in the Human Leucocyte Antigen (HLA) C*06:02 molecule.
Human leukocyte antigen (HLA)-C*06:02 is identified as the allele associated with the highest risk for the development of the autoimmune skin disease psoriasis. However, the diversity and mode of peptide presentation by the HLA-C*06:02 molecule remains unclear. Here, we describe the endogenous peptide repertoire of ∼3,000 sequences for HLA-C*06:02 that defines the peptide-binding motif for this HLA allomorph. We found that HLA-C*06:02 predominantly presents nonamer peptides with dominant arginine anchors at the P2 and P7 positions and a preference for small hydrophobic residues at the C terminus (PΩ). To determine the structural basis of this selectivity, we determined crystal structures of HLA-C*06:02 in complex with two self-peptides (ARTELYRSL and ARFNDLRFV) and an analogue of a melanocyte autoantigen (ADAMTSL5, VRSRR-abu-LRL) implicated in psoriasis. These structures revealed that HLA-C*06:02 possesses a deep peptide-binding groove comprising two electronegative B- and E-pockets that coincide with the preference for P2 and P7 arginine anchors. The ADAMTSL5 autoantigen possessed a P7-Leu instead of the P7-Arg residue, but nevertheless was accommodated within the HLA-C*06:02 antigen-binding cleft. Collectively, our results provide the structural basis for understanding peptide repertoire selection in HLA-C*06:02.
Structure deposition and release
Data provenance
Publication data retrieved from PDBe REST API8 and PMCe REST API9
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
ALA
GLU63
TRP167
PHE33
MET5
TYR171
TYR7
TYR159
TYR59
LYS66
|
P2
ARG
SER24
TYR99
GLN70
TYR159
GLU63
LYS66
TYR67
ASP9
PHE22
TYR7
|
P3
PHE
GLN155
TRP156
TYR159
LYS66
TYR99
|
P4
ASN
TYR159
LYS66
|
P5
ASP
GLN70
GLN155
|
P6
LEU
ARG69
ALA73
GLU152
GLN70
LYS66
|
P7
ARG
ALA73
ASP114
GLN70
ASN77
ASP74
TYR99
TRP147
TRP156
ASP9
PHE22
TRP97
|
P8
PHE
VAL76
THR143
TRP147
LYS146
ASN77
ALA73
|
P9
VAL
LEU81
ASN77
THR143
TRP147
TYR84
TYR123
ILE142
LYS146
TRP97
LYS80
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
A Pocket
TYR159
THR163
TRP167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
|
B Pocket
SER24
VAL34
GLY45
GLU63
LYS66
TYR67
TYR7
GLN70
ASP9
TYR99
|
C Pocket
GLN70
ALA73
ASP74
ASP9
TRP97
|
D Pocket
ASP114
GLN155
TRP156
TYR159
LEU160
TYR99
|
E Pocket
ASP114
TRP147
GLU152
TRP156
TRP97
|
F Pocket
SER116
TYR123
THR143
LYS146
TRP147
ASN77
LYS80
LEU81
TYR84
LEU95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD 70 80 90 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM |
2. Class I alpha
HLA-C*06:02
IPD-IMGT/HLA
[ipd-imgt:HLA35196] |
10 20 30 40 50 60
CSHSMRYFDTAVSRPGRGEPRFISVGYVDDTQFVRFDSDAASPRGEPRAPWVEQEGPEYW 70 80 90 100 110 120 DRETQKYKRQAQADRVNLRKLRGYYNQSEDGSHTLQWMYGCDLGPDGRLLRGYDQSAYDG 130 140 150 160 170 180 KDYIALNEDLRSWTAADTAAQITQRKWEAAREAEQWRAYLEGTCVEWLRRYLENGKETLQ 190 200 210 220 230 240 RAEHPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 250 260 270 FQKWAAVVVPSGEEQRYTCHVQHEGLPEPLTLRWEP |
3. Peptide
|
ARFNDLRFV
|
Data provenance
Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.
Downloadable data
Complete structures
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.