HLA-A*02:01 presenting "GLCTLVAML" to Alpha/Beta T cell receptor at 2.54Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
Class I with peptide and alpha beta TCR
HLA-A*02:01
GLCTLVAML
TRAV5
TRBV20
Species
Locus / Allele group
Genetic and structural basis for selection of a ubiquitous T cell receptor deployed in Epstein-Barr virus infection.
Despite the ∼10^18 αβ T cell receptor (TCR) structures that can be randomly manufactured by the human thymus, some surface more frequently than others. The pinnacles of this distortion are public TCRs, which exhibit amino acid-identical structures across different individuals. Public TCRs are thought to result from both recombinatorial bias and antigen-driven selection, but the mechanisms that underlie inter-individual TCR sharing are still largely theoretical. To examine this phenomenon at the atomic level, we solved the co-complex structure of one of the most widespread and numerically frequent public TCRs in the human population. The archetypal AS01 public TCR recognizes an immunodominant BMLF1 peptide, derived from the ubiquitous Epstein-Barr virus, bound to HLA-A*0201. The AS01 TCR was observed to dock in a diagonal fashion, grasping the solvent exposed peptide crest with two sets of complementarity-determining region (CDR) loops, and was fastened to the peptide and HLA-A*0201 platform with residue sets found only within TCR genes biased in the public response. Computer simulations of a random V(D)J recombination process demonstrated that both TCRα and TCRβ amino acid sequences could be manufactured easily, thereby explaining the prevalence of this receptor across different individuals. Interestingly, the AS01 TCR was encoded largely by germline DNA, indicating that the TCR loci already comprise gene segments that specifically recognize this ancient pathogen. Such pattern recognition receptor-like traits within the αβ TCR system further blur the boundaries between the adaptive and innate immune systems.
Structure deposition and release
Data provenance
Publication data retrieved from the PDBe REST API (footnote 8) and the PMCe REST API (footnote 9).
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
Position | Residue | Neighbouring MHC residues
P1 | GLY | TYR59, TYR7, GLU63, TYR171, TYR159, LYS66, TRP167, MET5, PHE33
P2 | LEU | TYR159, HIS70, LYS66, MET45, VAL67, PHE9, TYR7, TYR99, GLU63
P3 | CYS | TYR99, TYR159, HIS70, LYS66
P4 | THR | LYS66
P5 | LEU | VAL152, LEU156, GLN155, HIS70, ARG97
P6 | VAL | THR73, ALA69
P7 | ALA | THR73, TRP147, TYR116, VAL152
P8 | MET | THR73, THR143, TRP147, LYS146, ASP77, VAL76
P9 | LEU | TYR84, TYR123, VAL95, THR143, TRP147, TYR116, LEU81, THR142, LYS146, ASP77, THR80
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using the Biopython NeighborSearch module. The list of neighbours is then sorted and filtered to include only neighbours between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
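The neighbour calculation described above can be illustrated with a minimal sketch. It assumes a plain brute-force distance check in place of the Biopython NeighborSearch lookup named in the text, and it uses invented toy coordinates and hypothetical chain identifiers ("C" for the peptide, "A" for the Class I alpha chain, "D" for a TCR chain) rather than atoms from this structure.

```python
import math
from itertools import product

CUTOFF = 5.0  # Å, the distance threshold stated in the text

# Hypothetical toy atoms as (chain_id, residue_label, (x, y, z)).
# Coordinates and chain identifiers are invented for illustration only.
atoms = [
    ("C", "LEU2", (0.0, 0.0, 0.0)),
    ("C", "LEU2", (1.5, 0.0, 0.0)),
    ("A", "MET45", (4.0, 0.0, 0.0)),
    ("A", "TYR7", (0.0, 4.5, 0.0)),
    ("A", "HIS114", (12.0, 0.0, 0.0)),
    ("D", "GLY95", (0.0, 0.0, 3.0)),  # close to the peptide, but not on the MHC chain
]


def residue_neighbours(atoms, peptide_chain, mhc_chain, cutoff=CUTOFF):
    """Return sorted (peptide residue, MHC residue) pairs with any atoms within cutoff."""
    peptide_atoms = [a for a in atoms if a[0] == peptide_chain]
    mhc_atoms = [a for a in atoms if a[0] == mhc_chain]
    pairs = set()
    # Brute-force all peptide/MHC atom pairs; a KD-tree search would scale better.
    for (_, pep_res, pep_xyz), (_, mhc_res, mhc_xyz) in product(peptide_atoms, mhc_atoms):
        if math.dist(pep_xyz, mhc_xyz) <= cutoff:
            pairs.add((pep_res, mhc_res))
    return sorted(pairs)


neighbours = residue_neighbours(atoms, peptide_chain="C", mhc_chain="A")
print(neighbours)
```

Restricting the search to the two chains of interest before measuring distances has the same effect as the filtering step in the text: contacts with the TCR chains never enter the result.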
Pocket | Residues
A Pocket | TYR159, THR163, TRP167, TYR171, MET5, TYR59, GLU63, LYS66, TYR7
B Pocket | ALA24, VAL34, MET45, GLU63, LYS66, VAL67, TYR7, HIS70, PHE9, TYR99
C Pocket | HIS70, THR73, HIS74, PHE9, ARG97
D Pocket | HIS114, GLN155, LEU156, TYR159, LEU160, TYR99
E Pocket | HIS114, TRP147, VAL152, LEU156, ARG97
F Pocket | TYR116, TYR123, THR143, LYS146, TRP147, ASP77, THR80, LEU81, TYR84, VAL95
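The peptide neighbour lists and the pocket residue lists can be cross-referenced to see which pockets each peptide position touches. The sketch below is one possible way to do that, using the P1, P2 and P9 neighbours and the A, B and F pocket residues copied from the lists on this page. The function name `pocket_contacts` is hypothetical and the page does not describe this step as part of its own pipeline.

```python
# Neighbour lists for three peptide positions, copied from this page.
peptide_neighbours = {
    "P1": {"TYR59", "TYR7", "GLU63", "TYR171", "TYR159", "LYS66",
           "TRP167", "MET5", "PHE33"},
    "P2": {"TYR159", "HIS70", "LYS66", "MET45", "VAL67", "PHE9",
           "TYR7", "TYR99", "GLU63"},
    "P9": {"TYR84", "TYR123", "VAL95", "THR143", "TRP147", "TYR116",
           "LEU81", "THR142", "LYS146", "ASP77", "THR80"},
}

# Pocket residue lists for three pockets, copied from this page.
pockets = {
    "A": {"TYR159", "THR163", "TRP167", "TYR171", "MET5", "TYR59",
          "GLU63", "LYS66", "TYR7"},
    "B": {"ALA24", "VAL34", "MET45", "GLU63", "LYS66", "VAL67",
          "TYR7", "HIS70", "PHE9", "TYR99"},
    "F": {"TYR116", "TYR123", "THR143", "LYS146", "TRP147", "ASP77",
          "THR80", "LEU81", "TYR84", "VAL95"},
}


def pocket_contacts(neighbours, pockets):
    """Map each peptide position to the pocket residues it neighbours."""
    contacts = {}
    for position, residues in neighbours.items():
        contacts[position] = {
            name: sorted(residues & members)
            for name, members in pockets.items()
            if residues & members  # skip pockets with no shared residues
        }
    return contacts


contacts = pocket_contacts(peptide_neighbours, pockets)
for position, by_pocket in contacts.items():
    summary = ", ".join(f"{name}: {len(shared)}" for name, shared in by_pocket.items())
    print(position, summary)
```

Because several residues belong to more than one pocket (for example LYS66 in both A and B), a position can register contacts with more than one pocket.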
1. Beta 2 microglobulin
Beta 2 microglobulin
        10        20        30        40        50        60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD
        70        80        90
WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPCIVKWDRDM
2. Class I alpha
HLA-A*02:01
IPD-IMGT/HLA
[ipd-imgt:HLA35266]
        10        20        30        40        50        60
GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW
        70        80        90       100       110       120
DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG
       130       140       150       160       170       180
KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ
       190       200       210       220       230       240
RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT
       250       260       270
FQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEP
3. Peptide
GLCTLVAML
4. T cell receptor alpha
T cell receptor alpha
TRAV5
        10        20        30        40        50        60
QSLFLSVREGDSSVINCTYTDSSSTYLYWYKQEPGAGLQLLTYIFSNMDMKQDQRKTVLL
        70        80        90       100       110       120
NKKDKHLSLRIADTQTGDSAIYFCAEDNNARLMFGDGTQLVVKPNIQNPDPAVYQLRDSK
       130       140       150       160       170       180
SSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSNKSDFACANA
       190
FNNSIIPEDTFFPSP
5. T cell receptor beta
T cell receptor beta
TRBV20
        10        20        30        40        50        60
GAVVSQHPSWVICKSGTSVKIECRSLDFQATTMFWYRQFPKQSLMLMATSNEGSKATYEQ
        70        80        90       100       110       120
GVEKDKFLINHASLTLSTLTVTSAHPEDSSFYICSARDGTGNGYTFGSGTRLTVVEDLNK
       130       140       150       160       170       180
VFPPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKE
       190       200       210       220       230       240
QPALNDSRYALSSRLRVSATFWQDPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEA
WGRAD
Data provenance
Sequences are retrieved via the UniProt method of the RCSB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from UniProt. Sequences for the mature extracellular protein (signal peptide and cytoplasmic tail removed) are compared to identical-length sequences from the data sources mentioned above using either exact matching or Levenshtein distance-based matching.
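The matching step described above (exact matching first, then Levenshtein distance against identical-length references) can be sketched as follows. This is an illustration under stated assumptions, not the site's implementation: the reference names `ref_a` and `ref_b`, the second reference sequence, the mutated query and the `max_distance` threshold are all hypothetical. Only the first ten residues of the Class I alpha chain shown on this page are used as real input.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def best_match(query, references, max_distance=2):
    """Return (name, distance) for the closest reference, or None if too distant.

    max_distance is a hypothetical threshold chosen for this example.
    """
    # Exact matching takes priority.
    for name, sequence in references.items():
        if sequence == query:
            return name, 0
    # Otherwise fall back to edit distance over identical-length references.
    candidates = [
        (levenshtein(query, sequence), name)
        for name, sequence in references.items()
        if len(sequence) == len(query)
    ]
    if not candidates:
        return None
    distance, name = min(candidates)
    return (name, distance) if distance <= max_distance else None


references = {
    "ref_a": "GSHSMRYFFT",  # first ten residues of the Class I alpha chain above
    "ref_b": "GSHSMRYFYT",  # hypothetical variant, invented for the example
}

print(best_match("GSHSMRYFFT", references))  # exact match
print(best_match("GSHSLRYFFT", references))  # one substitution from ref_a
print(best_match("AAAAAAAAAA", references))  # nothing within the threshold
```

Trying the exact match first avoids computing edit distances for the common case where the deposited sequence is identical to a database entry.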
Downloadable data
Components
Data license
Footnotes
1. Protein Data Bank Europe - Coordinate Server
2. 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
3. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
4. PyMol - PyMol.org/pymol
5. Levenshtein distance - Wikipedia entry
6. Protein Data Bank Europe REST API - Molecules endpoint
7. 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
8. Protein Data Bank Europe REST API - Publication endpoint
9. PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.