Alpha This is a work in progress and may change. Your feedback is very welcome.
  


3O4L

HLA-A*02:01 presenting "GLCTLVAML" to Alpha/Beta T cell receptor at 2.54Å resolution

Data provenance

Structure downloaded from PDB Europe using the Coordinate Server. Aligned to residues 1-180 of 1HHK2 using the CEALIGN3 function of PyMol4. Chain assigment using a Levenshtein distance5 method using data from the PDBe REST API6. Organism data from PDBe REST API. Data for both of these operations from the Molecules endpoint. Structure visualised with 3DMol7.

Information sections


Complex type

Class i with peptide and alpha beta tcr

1. Beta 2 microglobulin
['B']
2. Class I alpha
HLA-A*02:01
['A']
3. Peptide
GLCTLVAML
['C']
4. T cell receptor alpha
TRAV5
['D']
5. T cell receptor beta
TRBV20
['E']

Species


Locus / Allele group


Publication

Genetic and structural basis for selection of a ubiquitous T cell receptor deployed in Epstein-Barr virus infection.

Miles JJ, Bulek AM, Cole DK, Gostick E, Schauenburg AJ, Dolton G, Venturi V, Davenport MP, Tan MP, Burrows SR, Wooldridge L, Price DA, Rizkallah PJ, Sewell AK
PLoS Pathog. (2010) 6, e1001198 [doi:10.1371/journal.ppat.1001198]  [pubmed:21124993

Despite the ∼10(18) αβ T cell receptor (TCR) structures that can be randomly manufactured by the human thymus, some surface more frequently than others. The pinnacles of this distortion are public TCRs, which exhibit amino acid-identical structures across different individuals. Public TCRs are thought to result from both recombinatorial bias and antigen-driven selection, but the mechanisms that underlie inter-individual TCR sharing are still largely theoretical. To examine this phenomenon at the atomic level, we solved the co-complex structure of one of the most widespread and numerically frequent public TCRs in the human population. The archetypal AS01 public TCR recognizes an immunodominant BMLF1 peptide, derived from the ubiquitous Epstein-Barr virus, bound to HLA-A*0201. The AS01 TCR was observed to dock in a diagonal fashion, grasping the solvent exposed peptide crest with two sets of complementarity-determining region (CDR) loops, and was fastened to the peptide and HLA-A*0201 platform with residue sets found only within TCR genes biased in the public response. Computer simulations of a random V(D)J recombination process demonstrated that both TCRα and TCRβ amino acid sequences could be manufactured easily, thereby explaining the prevalence of this receptor across different individuals. Interestingly, the AS01 TCR was encoded largely by germline DNA, indicating that the TCR loci already comprise gene segments that specifically recognize this ancient pathogen. Such pattern recognition receptor-like traits within the αβ TCR system further blur the boundaries between the adaptive and innate immune systems.

Structure deposition and release

Deposited: 2010-07-27
Released: 2011-01-12
Revised: 2021-10-06

Data provenance

Publication data retrieved from PDBe REST API8 and PMCe REST API9

Other structures from this publication


Peptide details

Length: Nonamer (9 amino acids)

Sequence: GLCTLVAML

Interactive view
Cutaway side view (static)
Surface top view (static - coloured by atom property)
Cutaway top view (static)

Data provenance

MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.


Peptide neighbours

P1 GLY

TYR59
TYR7
GLU63
TYR171
TYR159
LYS66
TRP167
MET5
PHE33
P2 LEU

TYR159
HIS70
LYS66
MET45
VAL67
PHE9
TYR7
TYR99
GLU63
P3 CYS

TYR99
TYR159
HIS70
LYS66
P4 THR

LYS66
P5 LEU

VAL152
LEU156
GLN155
HIS70
ARG97
P6 VAL

THR73
ALA69
P7 ALA

THR73
TRP147
TYR116
VAL152
P8 MET

THR73
THR143
TRP147
LYS146
ASP77
VAL76
P9 LEU

TYR84
TYR123
VAL95
THR143
TRP147
TYR116
LEU81
THR142
LYS146
ASP77
THR80

Colour key

Aromatic Hydrophobic Acidic Basic Neutral/polar

Data provenance

Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.

Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]


Binding cleft pockets


Peptide sidechain binding pockets (static)
Peptide terminii and backbone binding residues (static)
A Pocket

TYR159
THR163
TRP167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
B Pocket

ALA24
VAL34
MET45
GLU63
LYS66
VAL67
TYR7
HIS70
PHE9
TYR99
C Pocket

HIS70
THR73
HIS74
PHE9
ARG97
D Pocket

HIS114
GLN155
LEU156
TYR159
LEU160
TYR99
E Pocket

HIS114
TRP147
VAL152
LEU156
ARG97
F Pocket

TYR116
TYR123
THR143
LYS146
TRP147
ASP77
THR80
LEU81
TYR84
VAL95

Colour key

Binds N-terminus Binds P1 backbone Binds P2 backbone Binds PC-1 backbone Binds C-terminus

Data provenance

N-/C-terminus and peptide backbone binding residues are assigned according to previously published information and pockets are assigned according to an adaptation of a previously published set of residues. All numbering is currently that of the 'canonical' structures of human and mouse MHC Class I molecules.

Chain sequences

1. Beta 2 microglobulin
Beta 2 microglobulin
        10        20        30        40        50        60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD
        70        80        90
WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPCIVKWDRDM

2. Class I alpha
HLA-A*02:01
IPD-IMGT/HLA
[ipd-imgt:HLA35266]
        10        20        30        40        50        60
GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW
        70        80        90       100       110       120
DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG
       130       140       150       160       170       180
KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ
       190       200       210       220       230       240
RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT
       250       260       270
FQKWAAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEP

3. Peptide
GLCTLVAML

4. T cell receptor alpha
T cell receptor alpha
TRAV5
        10        20        30        40        50        60
QSLFLSVREGDSSVINCTYTDSSSTYLYWYKQEPGAGLQLLTYIFSNMDMKQDQRKTVLL
        70        80        90       100       110       120
NKKDKHLSLRIADTQTGDSAIYFCAEDNNARLMFGDGTQLVVKPNIQNPDPAVYQLRDSK
       130       140       150       160       170       180
SSDKSVCLFTDFDSQTNVSQSKDSDVYITDKCVLDMRSMDFKSNSAVAWSNKSDFACANA
       190
FNNSIIPEDTFFPSP

5. T cell receptor beta
T cell receptor beta
TRBV20
        10        20        30        40        50        60
GAVVSQHPSWVICKSGTSVKIECRSLDFQATTMFWYRQFPKQSLMLMATSNEGSKATYEQ
        70        80        90       100       110       120
GVEKDKFLINHASLTLSTLTVTSAHPEDSSFYICSARDGTGNGYTFGSGTRLTVVEDLNK
       130       140       150       160       170       180
VFPPEVAVFEPSEAEISHTQKATLVCLATGFYPDHVELSWWVNGKEVHSGVCTDPQPLKE
       190       200       210       220       230       240
QPALNDSRYALSSRLRVSATFWQDPRNHFRCQVQFYGLSENDEWTQDRAKPVTQIVSAEA

WGRAD


Data provenance

Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.


Downloadable data

Data can be downloaded to your local machine from the links below.
Clicking on the clipboard icon will copy the url for the data to your clipboard.
This can then be used to load the structure/data directly from the url into an application like PyMol (for 3D structures) using the load command:
   e.g. load http://www.histo.fyi/structures/downloads/1hhk_1_peptide.cif
or in the case of JSON formatted files to retrieve it and use it as part of notebooks such as Jupyter or GoogleColab.
Please take note of the data license. Using data from this site assumes that you have read and will comply with the license.

Complete structures

Aligned structures [cif]
  1. 3O4L assembly 1  

Components

MHC Class I alpha chain [cif]
  1. 3O4L assembly 1  
MHC Class I antigen binding domain (alpha1/alpha2) [cif]
  1. 3O4L assembly 1  
Peptide only [cif]
  1. 3O4L assembly 1  

Derived data

Data for this page [json]
https://api.histo.fyi/v1/structures/3o4l

Data license

The data above is made available under a Creative Commons CC-BY 4.0 license. This means you can copy, remix, transform, build upon and redistribute the material, but you must give appropriate credit, provide a link to the license, and indicate if changes were made.
If you use any data downloaded from this site in a publication, please cite 'https://www.histo.fyi/'. A preprint is in preparation.

Footnotes