HLA-A*01:01 binding "AQDIYRASYY" at 2.95Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-A*01:01
AQDIYRASYY
Species
Locus / Allele group
A Recurrent Mutation in Anaplastic Lymphoma Kinase with Distinct Neoepitope Conformations.
The identification of recurrent human leukocyte antigen (HLA) neoepitopes driving T cell responses against tumors poses a significant bottleneck in the development of approaches for precision cancer therapeutics. Here, we employ a bioinformatics method, Prediction of T Cell Epitopes for Cancer Therapy, to analyze sequencing data from neuroblastoma patients and identify a recurrent anaplastic lymphoma kinase mutation (ALK R1275Q) that leads to two high affinity neoepitopes when expressed in complex with common HLA alleles. Analysis of the X-ray structures of the two peptides bound to HLA-B*15:01 reveals drastically different conformations with measurable changes in the stability of the protein complexes, while the self-epitope is excluded from binding due to steric hindrance in the MHC groove. To evaluate the range of HLA alleles that could display the ALK neoepitopes, we used structure-based Rosetta comparative modeling calculations, which accurately predict several additional high affinity interactions and compare our results with commonly used prediction tools. Subsequent determination of the X-ray structure of an HLA-A*01:01 bound neoepitope validates atomic features seen in our Rosetta models with respect to key residues relevant for MHC stability and T cell receptor recognition. Finally, MHC tetramer staining of peripheral blood mononuclear cells from HLA-matched donors shows that the two neoepitopes are recognized by CD8+ T cells. This work provides a rational approach toward high-throughput identification and further optimization of putative neoantigen/HLA targets with desired recognition features for cancer immunotherapy.
Structure deposition and release
Data provenance
Publication data retrieved from PDBe REST API8 and PMCe REST API9
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
ALA
GLU63
GLY167
MET5
TYR171
TYR159
TYR7
ARG170
CYS164
TYR59
ARG163
|
P10
TYR
ASN77
ASP116
LEU81
ILE95
TYR123
ASP74
LYS146
ILE124
TRP147
ARG114
THR80
ILE142
TYR84
THR143
ILE97
|
P2
GLN
ARG163
MET67
TYR159
GLU63
PHE9
ASN66
MET45
TYR7
TYR99
|
P3
ASP
ASN66
GLN155
TYR99
ARG156
TYR159
|
P4
ILE
GLN155
ARG163
ARG156
ASN66
|
P5
TYR
ARG114
HIS70
THR73
TYR99
ASP74
ILE97
ASN77
ASP116
ARG156
ASN66
|
P6
ARG
GLN155
ARG156
|
P7
ALA
ARG156
VAL150
TRP147
ALA152
GLN155
|
P8
SER
TRP147
THR73
ARG114
ASN77
ARG156
|
P9
TYR
ASN77
GLN72
THR143
ALA76
THR73
LYS146
TRP147
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
A Pocket
ARG159
LEU163
CYS167
LEU171
SER5
GLY59
TRP63
GLU66
SER7
|
B Pocket
ARG24
THR34
SER45
TRP63
GLU66
THR67
SER7
MET70
ARG9
GLN99
|
C Pocket
MET70
HIS73
SER74
ARG9
THR97
|
D Pocket
ARG114
ALA155
ALA156
ARG159
ARG160
GLN99
|
E Pocket
ARG114
LYS147
ALA152
ALA156
THR97
|
F Pocket
TYR116
GLY123
ALA143
THR146
LYS147
ASP77
ASN80
LEU81
LEU84
SER95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD 70 80 90 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM |
2. Class I alpha
HLA-A*01:01
IPD-IMGT/HLA
[ipd-imgt:HLA34767] |
10 20 30 40 50 60
MASGSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQKMEPRAPWIEQEGP 70 80 90 100 110 120 EYWDQETRNMKAHSQTDRANLGTLRGYYNQSEDGSHTIQIMYGCDVGPDGRFLRGYRQDA 130 140 150 160 170 180 YDGKDYIALNEDLRSWTAADMAAQITKRKWEAVHAAEQRRVYLEGRCVDGLRRYLENGKE 190 200 210 220 230 240 TLQRTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAG 250 260 270 280 DGTFQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWELSSQP |
3. Peptide
|
AQDIYRASYY
|
Data provenance
Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.
Downloadable data
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.