HLA-C*05:01 binding "GADGVGKSAL" at 1.70Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-C*05:01
GADGVGKSAL
Species
Locus / Allele group
Rational discovery of a cancer neoepitope harboring the KRAS G12D driver mutation.
Cytotoxic T cells targeting cancer neoantigens harboring driver mutations can lead to durable tumor regression in an HLAI-dependent manner. However, it is difficult to extend the population of patients who are eligible for neoantigen-based immunotherapy, as immunogenic neoantigen-HLA pairs are rarely shared across different patients. Thus, a way to find other human leukocyte antigen (HLA) alleles that can also present a clinically effective neoantigen is needed. Recently, neoantigen-based immunotherapy targeting the KRAS G12D mutation in patients with HLA-C*08:02 has shown effectiveness. In a proof-of-concept study, we proposed a combinatorial strategy (the combination of phylogenetic and structural analyses) to find potential HLA alleles that could also present KRAS G12D neoantigen. Compared to in silico binding prediction, this strategy avoids the uneven accuracy across different HLA alleles. Our findings extend the population of patients who are potentially eligible for immunotherapy targeting the KRAS G12D mutation. Additionally, we provide an alternative way to predict neoantigen-HLA pairs, which maximizes the clinical usage of shared neoantigens.
Structure deposition and release
Data provenance
Publication data retrieved from PDBe REST API8 and PMCe REST API9
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
GLY
GLU63
TRP167
MET5
PHE33
TYR171
TYR159
TYR59
TYR7
LYS66
|
P10
LEU
ASN77
TYR84
TRP147
THR143
TYR123
LEU95
LYS146
ILE124
LEU81
LYS80
PHE116
|
P2
ALA
TYR67
TYR9
TYR159
TYR7
LYS66
TYR99
GLU63
TRP167
|
P3
ASP
TYR9
TYR159
TYR99
LYS66
ARG156
ARG97
|
P4
GLY
ARG69
LYS66
|
P5
VAL
ARG69
ARG156
GLN155
|
P6
GLY
GLN155
ARG69
|
P7
LYS
GLU152
GLN155
THR73
GLN72
ARG69
|
P8
SER
PHE116
ASN77
ARG156
TRP147
GLU152
THR73
ARG97
|
P9
ALA
TRP147
ASN77
THR143
LYS146
THR73
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
A Pocket
LEU159
CYS163
LEU167
LEU171
ARG5
TRP59
THR63
TYR66
PHE7
|
B Pocket
VAL24
GLN34
GLU45
THR63
TYR66
LYS67
PHE7
ALA70
THR9
GLY99
|
C Pocket
ALA70
ASP73
ARG74
THR9
MET97
|
D Pocket
GLN114
ARG155
ARG156
LEU159
GLU160
GLY99
|
E Pocket
GLN114
GLU147
ALA152
ARG156
MET97
|
F Pocket
ALA116
ILE123
GLN143
TRP146
GLU147
LEU77
LEU80
ARG81
TYR84
GLN95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
IQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDW 70 80 90 SFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRD |
2. Class I alpha
HLA-C*05:01
IPD-IMGT/HLA
[ipd-imgt:HLA35265] |
10 20 30 40 50 60
SHSMRYFYTAVSRPGRGEPRFIAVGYVDDTQFVQFDSDAASPRGEPRAPWVEQEGPEYWD 70 80 90 100 110 120 RETQKYKRQAQTDRVNLRKLRGYYNQSEAGSHTLQRMYGCDLGPDGRLLRGYNQFAYDGK 130 140 150 160 170 180 DYIALNEDLRSWTAADKAAQITQRKWEAAREAEQRRAYLEGTCVEWLRRYLENGKKTLQR 190 200 210 220 230 240 AEHPKTHVTHHPVSDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGTF 250 260 270 QKWAAVVVPSGEEQRYTCHVQHEGLPEPLTLRW |
3. Peptide
|
GADGVGKSAL
|
Data provenance
Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.
Downloadable data
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.