HLA-B*57:01 binding "TSMSFVPRPW" at 1.88Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-B*57:01
TSMSFVPRPW
Species
Locus / Allele group
A subset of HLA-I peptides are not genomically templated: Evidence for cis- and trans-spliced peptide ligands.
The diversity of peptides displayed by class I human leukocyte antigen (HLA) plays an essential role in T cell immunity. The peptide repertoire is extended by various posttranslational modifications, including proteasomal splicing of peptide fragments from distinct regions of an antigen to form nongenomically templated cis-spliced sequences. Previously, it has been suggested that a fraction of the immunopeptidome constitutes such cis-spliced peptides; however, because of computational limitations, it has not been possible to assess whether trans-spliced peptides (i.e., the fusion of peptide segments from distinct antigens) are also bound and presented by HLA molecules, and if so, in what proportion. Here, we have developed and applied a bioinformatic workflow and demonstrated that trans-spliced peptides are presented by HLA-I, and their abundance challenges current models of proteasomal splicing that predict cis-splicing as the most probable outcome. These trans-spliced peptides display canonical HLA-binding sequence features and are as frequently identified as cis-spliced peptides found bound to a number of different HLA-A and HLA-B allotypes. Structural analysis reveals that the junction between spliced peptides is highly solvent exposed and likely to participate in T cell receptor interactions. These results highlight the unanticipated diversity of the immunopeptidome and have important implications for autoimmunity, vaccine design, and immunotherapy.
Structure deposition and release
Data provenance
Publication data retrieved from PDBe REST API8 and PMCe REST API9
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
THR
TYR171
TYR159
LEU163
TRP167
TYR59
MET5
GLU63
TYR7
PHE33
|
P10
TRP
TYR123
LYS146
ILE95
ASN77
TYR84
TYR74
ILE80
TRP147
THR143
SER116
ILE142
ALA81
ALA117
TYR118
|
P2
SER
TYR7
TYR9
ASN66
MET45
TYR99
GLU63
MET67
TYR159
|
P3
MET
ASP114
TYR9
ASN66
TYR99
LEU156
TYR159
|
P4
SER
ASN66
|
P5
PHE
ASN66
GLN155
|
P6
VAL
GLN155
VAL152
LEU156
|
P7
PRO
GLN155
VAL152
|
P8
ARG
TYR99
VAL97
TYR74
TRP133
ASP114
THR73
ASN77
VAL152
LEU156
TRP147
SER116
|
P9
PRO
LYS146
THR73
ASN77
ILE80
TRP147
THR143
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
A Pocket
TYR159
LEU163
TRP167
TYR171
MET5
TYR59
GLU63
ASN66
TYR7
|
B Pocket
ALA24
VAL34
MET45
GLU63
ASN66
MET67
TYR7
SER70
TYR9
TYR99
|
C Pocket
SER70
THR73
TYR74
TYR9
VAL97
|
D Pocket
ASP114
GLN155
LEU156
TYR159
LEU160
TYR99
|
E Pocket
ASP114
TRP147
VAL152
LEU156
VAL97
|
F Pocket
SER116
TYR123
THR143
LYS146
TRP147
ASN77
ILE80
ALA81
TYR84
ILE95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD 70 80 90 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM |
2. Class I alpha
HLA-B*57:01
IPD-IMGT/HLA
[ipd-imgt:HLA34051] |
10 20 30 40 50 60
GSHSMRYFYTAMSRPGRGEPRFIAVGYVDDTQFVRFDSDAASPRMAPRAPWIEQEGPEYW 70 80 90 100 110 120 DGETRNMKASAQTYRENLRIALRYYNQSEAGSHIIQVMYGCDVGPDGRLLRGHDQSAYDG 130 140 150 160 170 180 KDYIALNEDLSSWTAADTAAQITQRKWEAARVAEQLRAYLEGLCVEWLRRYLENGKETLQ 190 200 210 220 230 240 RADPPKTHVTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDRT 250 260 270 FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEP |
3. Peptide
|
TSMSFVPRPW
|
Data provenance
Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.
Downloadable data
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.