HLA-A*24:02 binding "TYQWIIRNWET" at 2.95Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-A*24:02
TYQWIIRNWET
Species
Locus / Allele group
CD8+ T cell landscape in Indigenous and non-Indigenous people restricted by influenza mortality-associated HLA-A*24:02 allomorph.
Background
The majority of clinical genetic testing focuses almost exclusively on regions of the genome that directly encode proteins. The important role of variants in non-coding regions in penetrant disease is, however, increasingly being demonstrated, and the use of whole genome sequencing in clinical diagnostic settings is rising across a large range of genetic disorders. Despite this, there is no existing guidance on how current guidelines designed primarily for variants in protein-coding regions should be adapted for variants identified in other genomic contexts.Methods
We convened a panel of nine clinical and research scientists with wide-ranging expertise in clinical variant interpretation, with specific experience in variants within non-coding regions. This panel discussed and refined an initial draft of the guidelines which were then extensively tested and reviewed by external groups.Results
We discuss considerations specifically for variants in non-coding regions of the genome. We outline how to define candidate regulatory elements, highlight examples of mechanisms through which non-coding region variants can lead to penetrant monogenic disease, and outline how existing guidelines can be adapted for the interpretation of these variants.Conclusions
These recommendations aim to increase the number and range of non-coding region variants that can be clinically interpreted, which, together with a compatible phenotype, can lead to new diagnoses and catalyse the discovery of novel disease mechanisms.Structure deposition and release
Data provenance
Publication data retrieved from PDBe REST API8 and PMCe REST API9
Other structures from this publication
Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
THR
GLU63
TYR7
PHE99
MET5
GLY167
PHE33
TYR171
THR163
CYS164
TYR159
TYR59
LYS66
|
P10
GLU
THR143
ARG83
LYS146
TYR84
ILE80
|
P11
THR
TRP147
LYS146
|
P2
TYR
TYR159
LYS66
TYR7
HIS70
GLU63
MET45
ALA24
PHE22
VAL67
SER9
PHE99
|
P3
GLN
GLN155
PHE99
GLN156
LEU160
TYR159
LYS66
HIS114
|
P4
TRP
LYS66
GLY65
GLN156
GLN155
ALA69
|
P5
ILE
PHE99
ASP74
ALA69
THR73
TYR116
HIS70
HIS114
MET97
|
P6
ILE
GLN155
ALA69
THR73
|
P7
ARG
ALA150
VAL152
ASN77
LYS146
GLN155
THR73
TRP147
|
P8
ASN
ASN77
GLU76
THR73
TRP147
ILE80
|
P9
TRP
TRP147
ILE124
ILE80
TYR116
ASN77
THR143
TYR118
ILE142
TYR123
LYS146
TYR84
LEU95
ALA81
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
A Pocket
TYR159
THR163
GLY167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
|
B Pocket
ALA24
VAL34
MET45
GLU63
LYS66
VAL67
TYR7
HIS70
SER9
PHE99
|
C Pocket
HIS70
THR73
ASP74
SER9
MET97
|
D Pocket
HIS114
GLN155
GLN156
TYR159
LEU160
PHE99
|
E Pocket
HIS114
TRP147
VAL152
GLN156
MET97
|
F Pocket
TYR116
TYR123
THR143
LYS146
TRP147
ASN77
ILE80
ALA81
TYR84
LEU95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD 70 80 90 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM |
2. Class I alpha
HLA-A*24:02
IPD-IMGT/HLA
[ipd-imgt:HLA34790] |
10 20 30 40 50 60
GSHSMRYFSTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW 70 80 90 100 110 120 DEETGKVKAHSQTDRENLRIALRYYNQSEAGSHTLQMMFGCDVGSDGRFLRGYHQYAYDG 130 140 150 160 170 180 KDYIALKEDLRSWTAADMAAQITKRKWEAAHVAEQQRAYLEGTCVDGLRRYLENGKETLQ 190 200 210 220 230 240 RTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 250 260 270 FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEPSS |
3. Peptide
|
TYQWIIRNWET
|
Data provenance
Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.
Downloadable data
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint
This work is licensed under a Creative Commons Attribution 4.0 International License.