Alpha This is a work in progress and may change. Your feedback is very welcome.
  


6XQA

HLA-A*24:02 binding "TYQWVLKNL" at 2.16Å resolution

Data provenance

Structure downloaded from PDB Europe using the Coordinate Server. Aligned to residues 1-180 of 1HHK2 using the CEALIGN3 function of PyMol4. Chain assigment using a Levenshtein distance5 method using data from the PDBe REST API6. Organism data from PDBe REST API. Data for both of these operations from the Molecules endpoint. Structure visualised with 3DMol7.

Information sections


Complex type

Class i with peptide

1. Beta 2 microglobulin
['B', 'E']
2. Class I alpha
HLA-A*24:02
['A', 'D']
3. Peptide
TYQWVLKNL
['C', 'F']

Species


Locus / Allele group


Publication

CD8+ T cell landscape in Indigenous and non-Indigenous people restricted by influenza mortality-associated HLA-A*24:02 allomorph.

Hensen L, Illing PT, Bridie Clemens E, Nguyen THO, Koutsakos M, van de Sandt CE, Mifsud NA, Nguyen AT, Szeto C, Chua BY, Halim H, Rizzetto S, Luciani F, Loh L, Grant EJ, Saunders PM, Brooks AG, Rockman S, Kotsimbos TC, Cheng AC, Richards M, Westall GP, Wakim LM, Loudovaris T, Mannering SI, Elliott M, Tangye SG, Jackson DC, Flanagan KL, Rossjohn J, Gras S, Davies J, Miller A, Tong SYC, Purcell AW, Kedzierska K
Nat Commun (2021) 12, 2931 [doi:10.1038/s41467-021-23212-x]  [pubmed:34006841

Background

The majority of clinical genetic testing focuses almost exclusively on regions of the genome that directly encode proteins. The important role of variants in non-coding regions in penetrant disease is, however, increasingly being demonstrated, and the use of whole genome sequencing in clinical diagnostic settings is rising across a large range of genetic disorders. Despite this, there is no existing guidance on how current guidelines designed primarily for variants in protein-coding regions should be adapted for variants identified in other genomic contexts.

Methods

We convened a panel of nine clinical and research scientists with wide-ranging expertise in clinical variant interpretation, with specific experience in variants within non-coding regions. This panel discussed and refined an initial draft of the guidelines which were then extensively tested and reviewed by external groups.

Results

We discuss considerations specifically for variants in non-coding regions of the genome. We outline how to define candidate regulatory elements, highlight examples of mechanisms through which non-coding region variants can lead to penetrant monogenic disease, and outline how existing guidelines can be adapted for the interpretation of these variants.

Conclusions

These recommendations aim to increase the number and range of non-coding region variants that can be clinically interpreted, which, together with a compatible phenotype, can lead to new diagnoses and catalyse the discovery of novel disease mechanisms.

Structure deposition and release

Deposited: 2020-07-09
Released: 2021-04-14
Revised: 2021-06-23

Data provenance

Publication data retrieved from PDBe REST API8 and PMCe REST API9

Other structures from this publication


Peptide details

Length: Nonamer (9 amino acids)

Sequence: TYQWVLKNL

Interactive view
Cutaway side view (static)
Surface top view (static - coloured by atom property)
Cutaway top view (static)

Data provenance

MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.


Peptide neighbours

P1 THR

ARG170
THR163
GLY167
GLU63
MET5
TYR171
TYR159
TYR59
TYR7
LYS66
CYS164
PHE33
P2 TYR

SER9
ALA24
PHE22
MET45
PHE99
THR163
GLU63
VAL67
TYR159
TYR7
LYS66
HIS70
P3 GLN

TYR159
LYS66
GLN156
HIS114
GLN155
PHE99
P4 TRP

ALA69
GLY65
GLU62
LYS66
GLN156
P5 VAL

MET97
THR73
HIS114
HIS70
ASP74
TYR116
ALA69
PHE99
P6 LEU

ALA69
THR73
P7 LYS

VAL152
THR73
ASN77
TYR116
TRP147
P8 ASN

THR73
GLU76
THR143
ASN77
ILE80
LYS146
TRP147
P9 LEU

ASN77
ILE124
TYR116
TYR84
THR143
ALA81
LEU95
TYR123
ILE80
LYS146
TRP147
ILE142

Colour key

Aromatic Hydrophobic Acidic Basic Neutral/polar

Data provenance

Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.

Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]


Binding cleft pockets


Peptide sidechain binding pockets (static)
Peptide terminii and backbone binding residues (static)
A Pocket

TYR159
THR163
GLY167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
B Pocket

ALA24
VAL34
MET45
GLU63
LYS66
VAL67
TYR7
HIS70
SER9
PHE99
C Pocket

HIS70
THR73
ASP74
SER9
MET97
D Pocket

HIS114
GLN155
GLN156
TYR159
LEU160
PHE99
E Pocket

HIS114
TRP147
VAL152
GLN156
MET97
F Pocket

TYR116
TYR123
THR143
LYS146
TRP147
ASN77
ILE80
ALA81
TYR84
LEU95

Colour key

Binds N-terminus Binds P1 backbone Binds P2 backbone Binds PC-1 backbone Binds C-terminus

Data provenance

N-/C-terminus and peptide backbone binding residues are assigned according to previously published information and pockets are assigned according to an adaptation of a previously published set of residues. All numbering is currently that of the 'canonical' structures of human and mouse MHC Class I molecules.

Chain sequences

1. Beta 2 microglobulin
Beta 2 microglobulin
        10        20        30        40        50        60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD
        70        80        90
WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM

2. Class I alpha
HLA-A*24:02
IPD-IMGT/HLA
[ipd-imgt:HLA34790]
        10        20        30        40        50        60
GSHSMRYFSTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW
        70        80        90       100       110       120
DEETGKVKAHSQTDRENLRIALRYYNQSEAGSHTLQMMFGCDVGSDGRFLRGYHQYAYDG
       130       140       150       160       170       180
KDYIALKEDLRSWTAADMAAQITKRKWEAAHVAEQQRAYLEGTCVDGLRRYLENGKETLQ
       190       200       210       220       230       240
RTDPPKTHMTHHPISDHEATLRCWALGFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT
       250       260       270
FQKWAAVVVPSGEEQRYTCHVQHEGLPKPLTLRWEPSS

3. Peptide
TYQWVLKNL


Data provenance

Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.


Downloadable data

Data can be downloaded to your local machine from the links below.
Clicking on the clipboard icon will copy the url for the data to your clipboard.
This can then be used to load the structure/data directly from the url into an application like PyMol (for 3D structures) using the load command:
   e.g. load http://www.histo.fyi/structures/downloads/1hhk_1_peptide.cif
or in the case of JSON formatted files to retrieve it and use it as part of notebooks such as Jupyter or GoogleColab.
Please take note of the data license. Using data from this site assumes that you have read and will comply with the license.

Complete structures

Aligned structures [cif]
  1. 6XQA assembly 1  
  2. 6XQA assembly 2  

Components

MHC Class I alpha chain [cif]
  1. 6XQA assembly 1  
  2. 6XQA assembly 2  
MHC Class I antigen binding domain (alpha1/alpha2) [cif]
  1. 6XQA assembly 1  
  2. 6XQA assembly 2  
Peptide only [cif]
  1. 6XQA assembly 1  
  2. 6XQA assembly 2  

Derived data

Data for this page [json]
https://api.histo.fyi/v1/structures/6xqa

Data license

The data above is made available under a Creative Commons CC-BY 4.0 license. This means you can copy, remix, transform, build upon and redistribute the material, but you must give appropriate credit, provide a link to the license, and indicate if changes were made.
If you use any data downloaded from this site in a publication, please cite 'https://www.histo.fyi/'. A preprint is in preparation.

Footnotes