Alpha This is a work in progress and may change. Your feedback is very welcome.
  


5WSH

HLA-A*02:01 binding "GVWIRTPTA" at 2.00Å resolution

Data provenance

Structure downloaded from PDB Europe using the Coordinate Server. Aligned to residues 1-180 of 1HHK2 using the CEALIGN3 function of PyMol4. Chain assigment using a Levenshtein distance5 method using data from the PDBe REST API6. Organism data from PDBe REST API. Data for both of these operations from the Molecules endpoint. Structure visualised with 3DMol7.

Information sections


Complex type

Class i with peptide

1. Beta 2 microglobulin
['B']
2. Class I alpha
HLA-A*02:01
['A']
3. Peptide
GVWIRTPTA
['C']

Species


Locus / Allele group


Publication

CD8+ T-Cell Response-Associated Evolution of Hepatitis B Virus Core Protein and Disease Progress.

Zhang Y, Wu Y, Deng M, Xu D, Li X, Xu Z, Hu J, Zhang H, Liu K, Zhao Y, Gao F, Bi S, Gao GF, Zhao J, Liu WJ, Meng S
J. Virol. (2018) 92, [doi:10.1128/jvi.02120-17]  [pubmed:29950410

Influenza A viruses (IAV) initiate infection by binding to glycans with terminal sialic acids on the cell surface. Hosts of IAV variably express two major forms of sialic acid, N-acetylneuraminic acid (NeuAc) and N-glycolylneuraminic acid (NeuGc). NeuGc is produced in most mammals, including horses and pigs, but is absent in humans, ferrets, and birds. The only known naturally occurring IAV that exclusively bind NeuGc are extinct highly pathogenic equine H7N7 viruses. We determined the crystal structure of a representative equine H7 hemagglutinin (HA) in complex with NeuGc and observed high similarity in the receptor-binding domain with an avian H7 HA. To determine the molecular basis for NeuAc and NeuGc specificity, we performed systematic mutational analyses, based on the structural insights, on two distant avian H7 HAs and an H15 HA. We found that the A135E mutation is key for binding α2,3-linked NeuGc but does not abolish NeuAc binding. The additional mutations S128T, I130V, T189A, and K193R converted the specificity from NeuAc to NeuGc. We investigated the residues at positions 128, 130, 135, 189, and 193 in a phylogenetic analysis of avian and equine H7 HAs. This analysis revealed a clear distinction between equine and avian residues. The highest variability was observed at key position 135, of which only the equine glutamic acid led to NeuGc binding. These results demonstrate that genetically distinct H7 and H15 HAs can be switched from NeuAc to NeuGc binding and vice versa after the introduction of several mutations, providing insights into the adaptation of H7 viruses to NeuGc receptors. IMPORTANCE Influenza A viruses cause millions of cases of severe illness and deaths annually. To initiate infection and replicate, the virus first needs to bind to a structure on the cell surface, like a key fitting in a lock. For influenza A viruses, these "keys" (receptors) on the cell surface are chains of sugar molecules (glycans). The terminal sugar on these glycans is often either N-acetylneuraminic acid (NeuAc) or N-glycolylneuraminic acid (NeuGc). Most influenza A viruses bind NeuAc, but a small minority bind NeuGc. NeuGc is present in species like horses, pigs, and mice but not in humans, ferrets, and birds. Here, we investigated the molecular determinants of NeuGc specificity and the origin of viruses that bind NeuGc.

Structure deposition and release

Deposited: 2016-12-07
Released: 2017-12-20
Revised: 2019-01-23

Data provenance

Publication data retrieved from PDBe REST API8 and PMCe REST API9

Other structures from this publication


Peptide details

Length: Nonamer (9 amino acids)

Sequence: GVWIRTPTA

Interactive view
Cutaway side view (static)
Surface top view (static - coloured by atom property)
Cutaway top view (static)

Data provenance

MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.


Peptide neighbours

P1 GLY

GLU63
TRP167
MET5
TYR7
TYR171
TYR59
LYS66
TYR159
PHE33
P2 VAL

TYR159
MET45
GLU63
PHE9
LYS66
TYR99
TYR7
VAL67
HIS70
P3 TRP

LEU156
LYS66
TYR99
HIS114
HIS70
GLN155
TYR159
VAL152
P4 ILE

LYS66
ARG65
P5 ARG

HIS70
GLN155
P6 THR

ALA69
LYS66
ARG97
HIS70
THR73
P7 PRO

VAL152
ASP77
THR73
ARG97
TRP147
P8 THR

ASP77
VAL76
LYS146
THR73
TRP147
P9 ALA

THR80
TYR116
TRP147
ASP77
TYR84
THR143
TYR123
LYS146
LEU81

Colour key

Aromatic Hydrophobic Acidic Basic Neutral/polar

Data provenance

Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.

Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]


Binding cleft pockets


Peptide sidechain binding pockets (static)
Peptide terminii and backbone binding residues (static)
A Pocket

TYR159
THR163
TRP167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
B Pocket

ALA24
VAL34
MET45
GLU63
LYS66
VAL67
TYR7
HIS70
PHE9
TYR99
C Pocket

HIS70
THR73
HIS74
PHE9
ARG97
D Pocket

HIS114
GLN155
LEU156
TYR159
LEU160
TYR99
E Pocket

HIS114
TRP147
VAL152
LEU156
ARG97
F Pocket

TYR116
TYR123
THR143
LYS146
TRP147
ASP77
THR80
LEU81
TYR84
VAL95

Colour key

Binds N-terminus Binds P1 backbone Binds P2 backbone Binds PC-1 backbone Binds C-terminus

Data provenance

N-/C-terminus and peptide backbone binding residues are assigned according to previously published information and pockets are assigned according to an adaptation of a previously published set of residues. All numbering is currently that of the 'canonical' structures of human and mouse MHC Class I molecules.

Chain sequences

1. Beta 2 microglobulin
Beta 2 microglobulin
        10        20        30        40        50        60
MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD
        70        80        90
WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM

2. Class I alpha
HLA-A*02:01
IPD-IMGT/HLA
[ipd-imgt:HLA35266]
        10        20        30        40        50        60
GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW
        70        80        90       100       110       120
DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG
       130       140       150       160       170       180
KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ
       190       200       210       220       230       240
RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT
       250       260       270
FQKWVAVVVPSGQEQRYTCHVQHEGLPKPLTLRWE

3. Peptide
GVWIRTPTA


Data provenance

Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.


Downloadable data

Data can be downloaded to your local machine from the links below.
Clicking on the clipboard icon will copy the url for the data to your clipboard.
This can then be used to load the structure/data directly from the url into an application like PyMol (for 3D structures) using the load command:
   e.g. load http://www.histo.fyi/structures/downloads/1hhk_1_peptide.cif
or in the case of JSON formatted files to retrieve it and use it as part of notebooks such as Jupyter or GoogleColab.
Please take note of the data license. Using data from this site assumes that you have read and will comply with the license.

Complete structures

Aligned structures [cif]
  1. 5WSH assembly 1  

Components

MHC Class I alpha chain [cif]
  1. 5WSH assembly 1  
MHC Class I antigen binding domain (alpha1/alpha2) [cif]
  1. 5WSH assembly 1  
Peptide only [cif]
  1. 5WSH assembly 1  

Derived data

Data for this page [json]
https://api.histo.fyi/v1/structures/5wsh

Data license

The data above is made available under a Creative Commons CC-BY 4.0 license. This means you can copy, remix, transform, build upon and redistribute the material, but you must give appropriate credit, provide a link to the license, and indicate if changes were made.
If you use any data downloaded from this site in a publication, please cite 'https://www.histo.fyi/'. A preprint is in preparation.

Footnotes