5WSH

HLA-A*02:01 binding "GVWIRTPTA" at 2.00Å resolution

Data provenance

Structure downloaded from PDB Europe using the Coordinate Server. Aligned to residues 1-180 of 1HHK2 using the CEALIGN3 function of PyMol4. Chain assigment using a Levenshtein distance5 method using data from the PDBe REST API6. Organism data from PDBe REST API. Data for both of these operations from the Molecules endpoint. Structure visualised with 3DMol7.

Complex type

Class i with peptide

1. Beta 2 microglobulin

['B']

2. Class I alpha
HLA-A*02:01

['A']

3. Peptide
GVWIRTPTA

['C']

Species

Homo sapiens (Human)

Locus / Allele group

HLA-A / HLA-A*02

Publication

CD8₊ T-Cell Response-Associated Evolution of Hepatitis B Virus Core Protein and Disease Progress.

Zhang Y, Wu Y, Deng M, Xu D, Li X, Xu Z, Hu J, Zhang H, Liu K, Zhao Y, Gao F, Bi S, Gao GF, Zhao J, Liu WJ, Meng S

J. Virol. (2018) 92, [doi:10.1128/jvi.02120-17] [pubmed:29950410]

Influenza A viruses (IAV) initiate infection by binding to glycans with terminal sialic acids on the cell surface. Hosts of IAV variably express two major forms of sialic acid, N-acetylneuraminic acid (NeuAc) and N-glycolylneuraminic acid (NeuGc). NeuGc is produced in most mammals, including horses and pigs, but is absent in humans, ferrets, and birds. The only known naturally occurring IAV that exclusively bind NeuGc are extinct highly pathogenic equine H7N7 viruses. We determined the crystal structure of a representative equine H7 hemagglutinin (HA) in complex with NeuGc and observed high similarity in the receptor-binding domain with an avian H7 HA. To determine the molecular basis for NeuAc and NeuGc specificity, we performed systematic mutational analyses, based on the structural insights, on two distant avian H7 HAs and an H15 HA. We found that the A135E mutation is key for binding α2,3-linked NeuGc but does not abolish NeuAc binding. The additional mutations S128T, I130V, T189A, and K193R converted the specificity from NeuAc to NeuGc. We investigated the residues at positions 128, 130, 135, 189, and 193 in a phylogenetic analysis of avian and equine H7 HAs. This analysis revealed a clear distinction between equine and avian residues. The highest variability was observed at key position 135, of which only the equine glutamic acid led to NeuGc binding. These results demonstrate that genetically distinct H7 and H15 HAs can be switched from NeuAc to NeuGc binding and vice versa after the introduction of several mutations, providing insights into the adaptation of H7 viruses to NeuGc receptors. IMPORTANCE Influenza A viruses cause millions of cases of severe illness and deaths annually. To initiate infection and replicate, the virus first needs to bind to a structure on the cell surface, like a key fitting in a lock. For influenza A viruses, these "keys" (receptors) on the cell surface are chains of sugar molecules (glycans). The terminal sugar on these glycans is often either N-acetylneuraminic acid (NeuAc) or N-glycolylneuraminic acid (NeuGc). Most influenza A viruses bind NeuAc, but a small minority bind NeuGc. NeuGc is present in species like horses, pigs, and mice but not in humans, ferrets, and birds. Here, we investigated the molecular determinants of NeuGc specificity and the origin of viruses that bind NeuGc.

Structure deposition and release

Deposited: 2016-12-07

Released: 2017-12-20

Revised: 2019-01-23

Data provenance

Publication data retrieved from PDBe REST API8 and PMCe REST API9

Other structures from this publication

Peptide details

Length: Nonamer (9 amino acids)

Sequence: GVWIRTPTA

Interactive view

Cutaway side view (static)

Surface top view (static - coloured by atom property)

Cutaway top view (static)

Data provenance

MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.

Peptide neighbours

P1 GLY

GLU63

TRP167

MET5

TYR7

TYR171

TYR59

LYS66

TYR159

PHE33

P2 VAL

TYR159

MET45

GLU63

PHE9

LYS66

TYR99

TYR7

VAL67

HIS70

P3 TRP

LEU156

LYS66

TYR99

HIS114

HIS70

GLN155

TYR159

VAL152

P4 ILE

LYS66

ARG65

P5 ARG

HIS70

GLN155

P6 THR

ALA69

LYS66

ARG97

HIS70

THR73

P7 PRO

VAL152

ASP77

THR73

ARG97

TRP147

P8 THR

ASP77

VAL76

LYS146

THR73

TRP147

P9 ALA

THR80

TYR116

TRP147

ASP77

TYR84

THR143

TYR123

LYS146

LEU81

Colour key

Aromatic Hydrophobic Acidic Basic Neutral/polar

Data provenance

Neighbours are calculated by finding residues with atoms within 5Å of each other using BioPython Neighboursearch module. The list of neighbours is then sorted and filtered to inlcude only neighbours where between the peptide and the MHC Class I alpha chain.

Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]

Binding cleft pockets

Peptide sidechain binding pockets (static)

Peptide terminii and backbone binding residues (static)

A Pocket

TYR159

THR163

TRP167

TYR171

MET5

TYR59

GLU63

LYS66

TYR7

B Pocket

ALA24

VAL34

MET45

GLU63

LYS66

VAL67

TYR7

HIS70

PHE9

TYR99

C Pocket

HIS70

THR73

HIS74

PHE9

ARG97

D Pocket

HIS114

GLN155

LEU156

TYR159

LEU160

TYR99

E Pocket

HIS114

TRP147

VAL152

LEU156

ARG97

F Pocket

TYR116

TYR123

THR143

LYS146

TRP147

ASP77

THR80

LEU81

TYR84

VAL95

Colour key

Binds N-terminus Binds P1 backbone Binds P2 backbone Binds PC-1 backbone Binds C-terminus

Data provenance

N-/C-terminus and peptide backbone binding residues are assigned according to previously published information and pockets are assigned according to an adaptation of a previously published set of residues. All numbering is currently that of the 'canonical' structures of human and mouse MHC Class I molecules.

Chain sequences

1. Beta 2 microglobulin Beta 2 microglobulin	10 20 30 40 50 60 MIQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKD 70 80 90 WSFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM

2. Class I alpha HLA-A02:01 IPD-IMGT/HLA* [ipd-imgt:HLA35266]	10 20 30 40 50 60 GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW 70 80 90 100 110 120 DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG 130 140 150 160 170 180 KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ 190 200 210 220 230 240 RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT 250 260 270 FQKWVAVVVPSGQEQRYTCHVQHEGLPKPLTLRWE

3. Peptide	GVWIRTPTA

Data provenance

Sequences are retrieved via the Uniprot method of the RSCB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from Uniprot. Sequences for the mature extracellular protein (signal petide and cytoplasmic tail removed) are compared to identical length sequences from the datasources mentioned before using either exact matching or Levenshtein distance based matching.

Downloadable data

Data can be downloaded to your local machine from the links below.

Clicking on the clipboard icon will copy the url for the data to your clipboard.

This can then be used to load the structure/data directly from the url into an application like PyMol (for 3D structures) using the load command:

e.g. load http://www.histo.fyi/structures/downloads/1hhk_1_peptide.cif

or in the case of JSON formatted files to retrieve it and use it as part of notebooks such as Jupyter or GoogleColab.

Please take note of the data license. Using data from this site assumes that you have read and will comply with the license.

Complete structures

Aligned structures [cif]

5WSH assembly 1

Components

MHC Class I alpha chain [cif]

5WSH assembly 1

MHC Class I antigen binding domain (alpha1/alpha2) [cif]

5WSH assembly 1

Peptide only [cif]

5WSH assembly 1

Derived data

Data for this page [json]

https://api.histo.fyi/v1/structures/5wsh

Data license

The data above is made available under a Creative Commons CC-BY 4.0 license. This means you can copy, remix, transform, build upon and redistribute the material, but you must give appropriate credit, provide a link to the license, and indicate if changes were made.

If you use any data downloaded from this site in a publication, please cite 'https://www.histo.fyi/'. A preprint is in preparation.

Footnotes

Protein Data Bank Europe - Coordinate Server
1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
PyMol - PyMol.org/pymol
Levenshtein distance - Wikipedia entry
Protein Data Bank Europe REST API - Molecules endpoint
3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
Protein Data Bank Europe REST API - Publication endpoint
PubMed Central Europe REST API - Articles endpoint

This work is licensed under a Creative Commons Attribution 4.0 International License.

HLA-A*02:01 binding "GVWIRTPTA" at 2.00Å resolution

Data provenance

Information sections

Complex type

Species

Locus / Allele group

Publication

CD8+ T-Cell Response-Associated Evolution of Hepatitis B Virus Core Protein and Disease Progress.

Structure deposition and release

Data provenance

Peptide details

Data provenance

Peptide neighbours

Colour key

Data provenance

Binding cleft pockets

Colour key

Data provenance

Chain sequences

Data provenance

Downloadable data

Complete structures

Components

Derived data

Data license

Footnotes

CD8₊ T-Cell Response-Associated Evolution of Hepatitis B Virus Core Protein and Disease Progress.