ExPASy logo ExPASy Home page Site Map Search ExPASy Contact us Swiss-Prot
Notice: This page will be replaced with www.uniprot.org. Please send us your feedback!
Search for

UniProtKB/Swiss-Prot entry P03315


[Entry info] [Name and origin] [References] [Comments] [Cross-references] [Keywords] [Features] [Sequence] [Tools]

Note: most headings are clickable, even if they don't appear as links. They link to the user manual or other documents.
Entry information
Entry name POLS_SFV
Primary accession number P03315
Secondary accession numbers None
Integrated into Swiss-Prot on July 21, 1986
Sequence was last modified on July 21, 1986 (Sequence version 1)
Annotations were last modified on    September 2, 2008 (Entry version 94)
Name and origin of the protein
Protein name Structural polyprotein
Synonym p130
Contains Capsid protein
     (EC 3.4.21.-)
     (Coat protein)
     (C)
p62
     (E3/E2)
E3 protein
     (Spike glycoprotein E3)
E2 envelope glycoprotein
     (Spike glycoprotein E2)
6K protein
E1 envelope glycoprotein
     (Spike glycoprotein E1)
Gene name None
From
Semliki forest virus (SFV) [TaxID: 11033] 
Taxonomy Viruses; ssRNA positive-strand viruses, no DNA stage; Togaviridae; Alphavirus; SFV complex.
Virus hosts Aedes [TaxID: 7158]
Atelerix albiventris (Middle-African hedgehog) [TaxID: 9368]
Culex tritaeniorhynchus [TaxID: 7178]
Halcyon [TaxID: 170865]
Homo sapiens (Human) [TaxID: 9606]
Quelea [TaxID: 158617]
Rhipicephalus [TaxID: 34630]
Protein existence 1: Evidence at protein level;
References
[1]
NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 1-305.
PubMed=6935652 [NCBI, ExPASy, EBI, Israel, Japan]
Garoff H., Frischauf A.-M., Simons K., Lehrach H., Delius H.;
"The capsid protein of Semliki Forest virus has clusters of basic amino acids and prolines in its amino-terminal region.";
Proc. Natl. Acad. Sci. U.S.A. 77:6376-6380(1980).
[2]
NUCLEOTIDE SEQUENCE [GENOMIC RNA] OF 266-1253.
DOI=10.1038/288236a0; PubMed=6985476 [NCBI, ExPASy, EBI, Israel, Japan]
Garoff H., Frischauf A.-M., Simons K., Lehrach H., Delius H.;
"Nucleotide sequence of cDNA coding for Semliki Forest virus membrane glycoproteins.";
Nature 288:236-241(1980).
[3]
PROTEIN SEQUENCE OF C-TERMINUS FOR CAPSID PROTEIN; E3 PROTEIN; E2 ENVELOPE GLYCOPROTEIN; 6K PROTEIN AND E1 ENVELOPE GLYCOPROTEIN.
DOI=10.1016/0014-5793(80)81158-6; PubMed=7398872 [NCBI, ExPASy, EBI, Israel, Japan]
Kalkkinen N.;
"Carboxyl-terminal sequence analysis of the four structural proteins of Semliki Forest virus.";
FEBS Lett. 115:163-166(1980).
[4]
PROTEIN SEQUENCE OF 334-402 AND 816-881.
PubMed=6087344 [NCBI, ExPASy, EBI, Israel, Japan]
Bell J.R., Kinney R.M., Trent D.W., Strauss E.G., Strauss J.H.;
"An evolutionary tree relating eight alphaviruses, based on amino-terminal sequences of their glycoproteins.";
Proc. Natl. Acad. Sci. U.S.A. 81:4702-4706(1984).
[5]
STEAROYLATION AT CYS-1248, AND PALMITOYLATION.
PubMed=3143715 [NCBI, ExPASy, EBI, Israel, Japan]
Schmidt M., Schmidt M.F., Rott R.;
"Chemical identification of cysteine as palmitoylation site in a transmembrane protein (Semliki Forest virus E1).";
J. Biol. Chem. 263:18635-18639(1988).
[6]
CLEAVAGE SITE OF P62, AND MUTAGENESIS OF ARG-330 AND ARG-333.
PubMed=2005112 [NCBI, ExPASy, EBI, Israel, Japan]
Jain S.K., DeCandido S., Kielian M.;
"Processing of the p62 envelope precursor protein of Semliki Forest virus.";
J. Biol. Chem. 266:5756-5761(1991).
[7]
MUTAGENESIS OF ASP-890; LYS-894; GLY-898; PRO-901; MET-903 AND GLY-906.
PubMed=2072453 [NCBI, ExPASy, EBI, Israel, Japan]
Levy-Mintz P., Kielian M.;
"Mutagenesis of the putative fusion domain of the Semliki Forest virus spike protein.";
J. Virol. 65:4292-4300(1991).
[8]
CLEAVAGE BY SIGNAL PEPTIDASE, AND MUTAGENESIS OF ALA-755 AND ALA-815.
PubMed=1985194 [NCBI, ExPASy, EBI, Israel, Japan]
Liljestrom P., Garoff H.;
"Internally located cleavable signal sequences direct the formation of Semliki Forest virus membrane proteins from a polyprotein precursor.";
J. Virol. 65:147-154(1991).
[9]
FUNCTION OF 6K PROTEIN.
DOI=10.1016/0042-6822(91)90556-Q; PubMed=1962454 [NCBI, ExPASy, EBI, Israel, Japan]
Lusa S., Garoff H., Liljestrom P.;
"Fate of the 6K membrane protein of Semliki Forest virus during virus assembly.";
Virology 185:843-846(1991).
[10]
FUNCTION OF E1 PROTEIN.
PubMed=1433520 [NCBI, ExPASy, EBI, Israel, Japan]
Wahlberg J.M., Bron R., Wilschut J., Garoff H.;
"Membrane fusion of Semliki Forest virus involves homotrimers of the fusion protein.";
J. Virol. 66:7309-7318(1992).
[11]
FUNCTION OF 6K PROTEIN.
PubMed=7983743 [NCBI, ExPASy, EBI, Israel, Japan]
Loewy A., Smyth J., von Bonsdorff C.H., Liljestrom P., Schlesinger M.J.;
"The 6-kilodalton membrane protein of Semliki Forest virus is involved in the budding process.";
J. Virol. 69:469-475(1995).
[12]
MUTAGENESIS OF GLY-898 AND GLY-906.
DOI=10.1083/jcb.140.1.91; PubMed=9425157 [NCBI, ExPASy, EBI, Israel, Japan]
Vashishtha M., Phalen T., Marquardt M.T., Ryu J.S., Ng A.C., Kielian M.;
"A single point mutation controls the cholesterol dependence of Semliki Forest virus entry and exit.";
J. Cell Biol. 140:91-99(1998).
[13]
AUTOPROTEOLYTIC CLEAVAGE BY CAPSID PROTEIN.
DOI=10.1006/jmbi.1998.1817; PubMed=9642067 [NCBI, ExPASy, EBI, Israel, Japan]
Skoging U., Liljestrom P.;
"Role of the C-terminal tryptophan residue for the structure-function of the alphavirus capsid protein.";
J. Mol. Biol. 279:865-872(1998).
[14]
INTERACTION WITH HUMAN CHAPERONES P4HB/PDI AND PDIA3/ERP57.
DOI=10.1038/47062; PubMed=10573423 [NCBI, ExPASy, EBI, Israel, Japan]
Molinari M., Helenius A.;
"Glycoproteins form mixed disulphides with oxidoreductases during folding in living cells.";
Nature 402:90-93(1999).
[15]
MUTAGENESIS OF LEU-859 AND VAL-993.
DOI=10.1128/JVI.76.24.12712-12722.2002; PubMed=12438597 [NCBI, ExPASy, EBI, Israel, Japan]
Chatterjee P.K., Eng C.H., Kielian M.;
"Novel mutations that control the sphingolipid and cholesterol dependence of the Semliki Forest virus fusion protein.";
J. Virol. 76:12712-12722(2002).
[16]
PROTEOLYTIC PROCESSING OF P62 BY HOST FURIN.
DOI=10.1128/JVI.77.5.2981-2989.2003; PubMed=12584323 [NCBI, ExPASy, EBI, Israel, Japan]
Zhang X., Fugere M., Day R., Kielian M.;
"Furin processing and proteolytic activation of Semliki Forest virus.";
J. Virol. 77:2981-2989(2003).
[17]
X-RAY CRYSTALLOGRAPHY (3.1 ANGSTROMS) OF 119-267.
DOI=10.1002/(SICI)1097-0134(199703)27:3<345::AID-PROT3>3.0.CO;2-C; PubMed=9094737 [NCBI, ExPASy, EBI, Israel, Japan]
Choi H.-K., Lu G., Lee S., Wengler G., Rossmann M.G.;
"Structure of Semliki Forest virus core protein.";
Proteins 27:345-359(1997).
[18]
STRUCTURE BY ELECTRON MICROSCOPY (9.0 ANGSTROMS) OF 119-267, AND DISULFIDE BONDS.
DOI=10.1016/S1097-2765(00)80421-9; PubMed=10882067 [NCBI, ExPASy, EBI, Israel, Japan]
Mancini E.J., Clarke M., Gowen B.E., Rutten T., Fuller S.D.;
"Cryo-electron microscopy reveals the functional organization of an enveloped virus, Semliki Forest virus.";
Mol. Cell 5:255-266(2000).
[19]
X-RAY CRYSTALLOGRAPHY (3.0 ANGSTROMS) OF 816-1205.
DOI=10.1016/S0092-8674(01)00303-8; PubMed=11301009 [NCBI, ExPASy, EBI, Israel, Japan]
Lescar J., Roussel A., Wien M.W., Navaza J., Fuller S.D., Wengler G., Wengler G., Rey F.A.;
"The Fusion glycoprotein shell of Semliki Forest virus: an icosahedral assembly primed for fusogenic activation at endosomal pH.";
Cell 105:137-148(2001).
[20]
X-RAY CRYSTALLOGRAPHY (3.2 ANGSTROMS) OF 816-1206, GLYCOSYLATION AT ASN-956, AND DISULFIDE BONDS.
DOI=10.1038/nature02239; PubMed=14737160 [NCBI, ExPASy, EBI, Israel, Japan]
Gibbons D.L., Vaney M.C., Roussel A., Vigouroux A., Reilly B., Lepault J., Kielian M., Rey F.A.;
"Conformational change and protein-protein interactions of the fusion protein of Semliki Forest virus.";
Nature 427:320-325(2004).
Comments
  • FUNCTION: Capsid protein possesses a protease activity that results in its autocatalytic cleavage from the nascent structural protein. Following its self-cleavage, the capsid protein transiently associates with ribosomes, and within several minutes the protein binds to viral RNA and rapidly assembles into icosaedric core particles. The resulting nucleocapsid eventually associates with the cytoplasmic domain of E2 at the cell membrane, leading to budding and formation of mature virions. New virions attach to target cells, and after endocytosis their membrane fuses with the target cell membrane. This leads to the release of the nucleocapsid into the cytoplasm, followed by an uncoating event necessary for the genomic RNA to become accessible. The uncoating might be triggered by the interaction of capsid proteins with ribosomes. Binding of ribosomes would release the genomic RNA since the same region is genomic RNA-binding and ribosome-binding.
  • FUNCTION: E3 protein's function is unknown.
  • FUNCTION: E2 is responsible for viral attachment to target host cell, by binding to the cell receptor. Synthesized as a p62 precursor which is processed by furin at the cell membrane just before virion budding, giving rise to E2-E1 heterodimer. The p62-E1 heterodimer is stable, whereas E2-E1 is unstable and dissociate at low pH. p62 is processed at the last step, presumably to avoid E1 fusion activation before its final export to cell surface. E2 C-terminus contains a transitory transmembrane that would be disrupted by palmitoylation, resulting in reorientation of the C-terminal tail from lumenal to cytoplasmic side. This step is critical since E2 C-terminus is involved in budding by interacting with capsid proteins. This release of E2 C-terminus in cytoplasm occurs lately in protein export, and precludes premature assembly of particles at the endoplasmic reticulum membrane.
  • FUNCTION: 6K is a constitutive membrane protein involved in virus glycoprotein processing, membrane permeabilization, and the budding of viral particles. Present in low amount in virions, about 3% compared to viral glycoproteins. Because of its lipophilic properties, the 6K protein is postulated to influence the selection of lipids that interact with the transmembrane domains of the glycoproteins, which, in turn, affects the deformability of the bilayer required for the extreme curvature that occurs as budding proceeds.
  • FUNCTION: E1 is a class II viral fusion protein. Fusion activity is inactive as long as E1 is bound to E2 in mature virion. After virus attachment to target cell and endocytosis, acidification of the endosome would induce dissociation of E1/E2 heterodimer and concomitant trimerization of the E1 subunits. This E1 trimer is fusion active, and promotes release of viral nucleocapsid in cytoplasm after cell and viral membrane fusion. Efficient fusion requires the presence of cholesterol and sphingolipid in the target membrane. Fusion is optimal at levels of about 1 molecule of cholesterol per 2 molecules of phospholipids, and is specific for sterols containing a 3-beta-hydroxyl group.
  • SUBUNIT: p62 and E1 form a heterodimer shortly after synthesis. Processing of p62 into E2 and E3 results in a heterodimer of E2 and E1. Spike at virion surface are constituted of three E2-E1 heterodimers. After target cell attachment and endocytosis, E1 change conformation to form homotrimers.
  • SUBCELLULAR LOCATION: Capsid protein: Virion (By similarity). Cytoplasm (By similarity).
  • SUBCELLULAR LOCATION: p62: Virion membrane; Single-pass type I membrane protein (By similarity). Cell membrane; Single-pass type I membrane protein (By similarity).
  • SUBCELLULAR LOCATION: E2 envelope glycoprotein: Virion membrane; Single-pass type I membrane protein (By similarity). Cell membrane; Single-pass type I membrane protein (By similarity).
  • SUBCELLULAR LOCATION: E1 envelope glycoprotein: Virion membrane; Single-pass type I membrane protein (By similarity). Cell membrane; Single-pass type I membrane protein (By similarity).
  • SUBCELLULAR LOCATION: 6K protein: Cell membrane; Multi-pass membrane protein (By similarity). Virion membrane; Multi-pass membrane protein (By similarity).
  • PTM: Specific enzymatic cleavages in vivo yield mature proteins. Capsid protein is auto-cleaved during polyprotein translation, unmasking p62 signal peptide. The remaining polyprotein is then targeted to the endoplasmic reticulum, where host signal peptidase cleaves it into p62, 6K and E1 proteins. p62 is further processed to mature E3 and E2 by host furin in trans-Golgi vesicle. Protein processing process takes about 30 minutes at physiologic temperatures. The folding of the p62/6K/E1 precursor requires the formation of intrachain disulfide bonds and has been shown to involve a transient covalent interaction between the nascent and newly synthesized heterodimer and the host-cell chaperones, P4HB/PDI and PDIA3/ERp57. The folding pathway also includes non covalent interaction with human CANX/calnexin and CALR/calreticulin.
  • PTM: Envelope E1, E2 and E3 proteins are N-glycosylated.
  • PTM: E2 is palmitoylated via thioester bonds. These palmitoylations may induce disruption of the C-terminus transmembrane. This would result in the reorientation of E2 c-terminus from lumenal to cytoplasmic side. 6K protein is also palmitoylated with about four covalently bound fatty acids per molecule. E1 is stearoylated.
  • MISCELLANEOUS: The mature virion nucleocapsid consists of 240 copies of the capsid protein. 80 spike trimers of E1 and E2 are present at the surface of mature virion. They project about 100 Angstroms from the outer surface and are located at the local and strict three fold axis of the icosaedral lattice. The glycoproteins splay out to form a protein shell or skirt covering most of the outer surface of the membrane bilayer.
  • MISCELLANEOUS: Structural polyprotein is translated from a subgenomic RNA synthesized during togavirus replication.
  • SIMILARITY: Contains 1 peptidase S3 domain [view classification].
  • WEB RESOURCE: Name=Virus Particle ExploreR db; Note= Icosahedral capsid structure; URL="http://viperdb.scripps.edu/info_page.php?VDB=1dwn";.
Copyright
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms. Distributed under the Creative Commons Attribution-NoDerivs License.
Cross-references
Sequence databases
EMBL
X04129; CAA27742.1; ALT_SEQ; Genomic_RNA.[EMBL / GenBank / DDBJ] [CoDingSequence]
PIR A93861; VHWV.
RefSeq NP_463458.1; -.
3D structure databases
PDB
1DYL; EM; 9.00 A; A/B/C/D=119-267.[ExPASy / RCSB / EBI]
1I9W; X-ray; 3.00 A; A=816-1205.[ExPASy / RCSB / EBI]
1RER; X-ray; 3.20 A; A/B/C=816-1206.[ExPASy / RCSB / EBI]
1VCP; X-ray; 3.00 A; A/B/C=119-267.[ExPASy / RCSB / EBI]
1VCQ; X-ray; 3.10 A; A/B=119-267.[ExPASy / RCSB / EBI]
2ALA; X-ray; 3.00 A; A=816-1206.[ExPASy / RCSB / EBI]
Detailed list of linked structures.
PDBsum 1DYL; -.
1I9W; -.
1RER; -.
1VCP; -.
1VCQ; -.
2ALA; -.
ModBase P03315.
Protein family/group databases
MEROPS S03.001; -.
Ontologies
GO
GO:0055036; Cellular component: virion membrane (inferred from electronic annotation from UniProtKB-SubCell).
QuickGo view.
Family and domain databases
InterPro IPR002548; Alpha_E1_glycop.
IPR000936; Alpha_E2_glycop.
IPR002533; Alpha_E3_glycop.
IPR000930; Peptidase_S3.
Graphical view of domain structure.
Pfam PF01589; Alpha_E1_glycop; 1.
PF00943; Alpha_E2_glycop; 1.
PF01563; Alpha_E3_glycop; 1.
PF00944; Peptidase_S3; 1.
Pfam graphical view of domain structure.
PRINTS PR00798; TOGAVIRIN.
BLOCKS P03315.
Genome annotation databases
GeneID 922351; -.
Other
LinkHub P03315; -.
ProtoNet P03315.
UniRef View cluster of proteins with at least 50% / 90% / 100% identity.
Keywords
3D-structure; Capsid protein; Cell membrane; Cleavage on pair of basic residues; Cytoplasm; Direct protein sequencing; Glycoprotein; Hydrolase; Lipoprotein; Membrane; Palmitate; Protease; Serine protease; Signal; Transmembrane; Virion.
Features
SEVIEWER logo Feature table viewer FT aligner logo Feature aligner
KeyFrom    To Length Description FTId
CHAIN   1    267  267     Capsid protein. PRO_0000041311
CHAIN   268    755  488     p62. PRO_0000226237
CHAIN   268    333  66     E3 protein. PRO_0000041312
SIGNAL   268    282  15     Not cleaved (Potential). 
CHAIN   334    755  422     E2 envelope glycoprotein. PRO_0000041313
CHAIN   756    815  60     6K protein. PRO_0000041314
CHAIN   816   1253  438     E1 envelope glycoprotein. PRO_0000041315
TOPO_DOM   268    701  434     Extracellular (Potential). 
TRANSMEM   702    722  21     Potential. 
TOPO_DOM   723    755  33     Cytoplasmic (Potential). 
TOPO_DOM   756    770  15     Extracellular (Potential). 
TRANSMEM   771    791  21     Potential. 
TOPO_DOM   792    792  1     Cytoplasmic (Potential). 
TRANSMEM   793    813  21     Potential. 
TOPO_DOM   814   1230  417     Extracellular (Potential). 
TRANSMEM   1231   1251  21     Potential. 
TOPO_DOM   1252   1253  2     Cytoplasmic (Potential). 
DOMAIN   112    267  156     Peptidase S3. 
REGION   1    113  113     Intrinsically disordered, in contact with genomic RNA in nucleocapsid (Potential). 
REGION   94    106  13     Ribosome-binding. 
REGION   728    748  21     Transient transmembrane before p62-6K protein processing (Potential). 
REGION   899    916  18     E1 fusion peptide loop. 
ACT_SITE   145    145        Charge relay system (By similarity). 
ACT_SITE   151    151        Charge relay system (By similarity). 
ACT_SITE   219    219        Charge relay system (By similarity). 
SITE   267    268  2     Cleavage; by capsid protein. 
SITE   333    334  2     Cleavage; by host furin. 
SITE   755    756  2     Cleavage; by host signal peptidase. 
SITE   815    816  2     Cleavage; by host signal peptidase. 
LIPID   718    718        S-palmitoyl cysteine; by host (Potential). 
LIPID   728    728        S-palmitoyl cysteine; by host (By similarity). 
LIPID   748    748        S-palmitoyl cysteine; by host (By similarity). 
LIPID   749    749        S-palmitoyl cysteine; by host (By similarity). 
LIPID   1248   1248        S-stearoyl cysteine; by host. 
CARBOHYD   280    280        N-linked (GlcNAc...) (Potential). 
CARBOHYD   327    327        N-linked (GlcNAc...) (Potential). 
CARBOHYD   533    533        N-linked (GlcNAc...) (Potential). 
CARBOHYD   595    595        N-linked (GlcNAc...) (Potential). 
CARBOHYD   956    956        N-linked (GlcNAc...). 
DISULFID   119    134         
DISULFID   864    929         
DISULFID   877    909         
DISULFID   878    911         
DISULFID   883    893         
DISULFID   1074   1086         
DISULFID   1116   1191         
DISULFID   1121   1195         
DISULFID   1143   1185         
MUTAGEN   267    267        W->A,R: Complete loss of cleavage by capsid protease. 
MUTAGEN   330    330        R->S: Complete loss of p62 precursor processing. 
MUTAGEN   333    333        R->F: Complete loss of p62 precursor processing. 
MUTAGEN   755    755        A->F: Complete loss of p62 precursor-6K cleavage. 
MUTAGEN   815    815        A->F: Complete loss of 6K protein-E1 envelope glycoprotein cleavage. 
MUTAGEN   859    859        L->F: E1 fusion is less cholesterol and sphingolipid dependent. 
MUTAGEN   890    890        D->A: Shifts the pH threshold for fusion to a more acidic range. 
MUTAGEN   894    894        K->Q: No effect on E1 fusion activity. 
MUTAGEN   898    898        G->A: Shifts the pH threshold for fusion to a more acidic range. 
MUTAGEN   898    898        G->D: No effect on E1 fusion activity. 
MUTAGEN   901    901        P->D: Retention of E1 protein in endoplasmic reticulum. 
MUTAGEN   903    903        M->L: No effect on E1 fusion activity. 
MUTAGEN   906    906        G->A: Shifts the pH threshold for fusion to a more acidic range. 
MUTAGEN   906    906        G->D: Complete loss of E1 fusion activity. 
MUTAGEN   906    906        G->P: Retention of E1 protein in endoplasmic reticulum. 
MUTAGEN   993    993        V->A: E1 fusion is less cholesterol and sphingolipid dependent. 
STRAND   120    125  6      
STRAND   128    134  7      
STRAND   139    141  3      
STRAND   148    152  5      
HELIX   153    156  4      
STRAND   161    163  3      
TURN   164    167  4      
STRAND   168    172  5      
HELIX   175    180  6      
STRAND   190    195  6      
STRAND   198    203  6      
STRAND   206    210  5      
STRAND   222    224  3      
STRAND   230    239  10      
STRAND   241    251  11      
STRAND   253    259  7      
Sequence information
Length: 1253 AA [This is the length of the unprocessed precursor] Molecular weight: 138017 Da [This is the MW of the unprocessed precursor] CRC64: 2A73228D08B82AC5 [This is a checksum on the sequence]
        10         20         30         40         50         60 
MNYIPTQTFY GRRWRPRPAA RPWPLQATPV APVVPDFQAQ QMQQLISAVN ALTMRQNAIA 

        70         80         90        100        110        120 
PARPPKPKKK KTTKPKPKTQ PKKINGKTQQ QKKKDKQADK KKKKPGKRER MCMKIENDCI 

       130        140        150        160        170        180 
FEVKHEGKVT GYACLVGDKV MKPAHVKGVI DNADLAKLAF KKSSKYDLEC AQIPVHMRSD 

       190        200        210        220        230        240 
ASKYTHEKPE GHYNWHHGAV QYSGGRFTIP TGAGKPGDSG RPIFDNKGRV VAIVLGGANE 

       250        260        270        280        290        300 
GSRTALSVVT WNKDMVTRVT PEGSEEWSAP LITAMCVLAN ATFPCFQPPC VPCCYENNAE 

       310        320        330        340        350        360 
ATLRMLEDNV DRPGYYDLLQ AALTCRNGTR HRRSVSQHFN VYKATRPYIA YCADCGAGHS 

       370        380        390        400        410        420 
CHSPVAIEAV RSEATDGMLK IQFSAQIGID KSDNHDYTKI RYADGHAIEN AVRSSLKVAT 

       430        440        450        460        470        480 
SGDCFVHGTM GHFILAKCPP GEFLQVSIQD TRNAVRACRI QYHHDPQPVG REKFTIRPHY 

       490        500        510        520        530        540 
GKEIPCTTYQ QTTAETVEEI DMHMPPDTPD RTLLSQQSGN VKITVGGKKV KYNCTCGTGN 

       550        560        570        580        590        600 
VGTTNSDMTI NTCLIEQCHV SVTDHKKWQF NSPFVPRADE PARKGKVHIP FPLDNITCRV 

       610        620        630        640        650        660 
PMAREPTVIH GKREVTLHLH PDHPTLFSYR TLGEDPQYHE EWVTAAVERT IPVPVDGMEY 

       670        680        690        700        710        720 
HWGNNDPVRL WSQLTTEGKP HGWPHQIVQY YYGLYPAATV SAVVGMSLLA LISIFASCYM 

       730        740        750        760        770        780 
LVAARSKCLT PYALTPGAAV PWTLGILCCA PRAHAASVAE TMAYLWDQNQ ALFWLEFAAP 

       790        800        810        820        830        840 
VACILIITYC LRNVLCCCKS LSFLVLLSLG ATARAYEHST VMPNVVGFPY KAHIERPGYS 

       850        860        870        880        890        900 
PLTLQMQVVE TSLEPTLNLE YITCEYKTVV PSPYVKCCGA SECSTKEKPD YQCKVYTGVY 

       910        920        930        940        950        960 
PFMWGGAYCF CDSENTQLSE AYVDRSDVCR HDHASAYKAH TASLKAKVRV MYGNVNQTVD 

       970        980        990       1000       1010       1020 
VYVNGDHAVT IGGTQFIFGP LSSAWTPFDN KIVVYKDEVF NQDFPPYGSG QPGRFGDIQS 

      1030       1040       1050       1060       1070       1080 
RTVESNDLYA NTALKLARPS PGMVHVPYTQ TPSGFKYWLK EKGTALNTKA PFGCQIKTNP 

      1090       1100       1110       1120       1130       1140 
VRAMNCAVGN IPVSMNLPDS AFTRIVEAPT IIDLTCTVAT CTHSSDFGGV LTLTYKTNKN 

      1150       1160       1170       1180