Figure 3 of
Klintworth, Mol Vis 2006;
12:159-176.
Figure 3. Alignment of CHST6 mutations with amino acid sequences of carbohydrate sulfotransferases
Shown here is the alignment of CHST6 mutations with amino acid sequences of human galactose/N-acetylgalactosamine/N-acetylglucosamine 6-O-sulfotransferase family of carbohydrate sulfotransferases (CHST1-7). In the figure, the red colored residues are identical to the consensus sequence and the blue colored residues are similar to the consensus sequence.The hydrophobic site, 5'-phosphate binding site, 3'-phosphate binding site and the N-glycosylation sites of CHST6 are also shown. The less than sign (<) indicates a deletion with frameshift, the greater than sign (>) indicates an insertion with frameshift, the sharp (#) indicates an insertion plus deletion with frameshift, the plus sign (+) indicates an in-frame insertion, and the plus/minus sign (±) indicates an in-frame insertion plus deletion.
CHST6 1 ------------------------------------------------------------ CHST5 1 ------------------------------------------------------------ CHST4 1 ------------------------------------------------------------ CHST7 1 -----------------------------------------------MKGRRRRRREYCK CHST2 1 MSRSPQRALPPGALPRLLQAAPAAAPRALLPQWPRRPGRRWPASPLGMKVFRRKALVLCA CHST1 1 ------------------------------------------------------------ CHST3 1 -------------------------------------MEKGLTLPQDCRDFVHSLKMRSK hydrophobic site ______ > | Changes ?XM << P CHST6 1 ---------------------------------------------MWLPRVSSTAVTALL CHST5 1 --------MGMRARVP----------------KVAHSTRRPPAARMWLPRFSSKTVTVLL CHST4 1 --------------------------------------------MLLPKKMKLLLFLVSQ CHST7 14 FALLLVLYTLVLLLVP----------------SVLDGGRDGDKGAEHCPGLQRSLGVWSL CHST2 61 GYALLLVLTMLNLLDYKWHKEPLQQCNPDGPLGAAAGAAGGSWGRPGPPPAGPPRAHARL CHST1 1 --------------------------------------------MQCSWKAVLLLALASI CHST3 24 YALFLVFVVIVFVFIEKENKIISRVSDKLKQIPQALADANSTDPALILAENASLLSLSEL hydrophobic site 5'-phosphate binding site _______________________________________________ __ | |L < C Changes X R S< Y P CL CHST6 16 LAQT------------------------FLLLFLVSRPGPSSPAGGEARVHVLVLSSWRS CHST5 37 LAQTT-----------------------CLLLFIISRPGPSSPAGGEDRVHVLVLSSWRS CHST4 17 MAILA-----------------------LFFHMYSHNISSLSMKAQPERMHVLVLSSWRS CHST7 58 EAAAAGEREQGA-----EARAAEEGGANQSPRFPSNLSGAVGEAVSREKQHIYVHATWRT CHST2 121 DLRTPYRPPAAA-----VGAAPAAAAGMAGVAAPPGNGTRGTGGVGDKRQLVYVFTTWRS CHST1 17 AIQYTAIRTFTA-----KSFHTCPGLAEAGLAERLCEESPTFAYNLSRKTHILILATTRS CHST3 84 DSAFSQLQSRLRNLSLQLGVEPAMEAAGEEEEEQRKEEEPPRPAVAGPRRHVLLMATTRT 5'-phosphate binding site __ | < LI T X S W G Changes DLF R XP<T Q DVH SP MC X #LH PLD T CHST6 52 GSSFVGQLFNQHPDVFYLMEPAWHVWTTLSQ--------GSAATLHMAVRDLVRSVFLCD CHST5 74 GSSFLGQLFSQHPDVFYLMEPAWHVWTTLSQ--------GSAATLHMAVRDLMRSIFLCD CHST4 54 GSSFVGQLFGQHPDVFYLMEPAWHVWMTFKQ--------STAWMLHMAVRDLIRAVFLCD CHST7 113 GSSFLGELFNQHPDVFYLYEPMWHLWQALYP--------GSAVSLQGALRDMLRSLFRCD CHST2 176 GSSFFGELFNQNPEVFFLYEPVWHVWQKLYP--------GDAVSLQGAARDMLSALYRCD CHST1 72 GSSFVGQLFNQHLDVFYLFEPLYHVQNTLIPRFTQGKSPADRRVMLGASRDLLRSLYDCD CHST3 144 GSSFVGEFFNQQGNIFYLFEPLWHIERTVSFE----PGGANAAGSALVYRDVLKQLFLCD N-glycosylation site ___ | | > P Changes V S C C LPX >CV L >X CHST6 104 MDVFDAYLPWR-----------RNLSDLFQWAVSRALCSPPACSAFPRG----AISSEAV CHST5 126 MDVFDAYMPQS-----------RNLSAFFNWATSRALCSPPACSAFPRG----TISKQDV CHST4 106 MSVFDAYMEPGP----------RRQSSLFQWENSRALCSAPACDIIPQD----EIIPRAH CHST7 165 FSVLRLYAPPGDPAARAPDTANLTTAALFRWRTNKVICSPPLCPGAPRARAEVGLVEDTA CHST2 228 LSVFQLYSPAGSGGR------NLTTLGIFGAATNKVVCSSPLCPAYRKEVVG--LVDDRV CHST1 132 LYFLENYIKPPPVN--------HTTDRIFRRGASRVLCSRPVCDPPGPAD---LVLEEGD CHST3 200 LYVLEHFITPLP--------EDHLTQFMFRRGSSRSLCEDPVCTPFVKK-----VFEKYH 3'-phosphate binding site ______ | # S C NQQT Changes PX WPF R HC < >P±±±+E R SERLV CHST6 149 CKP-LCARQSFTLAREACRSYSHVVLKEVRFFNLQVLYPLLSDPALNLRIVHLVRDPRAV CHST5 171 CKT-LCTRQPFSLAREACRSYSHVVLKEVRFFNLQVLYPLLSDPALNLRIVHLVRDPRAV CHST4 152 CRL-LCSQQPFEVVEKACRSYSHVVLKEVRFFNLQSLYPLLKDPSLNLHIVHLVRDPRAV CHST7 225 CER-SCPPVAIRALEAECRKYPVVVIKDVRLLDLGVLVPLLRDPGLNLKVVQLFRDPRAV CHST2 280 CK--KCPPQRLARFEEECRKYRTLVIKGVRVFDVAVLAPLLRDPALDLKVIHLVRDPRAV CHST1 181 CVR-KCGLLNLTVAAEACRERSHVAIKTVRVPEVNDLRALVEDPRLNLKVIQLVRDPRGI CHST3 247 CKNRRCGPLNVTLAAEACRRKEHMALKAVRIRQLEFLQPLAEDPRLDLRVIQLVRDPRAV 3'-phosphate binding site N-glycosylation site __ _____________________________ |W Y | | Changes FQ T +E D D X CHST6 208 LRSREQTAKALARDNGIVLGTN--------------------------GTWVEADPGLRV CHST5 230 LRSREAAGPILARDNGIVLGTN--------------------------GKWVEADPHLRL CHST4 211 FRSRERTKGDLMIDSRIVMGQH--------------------------EQKLKKEDQPYY CHST7 284 HNSRLKSRQGLLRESIQVLRTRQRGDRFHRVLLAHGVGARPGGQSRALPAAPRADFFLTG CHST2 338 ASSRIRSRHGLIRESLQVVRSRDPRAHRMPFLEAAG-HKLG---AKKEGVGGPADYHALG CHST1 240 LASRSETFRDTYRLWRLWYGTG---------------------------RK--PYNLDVT CHST3 307 LASRMVAFAGKYKTWKKWLDDE---------------------------GQDGLREEEVQ S Changes W<RP C H K P CHST6 242 VREVCRSHVRIAEAATLKPPPFLRGRYRLVRFEDLAREPLAEIRALYAFTGLSLTPQLEA CHST5 264 IREVCRSHVRIAEAATLKPPPFLRGRYRLVRFEDLAREPLAEIRALYAFTGLTLTPQLEA CHST4 245 VMQVICQSQLEIYKTIQSLPKALQERYLLVRYEDLARAPVAQTSRMYEFVGLEFLPHLQT CHST7 344 ALEVICEAWLRDLLFARGAPAWLRRRYLRLRYEDLVRQPRAQLRRLLRFSGLRALAALDA CHST2 394 AMEVICNSMAKTLQTALQPPDWLQGHYLVVRYEDLVGDPVKTLRRVYDFVGLLVSPEMEQ CHST1 271 QLTTVCEDFSNSVSTGLMRPPWLKGKYMLVRYEDLARNPMKKTEEIYGFLGIPLDSHVAR CHST3 340 RLRGNCESIRLSAELGLRQPAWLRGRYMLVRYEDVARGPLQKAREMYRFAGIPLTPQVED N-glycosylation site N-glycosylation site ___ ___ | | | | X Y Changes X H C# X W <> D CHST6 302 WIHNITHGSGPGARREAFKTSSRNALNVSQAWRHALPFAKIRRVQELCAGALQLLGYRPV CHST5 324 WIHNITHGSGIGKPIEAFHTSSRNARNVSQAWRHALPFTKILRVQEVCAGALQLLGYRPV CHST4 305 WVHNITRGKGMGD--HAFHTNARDALNVSQAWRWSLPYEKVSRLQKACGDAMNLLGYRHV CHST7 404 FALNMTRGAAYGAD-RPFHLSARDAREAVHAWRERLSREQVRQVEAACAPAMRLLAYPRS CHST2 454 FALNMTSGSGSSS--KPFVVSARNATQAANAWRTALTFQQIKQVEEFCYQPMAVLGYERV CHST1 331 WIQNNTRGDPTLG--KHKYGTVRNSAATAEKWRFRLSYDIVAFAQNACQQVLAQLGYKIA CHST3 400 WIQKNTQAAHDGS--G-IYSTQKNSSEQFEKWRFSMPFKLAQVVQAACGPAMRLFGYKLA CHST6 362 YSEDEQRNLALDLVLPRGLNGFTWASSTASHPRN CHST5 384 YSADQQRDLTLDLVLPRGPDHFSWASPD------ CHST4 363 RSEQEQRNLLLDLLS------T-WTVPEQIH--- CHST7 463 GEEGDAEQPREGETP------LEMDADGAT---- CHST2 512 NSPEEVKDLSKTLLR----------KPRL----- CHST1 389 ASEEELKNPSVSLVE------ERDFRPFS----- CHST3 457 RDAAALTNRSVSLLE------ERGTFWVT----- |