Figure 4 of Dhawan et al.
Figure 4. Amino acid sequences of chdlx-3 and chanf mRNA fragments aligned to other dlx- and anf sequences to show homologies.
Residues identical to those of mdlx-3 are shown as asterisks (*) and stop codons as vertical bars (|). The GAVY sequence at the end of the other dlx-3 sequences is shown in blue. Chdlx-3 and chanf are denoted by magenta text.
A. The zebrafish (zdlx-3), newt (nvhbox4), chicken (chdlx-3 & 5), mouse (mdlx-1 & 2) and Drosophila (dll) sequences are shown aligned to the mouse mdlx-3 sequence. The single amino acid insertion of glycine in the Drosophila and newt sequences at the sixth position after the homeodomain is represented as a dot in the other sequences. In the sequences shown, the chdlx-3 sequence has 24% homology to the mouse mdlx-1, 27% to mdlx-5, 31% to mdlx-2, 36% to mdlx-3, 33% to zdlx-3 and 29% to nvhbox4.
B. The chicken anf (chanf) sequence is shown aligned to the Xenopus (xanf-1 & 2) and mouse (Hesx-1, Rpx) sequences. The chicken sequence is identical to xanf-1 & 2 in the homeobox sequences and is identical at 55% of the amino acid positions in the 3' coding sequences. It shows 87% identity to the mouse sequences in the homeobox region and 47-52% identity in the 3' coding sequences.
A
Homeodomain----( Carboxyl terminus
helix-3 helix-4 6
mdlx3 KIWFQNRR SKFKKLYK NGEVP.LEHSPNNSDSMACNSPPSPALWDTSSHSTPAPARNPLPPPLPYSASPNYLDDPTTSWYHTQNLSGPHLQQQPPQPATLHHASPGPPPNPGAVY|
zdlx3 ******** ******** *****.******A*************V**NNA**SQVNRGQIPQ***SSTPPYMEDYSNHWYQQGSHLQHPV*HPGP*QSVGAVY|
nvhbox4 ******** ******** *****GM****D**********A**T*****TP*RVQHTQAQPL*HNSSPSYLEDYNPWYHHPQNLSGHLQPPGTMHHTPPGTGAVY|
chdlx3 ******** ******** *****.**************EGTAF*GVEGLRVRVAKGLLHTCCCAAHT*VNKHL*VFV|
chdlx5 ******K* **I**IM* ***M*.P****SS**P****SPQSP*VW*PQGSSRSIGHHGHGH**AANPSPGS**ES*SAWYPAASP*GSH?QPHGSLOHPLALPSGTIY|
mdlx2 ******** *****MW* S**I*.T*QH*GA*A*PP*A***VS*PASWDFGAPQRM*GGGPGSGGGGAG*SGSSPSSAA*AFLGNYPWYHQASGSASHLQATAPLLHPSQTPQAHHHHHHHHHAGGGAPVSAGTIF|
mdlx1 ******K* ******M* Q*GAA.**G*ALANGRALSAGS*PVPPGWNPNSSSGKGSGSSAGSYV*SYT*WYPSAHQEAMQQPQLM|
dll ******** **Y**MM* AAQG*GTNSGMPLGGGGPNPGQHSPNQMHSGGNNGGGSNSGSPSHY**PGH**TPSST*VSELSPQFPPT*LSPPT*A*WDQKP*WIAHK***QM*GYVPQYWYLPETNPSLVTVWPAV|
B
Homeodomain----( Carboxyl-terminus
helix-3 helix-4
xanf-1 QIWFQNRR AKLKRSHR ESQFLIVKDSLSSKIQE|
xanf-2 ******** ******** ***************E*|
mRpx ******** **M***R* *****MA*KPFNPDLLK|
Hesx-1 ******** **M***R* *****MARKPFNPDLLK|
chanf ******** ******** *****M**NNST*SLLE|
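
The percent-identity figures quoted in the legend can be illustrated with a minimal sketch that counts asterisks in one row of the alignment. This is not the authors' method: the function name percent_identity and its ignore parameter are hypothetical, and the sketch assumes that every non-asterisk letter is a mismatch and that spaces, gap dots and the stop-codon bar are not aligned positions. It covers only the residues printed in the panel, so it will not necessarily reproduce values the legend computes over longer regions.

```python
def percent_identity(row, ignore=" .|"):
    """Percent of aligned positions in a row that are marked identical ('*').

    `row` is one sequence line of the panel without its label.
    Characters in `ignore` (spaces, gap dots, stop-codon bars) are skipped;
    this convention is an assumption, not taken from the paper.
    """
    positions = [c for c in row if c not in ignore]
    if not positions:
        raise ValueError("row has no aligned positions")
    matches = sum(1 for c in positions if c == "*")
    return 100.0 * matches / len(positions)


# Helix-3 and helix-4 segments of panel B, copied from the rows above.
mrpx_homeodomain = "******** **M***R*"
chanf_homeodomain = "******** ********"

print(percent_identity(mrpx_homeodomain))   # 87.5
print(percent_identity(chanf_homeodomain))  # 100.0
```

For the homeodomain segment shown in panel B, the mRpx row gives 14 identical positions out of 16, in line with the 87% quoted for the mouse sequences, and the chanf row is fully identical to xanf-1.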