Figure 4 of Dhawan et al.
Figure 4. Amino acid sequences encoded by the chdlx-3 and chanf mRNA fragments, aligned to other dlx and anf sequences to show homologies.
Residues identical to mdlx-3 are shown as asterisks (*) and stop codons as vertical bars (|). The GAVY sequence at the end of the other dlx-3 sequences is shown in blue. Chdlx-3 and chanf are denoted by magenta text.
A. Zebrafish (zdlx-3), newt (nvhbox4), chicken (chdlx-3 & 5), mouse (mdlx-1 & 2) and Drosophila (dll) sequences are shown aligned to the mouse mdlx-3 sequence. The single amino acid insertion of glycine at the sixth position after the homeodomain in the Drosophila and newt sequences is represented as a dot in the other sequences. In the sequences shown, chdlx-3 shows 24% homology to mouse mdlx-1, 27% to mdlx-5, 31% to mdlx-2, 36% to mdlx-3, 33% to zdlx-3 and 29% to nvhbox4.
B. The chicken anf (chanf) sequence is shown aligned to the Xenopus (xanf-1 & 2) and mouse (Hesx-1, Rpx) sequences. The chicken sequence is identical to xanf-1 & 2 in the homeobox sequences and is identical at 55% of the amino acid positions in the 3' coding sequences. It shares 87% identity with the mouse sequences in the homeobox region and 47-52% identity in the 3' coding sequences.
A
         Homeodomain----(  Carboxyl terminus
         helix-3  helix-4       6
mdlx3    KIWFQNRR SKFKKLYK NGEVP.LEHSPNNSDSMACNSPPSPALWDTSSHSTPAPARNPLPPPLPYSASPNYLDDPTTSWYHTQNLSGPHLQQQPPQPATLHHASPGPPPNPGAVY|
zdlx3    ******** ******** *****.******A*************V**NNA**SQVNRGQIPQ***SSTPPYMEDYSNHWYQQGSHLQHPV*HPGP*QSVGAVY|
nvhbox4  ******** ******** *****GM****D**********A**T*****TP*RVQHTQAQPL*HNSSPSYLEDYNPWYHHPQNLSGHLQPPGTMHHTPPGTGAVY|
chdlx3   ******** ******** *****.**************EGTAF*GVEGLRVRVAKGLLHTCCCAAHT*VNKHL*VFV|
chdlx5   ******K* **I**IM* ***M*.P****SS**P****SPQSP*VW*PQGSSRSIGHHGHGH**AANPSPGS**ES*SAWYPAASP*GSH?QPHGSLOHPLALPSGTIY|
mdlx2    ******** *****MW* S**I*.T*QH*GA*A*PP*A***VS*PASWDFGAPQRM*GGGPGSGGGGAG*SGSSPSSAA*AFLGNYPWYHQASGSASHLQATAPLLHPSQTPQAHHHHHHHHHAGGGAPVSAGTIF|
mdlx1    ******K* ******M* Q*GAA.**G*ALANGRALSAGS*PVPPGWNPNSSSGKGSGSSAGSYV*SYT*WYPSAHQEAMQQPQLM|
dll      ******** **Y**MM* AAQG*GTNSGMPLGGGGPNPGQHSPNQMHSGGNNGGGSNSGSPSHY**PGH**TPSST*VSELSPQFPPT*LSPPT*A*WDQKP*WIAHK***QM*GYVPQYWYLPETNPSLVTVWPAV|

B
         Homeodomain----(  Carboxyl-terminus
         helix-3  helix-4
xanf-1   QIWFQNRR AKLKRSHR ESQFLIVKDSLSSKIQE|
xanf-2   ******** ******** ***************E*|
mRpx     ******** **M***R* *****MA*KPFNPDLLK|
Hesx-1   ******** **M***R* *****MARKPFNPDLLK|
chanf    ******** ******** *****M**NNST*SLLE|
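
The asterisk notation used in the alignment can be expanded back into full sequences and scored programmatically. The following is a minimal sketch, not the procedure used by the authors: the helper names `expand` and `percent_identity` are hypothetical, and it assumes both rows have the same length and are already aligned column by column, with '.' marking a gap and '|' a stop codon. Columns containing a gap or a stop are left out of the count, which is an assumption of this sketch rather than a rule stated in the caption.

```python
GAP = "."   # placeholder for the single-residue insertion
STOP = "|"  # stop codon marker


def expand(reference: str, row: str) -> str:
    """Replace each '*' in `row` with the reference residue in the same column."""
    if len(reference) != len(row):
        raise ValueError("reference and row must have the same aligned length")
    return "".join(ref if ch == "*" else ch for ref, ch in zip(reference, row))


def percent_identity(reference: str, row: str) -> float:
    """Percentage of compared columns in which `row` matches `reference`.

    Columns holding a gap or a stop marker in either sequence are skipped
    (an assumption of this sketch).
    """
    compared = 0
    identical = 0
    for ref, ch in zip(reference, expand(reference, row)):
        if ref in (GAP, STOP) or ch in (GAP, STOP):
            continue
        compared += 1
        if ref == ch:
            identical += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared


# Homeodomain segment (helix-3 + helix-4) of xanf-1 and mRpx from panel B.
xanf1_homeo = "QIWFQNRR" + "AKLKRSHR"
mrpx_homeo = "********" + "**M***R*"

print(expand(xanf1_homeo, mrpx_homeo))            # QIWFQNRRAKMKRSRR
print(percent_identity(xanf1_homeo, mrpx_homeo))  # 87.5
```

For the mRpx homeodomain segment in panel B this gives 14 identical positions out of 16 against xanf-1. Because residues written out explicitly are compared after expansion, a literal residue that happens to match the reference is still counted as identical.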