Figure 2 of
Tomi, Mol Vis 2004;
10:537-543.
Figure 2. Nucleotide and deduced amino acid sequences of rat M-cadherin
The deduced amino acid sequence is shown below the nucleotide sequence. The stop codon is denoted by an asterisk. The signal peptide is in blue and the postulated furin cleavage site of the precursor polypeptides is indicated by a ^. The extracellular domain is divided into five cadherin extracellular subdomain repeats (EC1-EC5). The transmembrane domain is in green. In the cytoplasmic domain following the transmembrane domain, the membrane-proximal conserved domain (MPCD) is in purple and the catenin binding sequence (CBS) is in orange. The characteristic cadherin consensus sequences are shaded in red. The 4 cysteine residues in EC5 are in pink. The N-glycosylation sites are indicated in brown. The nucleotide sequence of rat M-cadherin is in GenBank with the accession number AB176538.
1 TCCCGCCGCTGCCCCCATGAGTTCTGCTCTGCTCTTCGCCCTCGGGCTGCTTGCCCAGAGCCTTGGCCTCTCCTGGGCAGTCCCTGAGCCTGAACCCAGCACCCTGTACCCCTGGCGCCGGGCA 124
1 M S S A L L F A L G L L A Q S L G L S W A V P E P E P S T L Y P W R R A 36
125 TCAGCCCCAGGCCGTGTGCGGAGAGCCTGGGTCATCCCACCCATCAGTGTGTCTGAGAACCACAAACGCCTCCCCTACCCACTTGTGCAGATCAAGTCTGACAAACAACAGCTAGGCAGTGTC 247
37 S A P G R V R R A W V I P P I S V S E N H K R L P Y P L V Q I K S D K Q Q L G S V 77
^ EC1-->
248 ATCTACAGTATCCAGGGTCCCGGTGTGGACGAGGAGCCCCGAAACGTCTTCTCCATCGACAAGTTCACTGGGAGGGTGTACCTCAATGCCACTCTGGACCGTGAGAAGACGGACCGCTTCAGG 370
78 I Y S I Q G P G V D E E P R N V F S I D K F T G R V Y L N A T L D R E K T D R F R 118
371 CTGAGGGCCTTTGCCCTGGACTTGGGTGGCTCTACCCTGGAGGACCCCACGGACCTGGAGATCGTCGTGGTGGATCAAAATGACAACCGGCCAGTCTTCCTACAGGATGTGTTCAGAGGCCGC 493
119 L R A F A L D L G G S T L E D P T D L E I V V V D Q N D N R P V F L Q D V F R G R 159
EC2-->
494 ATCCTGGAGGGTGCCATCCCAGGCACCTTCGTAACCAGGGCTGAGGCCACAGATGCCGACGATCCGGAGACAGACAATGCGGCCCTCAGGTTCTCTATCCTGGAGCAGGGCAGCCCTGAGTTG 616
160 I L E G A I P G T F V T R A E A T D A D D P E T D N A A L R F S I L E Q G S P E L 200
617 TTCAGCATTGACGAGCACACTGGAGAGATCCGCACAGTGCAAGTGGGGCTGGACCGTGAGGTGGTGGCTGTGTACAACCTGACCTTGCAGGTGGCAGACATGTCCGGAGATGGCCTCACCGCC 739
201 F S I D E H T G E I R T V Q V G L D R E V V A V Y N L T L Q V A D M S G D G L T A 241
740 ACAGCCTCGGCCATCATCTCCGTAGATGACGTCAACGACAATGCCCCCGAGTTCACCAAGGATGAGTTCTTTATGGAGGCTGCAGAGGCTGTCAGTGGAGTGGACGTGGGACGGCTTGAGGTA 862
242 T A S A I I S V D D V N D N A P E F T K D E F F M E A A E A V S G V D V G R L E V 282
EC3-->
863 GAAGACAAGGATCTGCCCGGTTCCCCCAACTGGGTGGCCAGGTTCACCATCCTGGAAGGCGATCCTGACGGGCAGTTCAAGATCTATACGGACCCCAAGACCAATGAAGGTGTGCTGTCCGTG 985
283 E D K D L P G S P N W V A R F T I L E G D P D G Q F K I Y T D P K T N E G V L S V 323
984 GTCAAGCCCCTGGATTATGAGAGCCGTGAGCAGTATGAGCTCAGAGTGTCTGTACAGAACGAGGCCCCTCTGCAGACAGCTGCCCCCCGGGCCCAGCGGGGCCAGACCAGGGTCAGTGTGTGG 1108
324 V K P L D Y E S R E Q Y E L R V S V Q N E A P L Q T A A P R A Q R G Q T R V S V W 364
1109 GTTCAGGACACCAACGAGGCTCCAGTGTTTCCAGAGAACCCACTGAGGACGAGCATAGCTGAAGGAGCCCCGCCAGGCACCTCTGTGGCCACCTTCTCTGCCCGAGACCCTGACACGGAACAG 1231
365 V Q D T N E A P V F P E N P L R T S I A E G A P P G T S V A T F S A R D P D T E Q 405
EC4-->
1232 CTGCAGAGAATCAGCTACTCTAAGGACTACGACCCAGAAGACTGGCTGCAAGTGGACCGGGCCACAGGCAGGATTCAGACCCAGCGAGTGTTGAGCCCTGCCTCACCCTTTTTAAAGGACGGC 1354
406 L Q R I S Y S K D Y D P E D W L Q V D R A T G R I Q T Q R V L S P A S P F L K D G 446
1355 TGGTACAGGGCCATCATCCTAGCCCTGGACAACGCCATGCCTCCCAGCACAGCCACAGGCACCCTGTCCATTGAGATCTTAGAAGTCAACGACCATGCCCCTGCACTGGCCCCTCCTCTGTCT 1477
447 W Y R A I I L A L D N A M P P S T A T G T L S I E I L E V N D H A P A L A P P L S 487
EC5-->
1478 GGCAGCTTGTGCAGTGAACCGGACCAAGGCCCCGGTCTCCTCTTGGGTGCCACGGATGAGGACCTGCCCCCGCACGGGGCCCCCTTCCACTTCCAGCTGAACCCCAGGGTACCAGATCTCGGC 1600
488 G S L C S E P D Q G P G L L L G A T D E D L P P H G A P F H F Q L N P R V P D L G 528
1601 CGGAACTGGAGCCTCAGCCAGATTAACGTGAGCCATGCACGCTTGCGGCTCCGACACCAGGTCTCTGAGGGCCTGCATCGCCTGAGCCTGCTGCTCCAGGACTCTGGGGAGCCGCCCCAGCAG 1723
529 R N W S L S Q I N V S H A R L R L R H Q V S E G L H R L S L L L Q D S G E P P Q Q 569
1724 CGAGAGCAAACGCTGAACGTTACCGTGTGTCGCTGTGGGTTGGATGGCACCTGCCTGCCCGGGGCCGCTGCGCTGCAAGGAGGAGGTGTAGGCGTCGGCTTGGGCGCACTGGTCATTGTGCTG 1846
570 R E Q T L N V T V C R C G L D G T C L P G A A A L Q G G G V G V G L G A L V I V L 610
1847 GCCAGCACCGTGGTCCTACTGGTTCTCATCCTGCTTGCTGCGCTCCGCACACGGTTCCGGGGGCAGTCTCGGAGCAAGAGTCTGTTGCATGGGCTTCAGGAGGATCTTCGGGACAACATTCTT 1969
611 A S T V V L L V L I L L A A L R T R F R G Q S R S K S L L H G L Q E D L R D N I L 651
1970 AACTACGATGAACAAGGAGGCGGGGAGGAGGACCAGGATGCCTACGACATAAACCAGCTGCGCCACCCAGTGGAACCGAAGGCCACCAGCCGCTCTTTGGGCCGGCCACCCCTGCGCAGGGAT 2092
652 N Y D E Q G G G E E D Q D A Y D I N Q L R H P V E P K A T S R S L G R P P L R R D 692
2093 GCACCCTTCAGCTATGTGCCACAGCCACATCGAGTACTTCCTACCAGCCCGTCTGACATCGCCAACTTCATCAGTGACGGCTTGGAGGCTGCGGACAGCGACCCCAGCGTGCCTCCCTATGAC 2215
693 A P F S Y V P Q P H R V L P T S P S D I A N F I S D G L E A A D S D P S V P P Y D 733
2216 ACAGCTCTCATCTATGACTACGAGGGAGATGGCTCTGTGGCAGGGACCCTGAGCTCCATCCTGTCCAGCCAGGGAGATGAAGACCAGGACTATGACTATCTCCGGGACTGGGGCCCCCGCTTT 2338
734 T A L I Y D Y E G D G S V A G T L S S I L S S Q G D E D Q D Y D Y L R D W G P R F 774
2339 GCTCGGCTGGCGGACATGTATGGGCATCCGTGAGAGCCAGAGCCAGGGGCAGACGTCCTGTGTGGACACGCCCACTCGGCCCAAACAAGGAGGCTCTCTCCTGGGACATGCACCCAGAAATCC 2461
775 A R L A D M Y G H P * 784
2462 TATGAGGGTCAGCAGCACGACCCATCTTTGGCTCCATGGCAGATAAACTCACTGAAGGTCATCTGTGTGAGCTCCAGGGGAGGACTGAGTCCTGTATGGGCTAGGCAGCGGAGGGAGAGCGCT 2584
2585 CTCCCTCTGGAGTGCAGAAGCCACCTTCAATCACCCTGCTAGGGTTCATCCCATCTTTGTGTCCCAGTTGTGACTCTCACCTCTGTATGAAAGCAGGCATCTAAGGAGCAGATTGGAATTAAA 2707
2708 AACAACTGTTCAGTGAAAAAAAAAAAAAA 2736
|