Figure 2 of Tomi, Mol Vis 2004; 10:537-543.
Figure 2. Nucleotide and deduced amino acid sequences of rat M-cadherin.
The deduced amino acid sequence is shown below the nucleotide sequence. The stop codon is denoted by an asterisk. The signal peptide is in blue, and the postulated furin cleavage site of the precursor polypeptide is indicated by a caret (^). The extracellular domain is divided into five cadherin extracellular subdomain repeats (EC1-EC5). The transmembrane domain is in green. In the cytoplasmic domain following the transmembrane domain, the membrane-proximal conserved domain (MPCD) is in purple and the catenin binding sequence (CBS) is in orange. The characteristic cadherin consensus sequences are shaded in red. The four cysteine residues in EC5 are in pink. The N-glycosylation sites are indicated in brown. The nucleotide sequence of rat M-cadherin is available in GenBank under accession number AB176538.
1 TCCCGCCGCTGCCCCCATGAGTTCTGCTCTGCTCTTCGCCCTCGGGCTGCTTGCCCAGAGCCTTGGCCTCTCCTGGGCAGTCCCTGAGCCTGAACCCAGCACCCTGTACCCCTGGCGCCGGGCA 124
1 M S S A L L F A L G L L A Q S L G L S W A V P E P E P S T L Y P W R R A 36

125 TCAGCCCCAGGCCGTGTGCGGAGAGCCTGGGTCATCCCACCCATCAGTGTGTCTGAGAACCACAAACGCCTCCCCTACCCACTTGTGCAGATCAAGTCTGACAAACAACAGCTAGGCAGTGTC 247
37 S A P G R V R R A W V I P P I S V S E N H K R L P Y P L V Q I K S D K Q Q L G S V 77
^ EC1-->

248 ATCTACAGTATCCAGGGTCCCGGTGTGGACGAGGAGCCCCGAAACGTCTTCTCCATCGACAAGTTCACTGGGAGGGTGTACCTCAATGCCACTCTGGACCGTGAGAAGACGGACCGCTTCAGG 370
78 I Y S I Q G P G V D E E P R N V F S I D K F T G R V Y L N A T L D R E K T D R F R 118

371 CTGAGGGCCTTTGCCCTGGACTTGGGTGGCTCTACCCTGGAGGACCCCACGGACCTGGAGATCGTCGTGGTGGATCAAAATGACAACCGGCCAGTCTTCCTACAGGATGTGTTCAGAGGCCGC 493
119 L R A F A L D L G G S T L E D P T D L E I V V V D Q N D N R P V F L Q D V F R G R 159
EC2-->

494 ATCCTGGAGGGTGCCATCCCAGGCACCTTCGTAACCAGGGCTGAGGCCACAGATGCCGACGATCCGGAGACAGACAATGCGGCCCTCAGGTTCTCTATCCTGGAGCAGGGCAGCCCTGAGTTG 616
160 I L E G A I P G T F V T R A E A T D A D D P E T D N A A L R F S I L E Q G S P E L 200

617 TTCAGCATTGACGAGCACACTGGAGAGATCCGCACAGTGCAAGTGGGGCTGGACCGTGAGGTGGTGGCTGTGTACAACCTGACCTTGCAGGTGGCAGACATGTCCGGAGATGGCCTCACCGCC 739
201 F S I D E H T G E I R T V Q V G L D R E V V A V Y N L T L Q V A D M S G D G L T A 241

740 ACAGCCTCGGCCATCATCTCCGTAGATGACGTCAACGACAATGCCCCCGAGTTCACCAAGGATGAGTTCTTTATGGAGGCTGCAGAGGCTGTCAGTGGAGTGGACGTGGGACGGCTTGAGGTA 862
242 T A S A I I S V D D V N D N A P E F T K D E F F M E A A E A V S G V D V G R L E V 282
EC3-->

863 GAAGACAAGGATCTGCCCGGTTCCCCCAACTGGGTGGCCAGGTTCACCATCCTGGAAGGCGATCCTGACGGGCAGTTCAAGATCTATACGGACCCCAAGACCAATGAAGGTGTGCTGTCCGTG 985
283 E D K D L P G S P N W V A R F T I L E G D P D G Q F K I Y T D P K T N E G V L S V 323

986 GTCAAGCCCCTGGATTATGAGAGCCGTGAGCAGTATGAGCTCAGAGTGTCTGTACAGAACGAGGCCCCTCTGCAGACAGCTGCCCCCCGGGCCCAGCGGGGCCAGACCAGGGTCAGTGTGTGG 1108
324 V K P L D Y E S R E Q Y E L R V S V Q N E A P L Q T A A P R A Q R G Q T R V S V W 364

1109 GTTCAGGACACCAACGAGGCTCCAGTGTTTCCAGAGAACCCACTGAGGACGAGCATAGCTGAAGGAGCCCCGCCAGGCACCTCTGTGGCCACCTTCTCTGCCCGAGACCCTGACACGGAACAG 1231
365 V Q D T N E A P V F P E N P L R T S I A E G A P P G T S V A T F S A R D P D T E Q 405
EC4-->

1232 CTGCAGAGAATCAGCTACTCTAAGGACTACGACCCAGAAGACTGGCTGCAAGTGGACCGGGCCACAGGCAGGATTCAGACCCAGCGAGTGTTGAGCCCTGCCTCACCCTTTTTAAAGGACGGC 1354
406 L Q R I S Y S K D Y D P E D W L Q V D R A T G R I Q T Q R V L S P A S P F L K D G 446

1355 TGGTACAGGGCCATCATCCTAGCCCTGGACAACGCCATGCCTCCCAGCACAGCCACAGGCACCCTGTCCATTGAGATCTTAGAAGTCAACGACCATGCCCCTGCACTGGCCCCTCCTCTGTCT 1477
447 W Y R A I I L A L D N A M P P S T A T G T L S I E I L E V N D H A P A L A P P L S 487
EC5-->

1478 GGCAGCTTGTGCAGTGAACCGGACCAAGGCCCCGGTCTCCTCTTGGGTGCCACGGATGAGGACCTGCCCCCGCACGGGGCCCCCTTCCACTTCCAGCTGAACCCCAGGGTACCAGATCTCGGC 1600
488 G S L C S E P D Q G P G L L L G A T D E D L P P H G A P F H F Q L N P R V P D L G 528

1601 CGGAACTGGAGCCTCAGCCAGATTAACGTGAGCCATGCACGCTTGCGGCTCCGACACCAGGTCTCTGAGGGCCTGCATCGCCTGAGCCTGCTGCTCCAGGACTCTGGGGAGCCGCCCCAGCAG 1723
529 R N W S L S Q I N V S H A R L R L R H Q V S E G L H R L S L L L Q D S G E P P Q Q 569

1724 CGAGAGCAAACGCTGAACGTTACCGTGTGTCGCTGTGGGTTGGATGGCACCTGCCTGCCCGGGGCCGCTGCGCTGCAAGGAGGAGGTGTAGGCGTCGGCTTGGGCGCACTGGTCATTGTGCTG 1846
570 R E Q T L N V T V C R C G L D G T C L P G A A A L Q G G G V G V G L G A L V I V L 610

1847 GCCAGCACCGTGGTCCTACTGGTTCTCATCCTGCTTGCTGCGCTCCGCACACGGTTCCGGGGGCAGTCTCGGAGCAAGAGTCTGTTGCATGGGCTTCAGGAGGATCTTCGGGACAACATTCTT 1969
611 A S T V V L L V L I L L A A L R T R F R G Q S R S K S L L H G L Q E D L R D N I L 651

1970 AACTACGATGAACAAGGAGGCGGGGAGGAGGACCAGGATGCCTACGACATAAACCAGCTGCGCCACCCAGTGGAACCGAAGGCCACCAGCCGCTCTTTGGGCCGGCCACCCCTGCGCAGGGAT 2092
652 N Y D E Q G G G E E D Q D A Y D I N Q L R H P V E P K A T S R S L G R P P L R R D 692

2093 GCACCCTTCAGCTATGTGCCACAGCCACATCGAGTACTTCCTACCAGCCCGTCTGACATCGCCAACTTCATCAGTGACGGCTTGGAGGCTGCGGACAGCGACCCCAGCGTGCCTCCCTATGAC 2215
693 A P F S Y V P Q P H R V L P T S P S D I A N F I S D G L E A A D S D P S V P P Y D 733

2216 ACAGCTCTCATCTATGACTACGAGGGAGATGGCTCTGTGGCAGGGACCCTGAGCTCCATCCTGTCCAGCCAGGGAGATGAAGACCAGGACTATGACTATCTCCGGGACTGGGGCCCCCGCTTT 2338
734 T A L I Y D Y E G D G S V A G T L S S I L S S Q G D E D Q D Y D Y L R D W G P R F 774

2339 GCTCGGCTGGCGGACATGTATGGGCATCCGTGAGAGCCAGAGCCAGGGGCAGACGTCCTGTGTGGACACGCCCACTCGGCCCAAACAAGGAGGCTCTCTCCTGGGACATGCACCCAGAAATCC 2461
775 A R L A D M Y G H P * 784

2462 TATGAGGGTCAGCAGCACGACCCATCTTTGGCTCCATGGCAGATAAACTCACTGAAGGTCATCTGTGTGAGCTCCAGGGGAGGACTGAGTCCTGTATGGGCTAGGCAGCGGAGGGAGAGCGCT 2584

2585 CTCCCTCTGGAGTGCAGAAGCCACCTTCAATCACCCTGCTAGGGTTCATCCCATCTTTGTGTCCCAGTTGTGACTCTCACCTCTGTATGAAAGCAGGCATCTAAGGAGCAGATTGGAATTAAA 2707

2708 AACAACTGTTCAGTGAAAAAAAAAAAAAA 2736
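
The correspondence between the nucleotide and deduced amino acid rows can be checked with a short script. The following is a minimal sketch in Python, not the authors' method. It assumes the standard genetic code and the common N-X-S/T sequon rule (X is any residue except proline) for candidate N-glycosylation sites; the legend does not state which rule was used for the sites marked in the figure. The names CODON_TABLE, translate, find_sequons, ROW1, and EC1_FRAGMENT are illustrative only. ROW1 is the first nucleotide row of the figure (positions 1-124), and EC1_FRAGMENT is the amino acid row for residues 78-118.

```python
import re
from itertools import product

# Standard genetic code, listed in TCAG order (assumed; NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, start=0):
    """Translate nt from the 0-based offset `start` until a stop codon or the end."""
    protein = []
    for i in range(start, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":  # stop codon, shown as an asterisk in the figure
            break
        protein.append(aa)
    return "".join(protein)


def find_sequons(protein, first_residue=1):
    """Return residue numbers of N in N-X-S/T motifs where X is not proline."""
    # The lookahead keeps overlapping motifs from hiding one another.
    return [m.start() + first_residue
            for m in re.finditer(r"N(?=[^P][ST])", protein)]


# First nucleotide row of the figure (positions 1-124); the ATG starts at position 17.
ROW1 = ("TCCCGCCGCTGCCCCCATGAGTTCTGCTCTGCTCTTCGCCCTCGGGCTGCTTGCCCAGAGCCTTGGCC"
        "TCTCCTGGGCAGTCCCTGAGCCTGAACCCAGCACCCTGTACCCCTGGCGCCGGGCA")

# Amino acid row for residues 78-118, written without spaces.
EC1_FRAGMENT = "IYSIQGPGVDEEPRNVFSIDKFTGRVYLNATLDREKTDRFR"

print(translate(ROW1, start=16))                      # MSSALLFALGLLAQSLGLSWAVPEPEPSTLYPWRRA
print(find_sequons(EC1_FRAGMENT, first_residue=78))   # [106]
```

The translated string matches residues 1-36 as printed in the figure, and the single candidate sequon reported for the fragment is the N-A-T motif at residue 106. Running the same functions over the full sequence would require concatenating all nucleotide rows in order.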