Figure 6 of
Wistow, Mol Vis 2002;
8:196-204.
Figure 6. Novel exon in Nrl
A: The sequence of part of the gene for Nrl (GenBank accession number U95012) showing the location and sequence of the novel exon. Exons 2 and 3 contain the coding sequence for canonical Nrl (shown in red). A partial alternative insert exon (Ins) is shown in blue, with the in frame ORF shown. Three potential in frame upstream splice junction "ag" dinucleotides are shown in green and the first upstream in frame stop codon (taa) is shown in purple. The first upstream splice junction gives the ORF shown. The upper case ATG in blue shows a start codon in frame with the novel exon sequence. B: The partial predicted amino acid sequence of Nrlins is compared with the sequence of human c-Maf (GenBank accession number AAC27039), showing the glycine rich regions in both sequences. The predicted Nrl insert sequence is shown in red.
A
gcctccatgtgctccagacctctcctcctctttgcaggtgcactcctcccagcccagctc 2160 M A L P P S P L A M E Y V N D F D L M cagaATGGCCCTGCCCCCCAGCCCCCTGGCCATGGAATATGTCAATGACTTTGACTTGAT K F E V K R E P S E G R P G P P T A S L GAAGTTTGAGGTAAAGCGGGAACCCTCTGAGGGCCGACCTGGCCCCCCTACAGCCTCACT G S T P Y S S V P P S P T F S E P G M V GGGCTCCACACCTTACAGCTCAGTGCCTCCTTCACCCACCTTCAGTGAACCAGGCATGGT Exon 2 G A T E G T R P G L E E L Y W L A T L Q GGGGGCAACCGAGGGCACCCGGCCAGGCCTGGAGGAGCTGTACTGGCTGGCTACCCTGCA Q Q L G A G E A L G L S P E E A M E L L GCAGCAGCTGGGGGCTGGGGAGGCATTGGGGCTGAGTCCTGAAGAGGCCATGGAGCTGCT Q G Q G P V P V D G P H G Y Y P G S P E GCAGGGTCAGGGCCCAGTCCCTGTTGATGGGCCCCATGGCTACTACCCAGGGAGCCCAGA E T G A Q H V Q GGAGACAGGAGCCCAGCACGTCCAGgtgagtggtcagcaagctggcctgaggggaggcag 2580 ggcaaggaaggaggactgcccaagagaggaaggggagctcccagagggggttatggactg ggacaggggacagagggcaggagaaggggagaaggtcccttgaaagcaatcagatcgaga aaactacattctgcttctcccccttttcttaaaatggagagaaatgagtaggctgaacca ggaggagcagggagaagataagattagcagagaatccaaagggaagaattgagctgggga gtgggcgactccggggggtaaacagatacatggtgtgtggaaaccagggaaagggttatg tgtggagtggagctgggttaagactggtttagattggccttttcaagacttcgtgcttcc cagcccccaaccttctcaggaaggattgggactcatcctattaattacagacacagATGg gggtgttgcgggcattaggatgtaatccaaatcctgtgagggtcaccgggttgcaggctt G N A A R W R V G G C L G A G P L caggacagggAAACGCAGCGCGTTGGAGGGTTGGGGGCTGTCTGGGTGCGGGTCCACTGG D T P G P G A G C R G S Q R R A R G L G ACACACCCGGGCCTGGAGCTGGATGCCGGGGATCCCAGAGACGAGCCCGGGGTTTAGGTG Ins A R R A R L T V A G P A P W G A R L T G CGCGACGGGCTCGCCTGACCGTGGCCGGCCCTGCACCGTGGGGCGCCCGCCTGACTGGAG A T CAACGgtcagctggggggcccggggagcgtcggggcctggggcgggctctggaccgaaac 3300 agactgcgtggaagggcgagccttccggtgaaggtgggagccggggcggggctgtcccgg ggcggagccaggtagcgtcgggccctcagggcagagccgggtgcgacctggcgctgaccc L A E R F S D A A L V S ggtttctgcattctccctccgcagCTGGCAGAGCGGTTTTCCGACGCGGCGCTGGTCTCG M S V R E L N R Q L R G C G R D E A L R ATGTCTGTGCGGGAGCTAAACCGGCAGCTGCGGGGCTGCGGGCGCGACGAGGCGCTGCGG L K Q R R R T L K N R G Y A Q A C R S K CTGAAGCAGAGGCGCCGCACGCTGAAGAACCGCGGCTACGCGCAGGCCTGTCGCTCCAAG Exon 3 R L Q Q R R G L E A E R A R L A A Q L D CGGCTGCAGCAGCGGCGCGGGCTGGAGGCCGAGCGCGCCCGCCTGGCCGCCCAGCTGGAC A L R A E V A R L A R E R D L Y K A R C GCGCTGCGGGCCGAGGTGGCCCGCCTGGCCCGGGAGCGCGATCTCTACAAGGCTCGCTGT D R L T S S G P G S G D P S H L F L * GACCGGCTAACCTCGAGCGGCCCCGGGTCCGGGGACCCCTCCCACCTCTTCCTCTGAgcc 3780 |
B
NRLins 8 GGCLGAGPLDTPGPGAGCRGSQRR-ARGLGARRARLTVAGPAPWGARLTGATLAERFSDA 66 GG G GP G G G G A G G AG G +RFSD Cmaf 214 GGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAG---------GLHFDDRFSDE 264 NRLins 67 ALVSMSVRELNRQLRGCGRDEALRLKQRRRTLKNRGYAQACRSKRLQQRRGLEAERARLA 126 LV+MSVRELNRQLRG ++E +RLKQ+RRTLKNRGYAQ+CR KR+QQR LE+E+ +L Cmaf 265 QLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLL 324 NRLins 127 AQLDALRAEVARLARERDLYKARCDRLTSSG 157 Q+D L+ E++RL RERD YK + ++L SSG Cmaf 325 QQVDHLKQEISRLVRERDAYKEKYEKLVSSG 355 |