Figure 6 of Wistow, Mol Vis 2002; 8:196-204.


Figure 6. Novel exon in Nrl

A: The sequence of part of the gene for Nrl (GenBank accession number U95012) showing the location and sequence of the novel exon. Exons 2 and 3 contain the coding sequence for canonical Nrl (shown in red). A partial alternative insert exon (Ins) is shown in blue, with the in frame ORF shown. Three potential in frame upstream splice junction "ag" dinucleotides are shown in green and the first upstream in frame stop codon (taa) is shown in purple. The first upstream splice junction gives the ORF shown. The upper case ATG in blue shows a start codon in frame with the novel exon sequence. B: The partial predicted amino acid sequence of Nrlins is compared with the sequence of human c-Maf (GenBank accession number AAC27039), showing the glycine rich regions in both sequences. The predicted Nrl insert sequence is shown in red.

A

gcctccatgtgctccagacctctcctcctctttgcaggtgcactcctcccagcccagctc 2160
     M  A  L  P  P  S  P  L  A  M  E  Y  V  N  D  F  D  L  M
cagaATGGCCCTGCCCCCCAGCCCCCTGGCCATGGAATATGTCAATGACTTTGACTTGAT
  K  F  E  V  K  R  E  P  S  E  G  R  P  G  P  P  T  A  S  L
GAAGTTTGAGGTAAAGCGGGAACCCTCTGAGGGCCGACCTGGCCCCCCTACAGCCTCACT
  G  S  T  P  Y  S  S  V  P  P  S  P  T  F  S  E  P  G  M  V
GGGCTCCACACCTTACAGCTCAGTGCCTCCTTCACCCACCTTCAGTGAACCAGGCATGGT Exon 2
  G  A  T  E  G  T  R  P  G  L  E  E  L  Y  W  L  A  T  L  Q
GGGGGCAACCGAGGGCACCCGGCCAGGCCTGGAGGAGCTGTACTGGCTGGCTACCCTGCA
  Q  Q  L  G  A  G  E  A  L  G  L  S  P  E  E  A  M  E  L  L
GCAGCAGCTGGGGGCTGGGGAGGCATTGGGGCTGAGTCCTGAAGAGGCCATGGAGCTGCT
  Q  G  Q  G  P  V  P  V  D  G  P  H  G  Y  Y  P  G  S  P  E
GCAGGGTCAGGGCCCAGTCCCTGTTGATGGGCCCCATGGCTACTACCCAGGGAGCCCAGA
  E  T  G  A  Q  H  V  Q
GGAGACAGGAGCCCAGCACGTCCAGgtgagtggtcagcaagctggcctgaggggaggcag 2580
ggcaaggaaggaggactgcccaagagaggaaggggagctcccagagggggttatggactg
ggacaggggacagagggcaggagaaggggagaaggtcccttgaaagcaatcagatcgaga
aaactacattctgcttctcccccttttcttaaaatggagagaaatgagtaggctgaacca
ggaggagcagggagaagataagattagcagagaatccaaagggaagaattgagctgggga
gtgggcgactccggggggtaaacagatacatggtgtgtggaaaccagggaaagggttatg
tgtggagtggagctgggttaagactggtttagattggccttttcaagacttcgtgcttcc
cagcccccaaccttctcaggaaggattgggactcatcctattaattacagacacagATGg
gggtgttgcgggcattaggatgtaatccaaatcctgtgagggtcaccgggttgcaggctt
         G  N  A  A  R  W  R  V  G  G  C  L  G  A  G  P  L
caggacagggAAACGCAGCGCGTTGGAGGGTTGGGGGCTGTCTGGGTGCGGGTCCACTGG
D  T  P  G  P  G  A  G  C  R  G  S  Q  R  R  A  R  G  L  G
ACACACCCGGGCCTGGAGCTGGATGCCGGGGATCCCAGAGACGAGCCCGGGGTTTAGGTG Ins
A  R  R  A  R  L  T  V  A  G  P  A  P  W  G  A  R  L  T  G
CGCGACGGGCTCGCCTGACCGTGGCCGGCCCTGCACCGTGGGGCGCCCGCCTGACTGGAG
A  T
CAACGgtcagctggggggcccggggagcgtcggggcctggggcgggctctggaccgaaac 3300
agactgcgtggaagggcgagccttccggtgaaggtgggagccggggcggggctgtcccgg
ggcggagccaggtagcgtcgggccctcagggcagagccgggtgcgacctggcgctgaccc

                         L  A  E  R  F  S  D  A  A  L  V  S
ggtttctgcattctccctccgcagCTGGCAGAGCGGTTTTCCGACGCGGCGCTGGTCTCG
 M  S  V  R  E  L  N  R  Q  L  R  G  C  G  R  D  E  A  L  R
ATGTCTGTGCGGGAGCTAAACCGGCAGCTGCGGGGCTGCGGGCGCGACGAGGCGCTGCGG
 L  K  Q  R  R  R  T  L  K  N  R  G  Y  A  Q  A  C  R  S  K
CTGAAGCAGAGGCGCCGCACGCTGAAGAACCGCGGCTACGCGCAGGCCTGTCGCTCCAAG Exon 3
 R  L  Q  Q  R  R  G  L  E  A  E  R  A  R  L  A  A  Q  L  D
CGGCTGCAGCAGCGGCGCGGGCTGGAGGCCGAGCGCGCCCGCCTGGCCGCCCAGCTGGAC
  A  L  R  A  E  V  A  R  L  A  R  E  R  D  L  Y  K  A  R  C
GCGCTGCGGGCCGAGGTGGCCCGCCTGGCCCGGGAGCGCGATCTCTACAAGGCTCGCTGT
 D  R  L  T  S  S  G  P  G  S  G  D  P  S  H  L  F  L  *
GACCGGCTAACCTCGAGCGGCCCCGGGTCCGGGGACCCCTCCCACCTCTTCCTCTGAgcc 3780

B

NRLins   8 GGCLGAGPLDTPGPGAGCRGSQRR-ARGLGARRARLTVAGPAPWGARLTGATLAERFSDA 66
           GG  G GP    G G G  G     A G G        AG         G    +RFSD
Cmaf   214 GGAGGGGPASAGGGGGGGGGGGGGGAAGAGGALHPHHAAG---------GLHFDDRFSDE 264

NRLins  67 ALVSMSVRELNRQLRGCGRDEALRLKQRRRTLKNRGYAQACRSKRLQQRRGLEAERARLA 126
            LV+MSVRELNRQLRG  ++E +RLKQ+RRTLKNRGYAQ+CR KR+QQR  LE+E+ +L
Cmaf   265 QLVTMSVRELNRQLRGVSKEEVIRLKQKRRTLKNRGYAQSCRFKRVQQRHVLESEKNQLL 324

NRLins 127 AQLDALRAEVARLARERDLYKARCDRLTSSG 157
            Q+D L+ E++RL RERD YK + ++L SSG
Cmaf   325 QQVDHLKQEISRLVRERDAYKEKYEKLVSSG 355

Wistow, Mol Vis 2002; 8:196-204 <http://www.molvis.org/molvis/v8/a26/>
©2002 Molecular Vision <http://www.molvis.org/molvis/>
ISSN 1090-0535