Figure 4 of Liang, Mol Vis 2004; 10:773-786.


Figure 4. Translation product of RxHMG1

A: The RxHMG1 open reading frame plus 3' UTR is illustrated. Exon-intron boundaries are shown as uppercase letters in red within the nucleic acid sequence. The 3' UTR is 1.4 kb and contains a poly-A addition site 20 bp from the poly-A tail, which begins immediately after the last nucleotide of this sequence. B: Comparison of the HMG1 box of RxHMG1 with other HMG boxes. Identities are indicated by black shading; similarities are indicated by gray shading. C: The HMG box and proline-rich region within the RxHMG1 protein. The letters shown in the consensus sequence are defined in the table below.

A:

   1 atggacgtccgcctgtatccctcggcgcccgcggtaggtgcgcggcccggggccgagccggccggcctggcacacctggactactaccactgcggcaagtttgatggtgacagtgcctac
     M  D  V  R  L  Y  P  S  A  P  A  V  G  A  R  P  G  A  E  P  A  G  L  A  H  L  D  Y  Y  H  C  G  K  F  D  G  D  S  A  Y
 121 gtggggatgagtgacggaaatccagagctcctgtcaaccagccaGAcctataacagccagggcgagagtaacgaagactatgagattcctcctataacgcctcccaatctccctgagcca
     V  G  M  S  D  G  N  P  E  L  L  S  T  S  Q  T  Y  N  S  Q  G  E  S  N  E  D  Y  E  I  P  P  I  T  P  P  N  L  P  E  P
 241 tccctcctgcacctgggggatcacgaagccggctaccactcgctgtgtcacggccttgcgcccaacggtctgctccctgcctactcgtaccaagcaatggatctccctgccatcatggtg
     S  L  L  H  L  G  D  H  E  A  G  Y  H  S  L  C  H  G  L  A  P  N  G  L  L  P  A  Y  S  Y  Q  A  M  D  L  P  A  I  M  V
 361 tccaacatgctggcccaggacggccatctgctgtcaggccagctgcccacGAtccaggaaatggtccactcggaggtggcggcatatgactcaggccggccagggcccctgctgggccgc
     S  N  M  L  A  Q  D  G  H  L  L  S  G  Q  L  P  T  I  Q  E  M  V  H  S  E  V  A  A  Y  D  S  G  R  P  G  P  L  L  G  R
 481 ccggcgatgctggccagccacatgagtgccctcagccagtctcagctcatctcccagatgggtatccggagtggcattgctcacggctccccatcacctccagggagcaagtcagcgacc
     P  A  M  L  A  S  H  M  S  A  L  S  Q  S  Q  L  I  S  Q  M  G  I  R  S  G  I  A  H  G  S  P  S  P  P  G  S  K  S  A  T
 601 ccctctccatccagttccacacaggaggaggaatcagatgcccatttcaaGAtctcgggagagaagagaccctcagtggacccaggcaaaaaggccaagaatccaaagaagaagaagaag
     P  S  P  S  S  S  T  Q  E  E  E  S  D  A  H  F  K  I  S  G  E  K  R  P  S  V  D  P  G  K  K  A  K  N  P  K  K  K  K  K
 721 aaggaccccaatgagccacagaagccagtgtcggcctacgctctcttcttcagagacactcaggctgccatcaaggggcagaatcccagtgccacctttggagatgtgtccaaaatagtg
     K  D  P  N  E  P  Q  K  P  V  S  A  Y  A  L  F  F  R  D  T  Q  A  A  I  K  G  Q  N  P  S  A  T  F  G  D  V  S  K  I  V
 841 gcgtccatgtgggacagcctgggagaagagcagaaacaggcgtataagaggaagactgaagctgccaagaaggagtacctgaaagccttggcggcctacagagctagcctcgtgtccaaG
     A  S  M  W  D  S  L  G  E  E  Q  K  Q  A  Y  K  R  K  T  E  A  A  K  K  E  Y  L  K  A  L  A  A  Y  R  A  S  L  V  S  K
 961 Agccccccggaccaaggtgaggccaagaacactcaggcaaacccaccagccaaaatgcttccacccaagcagcccatgtacgccatgcccggcctggcttccttcctgacgccctccgac
     S  P  P  D  Q  G  E  A  K  N  T  Q  A  N  P  P  A  K  M  L  P  P  K  Q  P  M  Y  A  M  P  G  L  A  S  F  L  T  P  S  D
1081 ctgcaggccttccgcagtggagcctctcccgccagccttgccaggacgctgggctccaaggccctgctgccgggcctcagcacatcgccgccaccaccctccttccctctcagcccctca
     L  Q  A  F  R  S  G  A  S  P  A  S  L  A  R  T  L  G  S  K  A  L  L  P  G  L  S  T  S  P  P  P  P  S  F  P  L  S  P  S
1201 ctgcaccagcagctgccactgcccccccacgcgcagggcactctcctcagcccgcctctcagcatgtccccagccccgcagcctcctgtcctgcctgcctccatggcactccaggtgcag
     L  H  Q  Q  L  P  L  P  P  H  A  Q  G  T  L  L  S  P  P  L  S  M  S  P  A  P  Q  P  P  V  L  P  A  S  M  A  L  Q  V  Q
1321 ctggcgatgagcccctcacctccagggccacAGgacttcccacacatctctgatttctccagtggctctggctcccgctcacctggcccatccaacccttccagcagcggagactgggat
     L  A  M  S  P  S  P  P  G  P  Q  D  F  P  H  I  S  D  F  S  S  G  S  G  S  R  S  P  G  P  S  N  P  S  S  S  G  D  W  D
1441 gggagttaccccagtggggagcgtggcctcggcacctgcaGActctgcagaggcagcccaccgcccaccaccagcccaaagaacctgcaggaaccttctgcccgctgacctgcttgctcc
     G  S  Y  P  S  G  E  R  G  L  G  T  C  R  L  C  R  G  S  P  P  P  T  T  S  P  K  N  L  Q  E  P  S  A  R  *
1561 agggtagctgtggaccccgctccttggcctgcacacagtcccctgcgtctggacttctggccccagccccaactcaccgggctgcccccttcgaagttgcttagcaacagacacccaccc
1681 ctgatgctgggccagccacaggtgtgctctcagtgtacacaaagatgctgaaactcgttctgtgggttctgtgtaagtagttcactgttttagaactgtgctgaagacatctgtaagatt
1801 attttgtggggagaaagaaagtttcctttaaggttaaaaaaatttttataagacctttggcacatttttttttaagttttatcttaagggagacatgtgcacaagcaactgtcaaggtga
1921 ttctaatctgcacacagagaaaatgggaacttttaagccacacccaggggcattcttcttcctcttcttcctcctcctcctcttcttcttccccctcttcctcttctttttcctcttctt
2041 cctcttcttcatcctcttcttcctcttcctcttcttcatcttttcctcttcttcctcttcttccccttcttctttctctttttccttttcctcttcttcctattcctcttcttccccttc
2161 ttcctcttcttcttcctcttcttccccttcttcttcctgttccccttctttctcttcttccttttcctgttcttcctcttcttccccttctttctctttttacttttcctcttcttccta
2281 ttcctcttcttcctcttcttcctcttcttccccttcttcttcctcttcttccccttctttctctttttccttttcctcttcttcctattcctcttcttcctcttcttcatcttcctcttc
2401 ccctttcttcctcttcctctcctttccttcttcttctgcctcttcgtcctcttctttctcttttttttgtttctgttttactcatatatcccccctatttaaaattgccggcaatgattt
2521 ttcttctggttatctatttatgaagaaaaactgagaacagcattgtggtttctcctaacgtgtgtgtggtcggggtttgggtttggttcttgtcgttcgcagctgtctcctggcccctgc

                                                 poly-A
                                                 ------
2641 aatgtctgtcctggtgccccagtgcttcgctcagacctctttgtaataaaactgctgaaaagtggcaaac 2720
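
The relationship between the two rows of panel A, and the position of the poly-A signal, can be checked with a short script. This is a minimal sketch, not part of the original figure. It assumes the standard genetic code (the figure does not name a translation table) and assumes that the marked poly-A site is the hexamer "aataaa". The function and constant names are hypothetical.

```python
# Standard genetic code in t, c, a, g order.
# Assumption: the figure does not state which translation table was used.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    # Uppercase letters only mark exon-intron boundaries, so fold the case.
    seq = nucleotides.lower()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[pos:pos + 3]] for pos in range(0, usable, 3))


def nucleotides_after_signal(nucleotides, signal="aataaa"):
    """Count the nucleotides between the last occurrence of signal and the end."""
    seq = nucleotides.lower()
    start = seq.rfind(signal)
    if start < 0:
        return None  # signal not present
    return len(seq) - (start + len(signal))


# First seven codons of the row numbered 1, and the complete row numbered 2641.
FIRST_CODONS = "atggacgtccgcctgtatccc"
LAST_ROW = "aatgtctgtcctggtgccccagtgcttcgctcagacctctttgtaataaaactgctgaaaagtggcaaac"

print(translate(FIRST_CODONS))             # MDVRLYP
print(translate("gcccgctga"))              # AR* (the last two residues and the stop)
print(nucleotides_after_signal(LAST_ROW))  # 20
```

The first result reproduces the start of the translation shown under row 1, and the last agrees with the legend's statement that the poly-A addition site lies 20 bp from the poly-A tail.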

 

B:

[Panel B image (7 K) not reproduced.]

 

C:

Consensus     XXXnXrXgXpXPKRjXShdffdfskXRXsfXXEXPXasXXpaXXXfGrXWsp
HMGH_IRV6     IGVKPKDVTNVPKRNKSSYLFFCQEIRPSIVAEMPDIKPNQVMVHLGKKWSE
HMG1_MOUSE    TKKKFKD-PNAPKRPPSAFFLFCSEYRPKIKGEHPGLSIGDVAKKLGEMWNN
ABF2_YEAST    TQLRNELIKQGPKRPTSAYFLYLQDHRSQFVKENPTLRPAEISKIAGEKWQN
RxHMG1        KKKKKKD-PNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDS

Consensus     XXfkXnrXYXXXXXrXXsrYXXXXXXXrXXXXPXXXXXXfXXfXrsXnXr
HMGH_IRV6     LPLEDRKKYDVMAVEDRKRYLASKEANKKLNKPVKISGYLQFCADERKIK
HMG1_MOUSE    TAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEKSKKKK
ABF2_YEAST    LEADIKEKYISERKKLYSEYQKAKKEFDEKLPPKKPAGPFIKYANEVRSQ
RxHMG1        LGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSP--PDQGEAKNTQAN

 

"Meta"
residue       Description         Amino acids represented
-------   --------------------   -------------------------
   a                             I, L, V
   d      aromatic               F, W, Y
   f      hydrophobic            A, I, L, V, M, F, W, Y, C
   h      small                  A, G, S
   j      high turn propensity   G, N, P
   k      negative charge        D, E
   n      full positive charge   K, R
   p                             D, E, N, Q
   q      positive charge        H, K, R
   r      polar                  D, E, N, Q, H, K, R
   s      hydrophilic            D, E, N, Q, H, K, R, S, T
   X      any residue            all
   g      gap                    none
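
The table above can be read as a lookup from each lowercase consensus symbol to a set of residues. The following is a minimal sketch of that reading, not the method used to build the figure. It assumes that uppercase consensus letters stand for themselves, that "X" places no constraint on a column (gaps included), and that "g" matches only the gap character "-". All names are hypothetical.

```python
# Residue classes copied from the table of "meta" residues.
META = {
    "a": set("ILV"),
    "d": set("FWY"),
    "f": set("AILVMFWYC"),
    "h": set("AGS"),
    "j": set("GNP"),
    "k": set("DE"),
    "n": set("KR"),
    "p": set("DENQ"),
    "q": set("HKR"),
    "r": set("DENQHKR"),
    "s": set("DENQHKRST"),
    "g": set("-"),  # assumption: "gap" is matched by the alignment character '-'
}


def symbol_matches(symbol, residue):
    """Return True if residue is allowed by one consensus symbol."""
    if symbol == "X":
        return True  # assumption: X constrains nothing, gaps included
    if symbol in META:
        return residue in META[symbol]
    return symbol == residue  # assumption: uppercase letters are literal residues


def mismatches(consensus, sequence):
    """Return the 0-based columns where sequence departs from consensus."""
    if len(consensus) != len(sequence):
        raise ValueError("consensus and sequence must be the same length")
    return [
        column
        for column, (symbol, residue) in enumerate(zip(consensus, sequence))
        if not symbol_matches(symbol, residue)
    ]


# Consensus columns 12-15 of the first block against the same RxHMG1 columns.
print(mismatches("PKRj", "PQKP"))  # [1, 2]
```

In this fragment RxHMG1 carries Q and K where the consensus has K and R, so columns 1 and 2 are reported, while the final P satisfies "j" (G, N, P).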

Liang, Mol Vis 2004; 10:773-786 <http://www.molvis.org/molvis/v10/a92/>
©2004 Molecular Vision <http://www.molvis.org/molvis/>
ISSN 1090-0535