Figure 4 of
Liang, Mol Vis 2004;
10:773-786.
Figure 4. Translation product of RxHMG1
A: The RxHMG1 open reading frame plus 3' UTR is illustrated. Exon-intron boundaries are shown as uppercase letters in red within the nucleic acid sequence. The 3' UTR is 1.4 kb and contains a poly-A addition site 20 bp from the poly-A tail, which begins immediately after the last nucleotide of this sequence. B: Comparison of the HMG1 box of RxHMG1 with other HMG boxes. Identities are indicated by black shading; similarities are indicated by gray shading. C: The HMG box and proline rich region within the RxHMG1 protein. The letters shown in the consensus sequence are defined in the table below.
A:
1 atggacgtccgcctgtatccctcggcgcccgcggtaggtgcgcggcccggggccgagccggccggcctggcacacctggactactaccactgcggcaagtttgatggtgacagtgcctac
M D V R L Y P S A P A V G A R P G A E P A G L A H L D Y Y H C G K F D G D S A Y
121 gtggggatgagtgacggaaatccagagctcctgtcaaccagccaGAcctataacagccagggcgagagtaacgaagactatgagattcctcctataacgcctcccaatctccctgagcca
V G M S D G N P E L L S T S Q T Y N S Q G E S N E D Y E I P P I T P P N L P E P
241 tccctcctgcacctgggggatcacgaagccggctaccactcgctgtgtcacggccttgcgcccaacggtctgctccctgcctactcgtaccaagcaatggatctccctgccatcatggtg
S L L H L G D H E A G Y H S L C H G L A P N G L L P A Y S Y Q A M D L P A I M V
361 tccaacatgctggcccaggacggccatctgctgtcaggccagctgcccacGAtccaggaaatggtccactcggaggtggcggcatatgactcaggccggccagggcccctgctgggccgc
S N M L A Q D G H L L S G Q L P T I Q E M V H S E V A A Y D S G R P G P L L G R
481 ccggcgatgctggccagccacatgagtgccctcagccagtctcagctcatctcccagatgggtatccggagtggcattgctcacggctccccatcacctccagggagcaagtcagcgacc
P A M L A S H M S A L S Q S Q L I S Q M G I R S G I A H G S P S P P G S K S A T
601 ccctctccatccagttccacacaggaggaggaatcagatgcccatttcaaGAtctcgggagagaagagaccctcagtggacccaggcaaaaaggccaagaatccaaagaagaagaagaag
P S P S S S T Q E E E S D A H F K I S G E K R P S V D P G K K A K N P K K K K K
721 aaggaccccaatgagccacagaagccagtgtcggcctacgctctcttcttcagagacactcaggctgccatcaaggggcagaatcccagtgccacctttggagatgtgtccaaaatagtg
K D P N E P Q K P V S A Y A L F F R D T Q A A I K G Q N P S A T F G D V S K I V
841 gcgtccatgtgggacagcctgggagaagagcagaaacaggcgtataagaggaagactgaagctgccaagaaggagtacctgaaagccttggcggcctacagagctagcctcgtgtccaaG
A S M W D S L G E E Q K Q A Y K R K T E A A K K E Y L K A L A A Y R A S L V S K
961 Agccccccggaccaaggtgaggccaagaacactcaggcaaacccaccagccaaaatgcttccacccaagcagcccatgtacgccatgcccggcctggcttccttcctgacgccctccgac
S P P D Q G E A K N T Q A N P P A K M L P P K Q P M Y A M P G L A S F L T P S D
1081 ctgcaggccttccgcagtggagcctctcccgccagccttgccaggacgctgggctccaaggccctgctgccgggcctcagcacatcgccgccaccaccctccttccctctcagcccctca
L Q A F R S G A S P A S L A R T L G S K A L L P G L S T S P P P P S F P L S P S
1201 ctgcaccagcagctgccactgcccccccacgcgcagggcactctcctcagcccgcctctcagcatgtccccagccccgcagcctcctgtcctgcctgcctccatggcactccaggtgcag
L H Q Q L P L P P H A Q G T L L S P P L S M S P A P Q P P V L P A S M A L Q V Q
1321 ctggcgatgagcccctcacctccagggccacAGgacttcccacacatctctgatttctccagtggctctggctcccgctcacctggcccatccaacccttccagcagcggagactgggat
L A M S P S P P G P Q D F P H I S D F S S G S G S R S P G P S N P S S S G D W D
1441 gggagttaccccagtggggagcgtggcctcggcacctgcaGActctgcagaggcagcccaccgcccaccaccagcccaaagaacctgcaggaaccttctgcccgctgacctgcttgctcc
G S Y P S G E R G L G T C R L C R G S P P P T T S P K N L Q E P S A R *
1561 agggtagctgtggaccccgctccttggcctgcacacagtcccctgcgtctggacttctggccccagccccaactcaccgggctgcccccttcgaagttgcttagcaacagacacccaccc
1681 ctgatgctgggccagccacaggtgtgctctcagtgtacacaaagatgctgaaactcgttctgtgggttctgtgtaagtagttcactgttttagaactgtgctgaagacatctgtaagatt
1801 attttgtggggagaaagaaagtttcctttaaggttaaaaaaatttttataagacctttggcacatttttttttaagttttatcttaagggagacatgtgcacaagcaactgtcaaggtga
1921 ttctaatctgcacacagagaaaatgggaacttttaagccacacccaggggcattcttcttcctcttcttcctcctcctcctcttcttcttccccctcttcctcttctttttcctcttctt
2041 cctcttcttcatcctcttcttcctcttcctcttcttcatcttttcctcttcttcctcttcttccccttcttctttctctttttccttttcctcttcttcctattcctcttcttccccttc
2161 ttcctcttcttcttcctcttcttccccttcttcttcctgttccccttctttctcttcttccttttcctgttcttcctcttcttccccttctttctctttttacttttcctcttcttccta
2281 ttcctcttcttcctcttcttcctcttcttccccttcttcttcctcttcttccccttctttctctttttccttttcctcttcttcctattcctcttcttcctcttcttcatcttcctcttc
2401 ccctttcttcctcttcctctcctttccttcttcttctgcctcttcgtcctcttctttctcttttttttgtttctgttttactcatatatcccccctatttaaaattgccggcaatgattt
2521 ttcttctggttatctatttatgaagaaaaactgagaacagcattgtggtttctcctaacgtgtgtgtggtcggggtttgggtttggttcttgtcgttcgcagctgtctcctggcccctgc
poly-A
------
2641 aatgtctgtcctggtgccccagtgcttcgctcagacctctttgtaataaaactgctgaaaagtggcaaac 2720
|
B:

C:
Consensus XXXnXrXgXpXPKRjXShdffdfskXRXsfXXEXPXasXXpaXXXfGrXWsp HMGH_IRV6 IGVKPKDVTNVPKRNKSSYLFFCQEIRPSIVAEMPDIKPNQVMVHLGKKWSE HMG1_MOUSE TKKKFKD-PNAPKRPPSAFFLFCSEYRPKIKGEHPGLSIGDVAKKLGEMWNN ABF2_YEAST TQLRNELIKQGPKRPTSAYFLYLQDHRSQFVKENPTLRPAEISKIAGEKWQN RxHMG1 KKKKKKD-PNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDS Consensus XXfkXnrXYXXXXXrXXsrYXXXXXXXrXXXXPXXXXXXfXXfXrsXnXr HMGH_IRV6 LPLEDRKKYDVMAVEDRKRYLASKEANKKLNKPVKISGYLQFCADERKIK HMG1_MOUSE TAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEKSKKKK ABF2_YEAST LEADIKEKYISERKKLYSEYQKAKKEFDEKLPPKKPAGPFIKYANEVRSQ RxHMG1 LGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSP--PDQGEAKNTQAN |
"Meta" residue Description Amino acids represented ------- -------------------- ------------------------- a I, L, V d aromatic F, W, Y f hydrophobic A, I, L, V, M, F, W, Y, C h small A, G, S j high turn propensity G, N, P k negative charge D, E n full positive charge K, R p D, E, N, Q q positive charge H, K, R r polar D, E, N, Q, H, K, R s hydrophilic D, E, N, Q, H, K, R, S, T X any residue all g gap none |