Figure 4 of
Liang, Mol Vis 2004;
10:773-786.
Figure 4. Translation product of RxHMG1
A: The RxHMG1 open reading frame plus 3' UTR is illustrated. Exon-intron boundaries are shown as uppercase letters in red within the nucleic acid sequence. The 3' UTR is 1.4 kb and contains a poly-A addition site 20 bp from the poly-A tail, which begins immediately after the last nucleotide of this sequence. B: Comparison of the HMG1 box of RxHMG1 with other HMG boxes. Identities are indicated by black shading; similarities are indicated by gray shading. C: The HMG box and proline rich region within the RxHMG1 protein. The letters shown in the consensus sequence are defined in the table below.
A:
1 atggacgtccgcctgtatccctcggcgcccgcggtaggtgcgcggcccggggccgagccggccggcctggcacacctggactactaccactgcggcaagtttgatggtgacagtgcctac M D V R L Y P S A P A V G A R P G A E P A G L A H L D Y Y H C G K F D G D S A Y 121 gtggggatgagtgacggaaatccagagctcctgtcaaccagccaGAcctataacagccagggcgagagtaacgaagactatgagattcctcctataacgcctcccaatctccctgagcca V G M S D G N P E L L S T S Q T Y N S Q G E S N E D Y E I P P I T P P N L P E P 241 tccctcctgcacctgggggatcacgaagccggctaccactcgctgtgtcacggccttgcgcccaacggtctgctccctgcctactcgtaccaagcaatggatctccctgccatcatggtg S L L H L G D H E A G Y H S L C H G L A P N G L L P A Y S Y Q A M D L P A I M V 361 tccaacatgctggcccaggacggccatctgctgtcaggccagctgcccacGAtccaggaaatggtccactcggaggtggcggcatatgactcaggccggccagggcccctgctgggccgc S N M L A Q D G H L L S G Q L P T I Q E M V H S E V A A Y D S G R P G P L L G R 481 ccggcgatgctggccagccacatgagtgccctcagccagtctcagctcatctcccagatgggtatccggagtggcattgctcacggctccccatcacctccagggagcaagtcagcgacc P A M L A S H M S A L S Q S Q L I S Q M G I R S G I A H G S P S P P G S K S A T 601 ccctctccatccagttccacacaggaggaggaatcagatgcccatttcaaGAtctcgggagagaagagaccctcagtggacccaggcaaaaaggccaagaatccaaagaagaagaagaag P S P S S S T Q E E E S D A H F K I S G E K R P S V D P G K K A K N P K K K K K 721 aaggaccccaatgagccacagaagccagtgtcggcctacgctctcttcttcagagacactcaggctgccatcaaggggcagaatcccagtgccacctttggagatgtgtccaaaatagtg K D P N E P Q K P V S A Y A L F F R D T Q A A I K G Q N P S A T F G D V S K I V 841 gcgtccatgtgggacagcctgggagaagagcagaaacaggcgtataagaggaagactgaagctgccaagaaggagtacctgaaagccttggcggcctacagagctagcctcgtgtccaaG A S M W D S L G E E Q K Q A Y K R K T E A A K K E Y L K A L A A Y R A S L V S K 961 Agccccccggaccaaggtgaggccaagaacactcaggcaaacccaccagccaaaatgcttccacccaagcagcccatgtacgccatgcccggcctggcttccttcctgacgccctccgac S P P D Q G E A K N T Q A N P P A K M L P P K Q P M Y A M P G L A S F L T P S D 1081 ctgcaggccttccgcagtggagcctctcccgccagccttgccaggacgctgggctccaaggccctgctgccgggcctcagcacatcgccgccaccaccctccttccctctcagcccctca L Q A F R S G A S P A S L A R T L G S K A L L P G L S T S P P P P S F P L S P S 1201 ctgcaccagcagctgccactgcccccccacgcgcagggcactctcctcagcccgcctctcagcatgtccccagccccgcagcctcctgtcctgcctgcctccatggcactccaggtgcag L H Q Q L P L P P H A Q G T L L S P P L S M S P A P Q P P V L P A S M A L Q V Q 1321 ctggcgatgagcccctcacctccagggccacAGgacttcccacacatctctgatttctccagtggctctggctcccgctcacctggcccatccaacccttccagcagcggagactgggat L A M S P S P P G P Q D F P H I S D F S S G S G S R S P G P S N P S S S G D W D 1441 gggagttaccccagtggggagcgtggcctcggcacctgcaGActctgcagaggcagcccaccgcccaccaccagcccaaagaacctgcaggaaccttctgcccgctgacctgcttgctcc G S Y P S G E R G L G T C R L C R G S P P P T T S P K N L Q E P S A R * 1561 agggtagctgtggaccccgctccttggcctgcacacagtcccctgcgtctggacttctggccccagccccaactcaccgggctgcccccttcgaagttgcttagcaacagacacccaccc 1681 ctgatgctgggccagccacaggtgtgctctcagtgtacacaaagatgctgaaactcgttctgtgggttctgtgtaagtagttcactgttttagaactgtgctgaagacatctgtaagatt 1801 attttgtggggagaaagaaagtttcctttaaggttaaaaaaatttttataagacctttggcacatttttttttaagttttatcttaagggagacatgtgcacaagcaactgtcaaggtga 1921 ttctaatctgcacacagagaaaatgggaacttttaagccacacccaggggcattcttcttcctcttcttcctcctcctcctcttcttcttccccctcttcctcttctttttcctcttctt 2041 cctcttcttcatcctcttcttcctcttcctcttcttcatcttttcctcttcttcctcttcttccccttcttctttctctttttccttttcctcttcttcctattcctcttcttccccttc 2161 ttcctcttcttcttcctcttcttccccttcttcttcctgttccccttctttctcttcttccttttcctgttcttcctcttcttccccttctttctctttttacttttcctcttcttccta 2281 ttcctcttcttcctcttcttcctcttcttccccttcttcttcctcttcttccccttctttctctttttccttttcctcttcttcctattcctcttcttcctcttcttcatcttcctcttc 2401 ccctttcttcctcttcctctcctttccttcttcttctgcctcttcgtcctcttctttctcttttttttgtttctgttttactcatatatcccccctatttaaaattgccggcaatgattt 2521 ttcttctggttatctatttatgaagaaaaactgagaacagcattgtggtttctcctaacgtgtgtgtggtcggggtttgggtttggttcttgtcgttcgcagctgtctcctggcccctgc poly-A ------ 2641 aatgtctgtcctggtgccccagtgcttcgctcagacctctttgtaataaaactgctgaaaagtggcaaac 2720 |
B:
C:
Consensus XXXnXrXgXpXPKRjXShdffdfskXRXsfXXEXPXasXXpaXXXfGrXWsp HMGH_IRV6 IGVKPKDVTNVPKRNKSSYLFFCQEIRPSIVAEMPDIKPNQVMVHLGKKWSE HMG1_MOUSE TKKKFKD-PNAPKRPPSAFFLFCSEYRPKIKGEHPGLSIGDVAKKLGEMWNN ABF2_YEAST TQLRNELIKQGPKRPTSAYFLYLQDHRSQFVKENPTLRPAEISKIAGEKWQN RxHMG1 KKKKKKD-PNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDS Consensus XXfkXnrXYXXXXXrXXsrYXXXXXXXrXXXXPXXXXXXfXXfXrsXnXr HMGH_IRV6 LPLEDRKKYDVMAVEDRKRYLASKEANKKLNKPVKISGYLQFCADERKIK HMG1_MOUSE TAADDKQPYEKKAAKLKEKYEKDIAAYRAKGKPDAAKKGVVKAEKSKKKK ABF2_YEAST LEADIKEKYISERKKLYSEYQKAKKEFDEKLPPKKPAGPFIKYANEVRSQ RxHMG1 LGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSP--PDQGEAKNTQAN |
"Meta" residue Description Amino acids represented ------- -------------------- ------------------------- a I, L, V d aromatic F, W, Y f hydrophobic A, I, L, V, M, F, W, Y, C h small A, G, S j high turn propensity G, N, P k negative charge D, E n full positive charge K, R p D, E, N, Q q positive charge H, K, R r polar D, E, N, Q, H, K, R s hydrophilic D, E, N, Q, H, K, R, S, T X any residue all g gap none |