Figure 1 of Stenkamp, Mol Vis 2005; 11:833-845.


Figure 1. Translated amino acid sequence of chicken IRBP

The protein consists of four modules, each about 300 amino acid residues in length. The corresponding nucleotide sequence repeats are colored red, blue, dark purple, and turquoise. ATG and TAG start and termination codons are orange. The signal secretion sequence is blocked in green. Conserved tryptophan (W) residues with putative ligand-binding function [13] are purple. Glycosylation consensus sequences (N-X-S/T) are indicated in brown. An asterisk (*) indicates the position of the additional 82 amino acids in the jungle fowl IRBP protein. The nucleic acid sequence for chicken IRBP is available in GenBank (AY994153).

 -30 CAATTAACCCTCACTAAAGGGAGTCGACTCGATGAGAACATATTTCTTTCTGTTTTCTGT
                                     M  R  T  Y  F  F  L  F  S  V
  30 TCTGATCGTGTGCAGCATATCTGCTGAGGAAATCTTTCAGCCAACCCTTGTGCTGGACAT
      L  I  V  C  S  I  S  A  E  E  I  F  Q  P  T  L  V  L  D  M
  90 GGCTAAAGTACTACTGGATAACTACTGCTATCCTGAAAACCTAGTGGGAATGCAAGAAGC
      A  K  V  L  L  D  N  Y  C  Y  P  E  N  L  V  G  M  Q  E  A
 150 CATTGAGCAAGCCATCAAGAGTGGGGAGATCCTGGACATTTCTGATCCAAAAATGCTGGC
      I  E  Q  A  I  K  S  G  E  I  L  D  I  S  D  P  K  M  L  A
 210 CAATGTCTTGACAGCTGGAGTGCAGGGTGCTCTGAATGATCCAAGACTGGTAATCTCCTA
      N  V  L  T  A  G  V  Q  G  A  L  N  D  P  R  L  V  I  S  Y
 270 TGAGCCATCACTTCATGCAGCTCCTAAACAAGAAGCTGAAACTTACCCTACTCGGGAACA
      E  P  S  L  H  A  A  P  K  Q  E  A  E  T  Y  P  T  R  E  Q
 330 ACTCCTGAGTCTTATTGAACATGTAGTCATATATGACAAGCTGGAGGGAAATGTTGGCTA
      L  L  S  L  I  E  H  V  V  I  Y  D  K  L  E  G  N  V  G  Y
 390 CCTGAGGATTGATTATATCATAGGACAGGAAGTTGTGGAGAAAGTTGGAGCTTTTCTGGT
      L  R  I  D  Y  I  I  G  Q  E  V  V  E  K  V  G  A  F  L  V
 450 GGACAAGGTATGGAAGACATTGATAAATACATCTGCCTTGGTGATAGATCTTCGTTACAG
      D  K  V  W  K  T  L  I  N  T  S  A  L  V  I  D  L  R  Y  S
 510 CACTGGAGGACAGATCTCTGGAATTCCCTTCATTATCTCATATTTGCACGAAGCAGACAA
      T  G  G  Q  I  S  G  I  P  F  I  I  S  Y  L  H  E  A  D  K
 570 GATGCTACATGTTGAAACTGTATACAATCGGCCTTCCAACACCACTACAGAGATATGGAC
      M  L  H  V  E  T  V  Y  N  R  P  S  N  T  T  T  E  I  W  T
 630 ATTGCCAAAAGTGTTGGGAGAGAGATACAGCAAAGACAAAGATGTCATTGTTCTGATCAG
      L  P  K  V  L  G  E  R  Y  S  K  D  K  D  V  I  V  L  I  S
 690 TCATCACACCACAGGAGTGGCTGAAGATGTGGCTTACATCCTGAAGCATATGAACAGAGC
      H  H  T  T  G  V  A  E  D  V  A  Y  I  L  K  H  M  N  R  A
 750 TATCACTCTTGGGGAGAAGACAGCTGGGGGCTCGCTGGACATCCAGAAGCTACGTATTGG
      I  T  L  G  E  K  T  A  G  G  S  L  D  I  Q  K  L  R  I  G
 810 TCCCTCTAATTTCTATATGATGGTTCCTGTGTCGCGATCTGTCAGCCCCTTGAGTGGGGG
      P  S  N  F  Y  M  M  V  P  V  S  R  S  V  S  P  L  S  G  G
 870 TGGACAGAGCTGGGAGGTGAGTGGGGTGATGCCATGTGTGGCTTCTGAGGCAGAGCAAGC
      G  Q  S  W  E  V  S  G  V  M  P  C  V  A  S  E  A  E  Q  A
 930 CTTGAAGAAATCCTTGGACATCCTGGCAGTGCGCAGAGCAGTGCCTGGCACCCTAAGTCG
      L  K  K  S  L  D  I  L  A  V  R  R  A  V  P  G  T  L  S  R
 990 CCTCACAGACATACTAAAGGACTATTACAGCTTAGTGGAGCGAGTGCCAGTGCTGCTGAG
      L  T  D  I  L  K  D  Y  Y  S  L  V  E  R  V  P  V  L  L  R
1050 GCACCTCACTACCTCTGACTTCTCTTCTGTGCAGTCAGCGGAGGACCTGGCCACCAAGCT
      H  L  T  T  S  D  F  S  S  V  Q  S  A  E  D  L  A  T  K  L
1110 CAACACTGAGATGCAGACCTTGTCTGAGGACCCTCGCCTGTTGGTCCGCACCATGATGCC
      N  T  E  M  Q  T  L  S  E  D  P  R  L  L  V  R  T  M  M  P
1170 TGGTGAAGCTGCTGCCCCTCCTGCTGAGATGCCAATTGCAATGGCAGCCAATTTGCCTGA
      G  E  A  A  A  P  P  A  E  M  P  I  A  M  A  A  N  L  P  D
1230 TAATGAGCAATTGCTGCATGCCTTGGTGGATACTGTCTTCAAGGTGTCAGTGTTGCCAGG
      N  E  Q  L  L  H  A  L  V  D  T  V  F  K  V  S  V  L  P  G
1290 CAATGTGGGCTACATGCGCTTTGATGAGTTTGCTGATGCCTCTGTTCTTGTTAAGCTGGG
      N  V  G  Y  M  R  F  D  E  F  A  D  A  S  V  L  V  K  L  G
1350 ACCTTACATTGTAAAAAAAGTCTGGGAGCCCCTACAAAATACAGAGAACCTGATCATGGA
      P  Y  I  V  K  K  V  W  E  P  L  Q  N  T  E  N  L  I  M  D
1410 CCTACGTTACAACCCTGGTGGCCCTTCCTCCTCTGCTGTGCCTATGTTGATCTCTTACTT
      L  R  Y  N  P  G  G  P  S  S  S  A  V  P  M  L  I  S  Y  F
1470 CCAAGATCCTACTGCTGGCCCTGTCCATCTCTTCACAACCTACGACAGGCGTACCAACCA
      Q  D  P  T  A  G  P  V  H  L  F  T  T  Y  D  R  R  T  N  H
1530 TACACAAGAGCATAACAGCCAGGCAGAACTGCTGGCCCAGCCCTACGGAGCTCAGCGTGG
      T  Q  E  H  N  S  Q  A  E  L  L  A  Q  P  Y  G  A  Q  R  G
1590 CATCTATGTACTCACCAGCCGCCACACTGCTACAGCTGCTGAGGAGTTTGCTTACCTCAT
      I  Y  V  L  T  S  R  H  T  A  T  A  A  E  E  F  A  Y  L  M
1650 GCAATCACTTGGCCGTGCCACGCTGATTGGTGAGATCACAGCAGGTAGCCTCTCACATAC
      Q  S  L  G  R  A  T  L  I  G  E  I  T  A  G  S  L  S  H  T
1710 CTGTACCTTCCCTCTCGTGCAGCCTGAGCAAGGAATAACTCGTGGCCTAACCATCACGGT
      C  T  F  P  L  V  Q  P  E  Q  G  I  T  R  G  L  T  I  T  V
1770 CCCAGTTATCACCTTCATTGACAACCATGGGGAAAGCTGGATGGGTGGAGGTGTTGTGCC
      P  V  I  T  F  I  D  N  H  G  E  S  W  M  G  G  G  V  V  P
1830 TGATGCCATAGTGTTGGCAGAGGATGCACTGGAGAAGGCGGAAGAGGTGCTAACTTTTCA
      D  A  I  V  L  A  E  D  A  L  E  K  A  E  E  V  L  T  F  H
1890 CAGGAAAATGGGGATACTTTTGGAGAGTACTGGGCAGCTTCTAGAGGCTCACTATGCCAT
      R  K  M  G  I  L  L  E  S  T  G  Q  L  L  E  A  H  Y  A  I
1950 CCCAGAAGTGGCTGAAAAGGCCAGTGTTATGCTCAGCACCAAACGAGTTCAAGGAGGTTA
      P  E  V  A  E  K  A  S  V  M  L  S  T  K  R  V  Q  G  G  Y
2010 TCGATCAGCTGTAGACTTTGAGACGTTGGCTTCCCAGCTTACCAGTGACTTGCAGGAGGC
      R  S  A  V  D  F  E  T  L  A  S  Q  L  T  S  D  L  Q  E  A
2070 ATCAGGGGATCATCGGCTTCATGTTTTCCACAGCCATGTGGAACCAACACCAGAAGAACA
      S  G  D  H  R  L  H  V  F  H  S  H  V  E  P  T  P  E  E  Q
2130 GCTTCCCAACATGATTCCCAGCCCTGAGGAACTCAGCTATATCATTGAGGCACTCTTCAA
      L  P  N  M  I  P  S  P  E  E  L  S  Y  I  I  E  A  L  F  K
2190 AATCGAGGTATTGCCAGGCAACCTGGGATACCTTCGCTTTGATATGATGGCTGAGGCAGA
      I  E  V  L  P  G  N  L  G  Y  L  R  F  D  M  M  A  E  A  E
2250 AACTGTAAAAGCAATTGGACCTCAGCTGGTGCAGATGGTCTGGAACAAGCTGGTTGACAC
      T  V  K  A  I  G  P  Q  L  V  Q  M  V  W  N  K  L  V  D  T
2310 AGATGCCATGATTATTGACATGAGATATAATACAGGTGGCTACTCTACTGCTGTCCCAAT
      D  A  M  I  I  D  M  R  Y  N  T  G  G  Y  S  T  A  V  P  I
2370 ACTTTGTTCCTATTTCTTTGAGCCTGAACCTCGTCAACACCTCTACACTGTCTTTGATCG
      L  C  S  Y  F  F  E  P  E  P  R  Q  H  L  Y  T  V  F  D  R
2430 TAGTACCTCCCGCAGCACAGAGGTGTGGACTCTCCCCAAGGTTACTGGCAAGAGATATGG
      S  T  S  R  S  T  E  V  W  T  L  P  K  V  T  G  K  R  Y  G
2490 CTCCCTCAAGGACATCTACATCCTAACAAGCCATATGAGTGGCTCAGCAGCTGAAGCTTT
      S  L  K  D  I  Y  I  L  T  S  H  M  S  G  S  A  A  E  A  F
2550 CACTCGCTCTATGAAGGATCTACACCGTGCCACAGTTATTGGTGAGCCCACAGTAGGTGG
      T  R  S  M  K  D  L  H  R  A  T  V  I  G  E  P  T  V  G  G
2610 TTCCCTCTCAGTGGGTATATACCGAGTTGGCAACAGCTCCTTATATCGTTCCATCCCTAG
      S  L  S  V  G  I  Y  R  V  G  N  S  S  L  Y  R  S  I  P  S
2670 CCAAGTGGTGCTCAGCCCAGTCACTGGCAAAGTATGGAGTGTGTCTGGAGCAGAGCCACA
      Q  V  V  L  S  P  V  T  G  K  V  W  S  V  S  G  A  E  P  H
2730 TATCACCATCCAAGCCAGCGAAGCCTTGGCTGCAGCTAAGCACATTGCCAGCCTGCGTAC
      I  T  I  Q  A  S  E  A  L  A  A  A  K  H  I  A  S  L  R  T
2790 CCAGGTGCCACAGATAGTGCAAACTGTAGGTAAGCTTGTGGCAGAAAATTATGCTTTTGT
      Q  V  P  Q  I  V  Q  T  V  G  K  L  V  A  E  N  Y  A  F  V
2850 AGACATTGGGACTGATATTGCATCCAACCTCACCAAGAGTGTCAACAAAGAAAATTACAA
      D  I  G  T  D  I  A  S  N  L  T  K  S  V  N  K  E  N  Y  K
2910 AAGGATTAATTCAGAAAAGGAGCTGGCCAGGAAGTTGACTGCAATCTTGCAAGCTCTTTC
      R  I  N  S  E  K  E  L  A  R  K  L  T  A  I  L  Q  A  L  S
2970 TGATGATGAACACTTGAAAATACTCTACATCCCTGAACATGCCAAAGACAGCATTCCAGG
      D  D  E  H  L  K  I  L  Y  I  P  E  H  A  K  D  S  I  P  G
3030 GATTTTGCCAAAACAGATCCCTTCCCCAGAAGTTTTTGAAGATCTGATTAAATTTTCATT
      I  L  P  K  Q  I  P  S  P  E  V  F  E  D  L  I  K  F  S  F
3090 CCACACAAACGTATTTGAAAACAACATCGGCTATCTGAGATTTGATATGTTTGGAGACTG
      H  T  N  V  F  E  N  N  I  G  Y  L  R  F  D  M  F  G  D  C
3150 TGAACTTCTAACCCAGGTGTCTGATCTACTGGTAGAGCATGTTTGGAAGAAAATTGTTCA
      E  L  L  T  Q  V  S  D  L  L  V  E  H  V  W  K  K  I  V  H
3210 CACAGATGCATTAATCATAGACATGAGGTACAATATTGGAGGTTACACCAATTCCATACC
      T  D  A  L  I  I  D  M  R  Y  N  I *G  G  Y  T  N  S  I  P
3270 AATCTTATGCTCATATTTCTTCGATGAAGGACATCAAGTTCTACTGGACAAAGTTTATGA
      I  L  C  S  Y  F  F  D  E  G  H  Q  V  L  L  D  K  V  Y  D
3330 CAGACCCAGTGACTCAGTAAAGGAAATATGGACCCAGCCACAACTCAGAGGGGAGAGGTA
      R  P  S  D  S  V  K  E  I  W  T  Q  P  Q  L  R  G  E  R  Y
3390 TGGCTCCCAGAAAGGACTGATAATCCTTACCAGTGCTGTGACGGCTGGGGCTGCTGAGGA
      G  S  Q  K  G  L  I  I  L  T  S  A  V  T  A  G  A  A  E  E
3450 GTTTGTCTTCATAATGAAGAGGTTGGGCAGAGCTCTGATCATTGGAGAACAGACCAGTGG
      F  V  F  I  M  K  R  L  G  R  A  L  I  I  G  E  Q  T  S  G
3510 TGGGTCCCATTCCCCACAGACATACCAAGTAGATGATACCAACTTCTACATCATCATCCC
      G  S  H  S  P  Q  T  Y  Q  V  D  D  T  N  F  Y  I  I  I  P
3570 CACTGCACGATCAGTCATCTCTGCAGAGAGCGCTTCTTGGGAAGGGAAAGGGGTGCCCCC
      T  A  R  S  V  I  S  A  E  S  A  S  W  E  G  K  G  V  P  P
3630 TCACATGGAAACACCAGCGGTAACAGCCCTCATCAAAGCAAAGGAGGTGCTCAGTGCTCA
      H  M  E  T  P  A  V  T  A  L  I  K  A  K  E  V  L  S  A  H
3690 TCTGCACAGCTCAAGATAGCCCAACAGGGACATGTGCTTC
      L  H  S  S  R

Stenkamp, Mol Vis 2005; 11:833-845 <http://www.molvis.org/molvis/v11/a99/>
©2005 Molecular Vision <http://www.molvis.org/molvis/>
ISSN 1090-0535