Figure 2 of
Lin, Mol Vis 3:17, 1997.
Figure 2. The amino acid sequences of the IRBP and variants.
REPEAT 1 REGION: The top sequence shows the amino acid sequence of
wild type human IRBP as deduced from cDNA and genomic nucleotide
sequences. Standard single letter codes are used to represent amino
acids. The X represents an amino acid that could not be identified. The
deduced sequence includes a signal peptide typical of secretory proteins
at positions -22 to -6. This is followed by a propeptide, GPTHL, of five
amino acids, from -5 to -1. In human IRBP derived from cadaver eyes
about 50% of the protein begins with the glycine (G, position -5) of the
propeptide (68). The remainder begins with phenylalanine (F, position +1
). When wild type IRBP is first synthesized and secreted it appears that
the protein is the full length form including the propeptide as shown
here and starts at position -5. All the deletion variants begin at the
same point, suggesting that they are similarly processed and secreted.
The exception is R1, which still contains its signal peptide. How it
avoids cotranslational processing that normally removes the signal
peptide is unknown, but the mass of protein being produced may simply
overwhelm the insect cell's signal peptidase. The E. coli expressed
Repeat 1 protein, EcR1, yielded an amino acid sequence matching that
expected from the DNA sequence of the clone, including a seven amino
acid extension (underlined) before the beginning of Repeat 1. REPEAT 2
REGION: The E. coli expressed protein, EcR2, contains a short N-terminal
extension of 17 amino acids (underlined) derived from the vector,
followed by the IRBP repeat 2 sequence. The sequence shown here matches
the sequence deduced from the DNA construct exactly. For the Repeat 3
and 4 regions, EcR3 and EcR4 have amino acid sequences identical to the
expected seven amino acids encoded by the vector (underlined) followed
exactly by the deduced sequences from each repeat. This figure shows
that all the proteins encode IRBP by their identity with the deduced
amino acid sequences from the cDNA and gene, and by similarity with
amino acid sequences of IRBP from other species (68, 69).
REPEAT 1 REGION:
-22 -5 +1
| | |
MMREWVLLMSVLLCGLAGPTHLFQPSV... SEQUENCE DEDUCED FROM THE GENE AND cDNA
MREXVLLM SEQUENCE OF R1
GPTH SEQUENCE OF WT
GPTXLFQP SEQUENCE OF R12-
GPTHLFQP SEQUENCE OF R12+ UPPER BAND
GPTHLFQPSV... SEQUENCE OF R12+ MIDDLE BAND
GPTHLFQPSV... SEQUENCE OF R12+ LOWER BAND
GPTHL.... SEQUENCE OF R123 (Predominant)
FQPSV... SEQUENCE OF R123 (Minor)
SEQUENCE OF G719S
SEQUENCE OF R725C
MVPSSDPGPTHLFQ SEQUENCE OF EcR1
|
REPEAT 2 REGION:
RSALPGV... SEQUENCE DEDUCED FROM THE GENE AND cDNA
MVPSSDPLVTAASVLEFRSALPGV SEQUENCE OF EcR2
|
REPEAT 3 REGION:
QSL... SEQUENCE DEDUCED FROM THE GENE and cDNA
MVPSSDPQSL SEQUENCE OF EcR3
|
REPEAT 4 REGION:
AKVPT... SEQUENCE DEDUCED FROM THE GENE AND cDNA
MVPSSDPAKVPT SEQUENCE OF EcR4
|
Lin, Mol Vis 1997; 3:17
<http://www.emory.edu/molvis/v3/lin>
©1997 Molecular Vision
ISSN 1090-0535