Figure 5 of
Vazquez-Chona, Mol Vis 2005;
Figure 5. Evaluating candidate genes within Chromosome 12 locus
A: The transcript variability of Id2, Crem, Egr1, Fos, Nr4a1, Junb, Ptpn16, and Icam1 in brains from BXD RI mouse strains is genetically linked to the regulatory locus on Chromosome 12. This network was generated by comparing genome-wide scans that measure the linkage of transcript variability across the mouse genome, as described in Figure 1. These transcripts are also upregulated as acute phase transcripts in retina, brain and spinal cord (Table 2). Transcripts genetically linked to the same regulatory loci and highly regulated in injured CNS are hypothesized to be part of a genetic network. B: The combined genome-wide map shows that their transcript expression has a strong genetic linkage to a locus within Chromosome 12, 10 to 30 Mb. The genome wide scan, zoomed at Chromosome 12, shows the genetic linkage (y-axis, likelihood ratio statistic [LRS]) across the chromosome (x-axis, Megabase pairs [Mb]). This locus includes over 60 positional genes, some of which have a high or low single nucleotide polymorphism (SNP) density. SNP density is denoted by the height of the orange lines on the x-axis. Several rules may help in selecting candidate gene responsible for this regulatory locus. The first rule is there have to be SNPs present within the coding or regulatory region of the gene of interest. As a first approximation we used SNP density that is displayed at the bottom of panel B and D. The second rule is a high degree of transcript abundance variability in the BXD RI strains (illustrated in C). C: Transcript abundance variability in normal forebrains of BXD RI mouse strains is due to genetic polymorphisms between the parental C57BL/6 and DBA/2J mouse strains. The graph illustrates transcript abundance variability (y-axis) for genes (dots) within the 10 to 30 Mb interval of Chromosome 12 (x-axis). For a transcript to be mapped its abundance must vary across the BXD RI mouse strains. The higher the variation in transcript abundance the more likely that the gene is a candidate. Furthermore we expect that the variability is due to a polymorphism in the candidate gene, suggesting that the polymorphic gene is modulating its own expression level (that is, a "cis-regulatory locus"). We measured transcript variability using an analysis of variance (ANOVA) that tests the between-strain variance compared with the total variance for 100 arrays from 35 mouse strains. The degrees of freedom for the between-group and total variance are 34 and 99. Strain-specific variation is significant (p<0.05) when F34,99>1.5. In the graph, each dot represents a gene with its variability measured by the ANOVA F-statistic (y-axis) and its genomic location within Chromosome 12 described in Mb. Lpin1 and Id2 are polymorphic genes that displayed significant transcript variability and cis-regulatory loci in normal forebrains of BXD mouse strains. D: The third criterion identifies the genes that display self regulation across CNS tissues. For example, the transcript variability of Id2 (red circle in C) displayed a cis-regulatory locus in the brain (LRS=44, probe set 93013_at) and cerebellum (LRS=14; 1453596_at_A) of BXD RI mouse strains. Here we illustrate linkage of Id2 transcript variability in BXD RI mouse strains across Chromosome 12. Red triangle indicates gene location. Lpin1 also displayed cis-regulatory loci in brain and cerebellum (data available at GeneNetwork). E: The structure of the Id2 gene illustrates a SNP at the promoter region and four SNPs within the second intron. F: The SNP within the promoter region (Ensembl SNPView ID rs4229289 and Celera SNP ID mC22302957) is located within a highly conserved region and is adjacent to a nuclear transcription factor Y (NF-Y) binding site (TRANSFAC ID M00185). Gene structure was obtained from Ensembl and GenomeBrowser. The TRANSFAC 5.0 database was accessed through the MOTIF website. Highly conserved regions were defined using the GenomeBrowser Conservation tool (Mouse May 2004 Assembly).
DBA2 GGTTCTTCCGCGTTCGAGCCAATACC B6 GGTTCTTCCGCGTTCGAGCCAATGCC Rn. GGTTCTTCCGCGTTCGAGCCAATGCC Hs. GGTTCTTCCGCGTTCGAGCCAATCCC NF-Y binding site GCCAATC GCCAATG