Molecular Vision 2016; 22:589-598
<http://www.molvis.org/molvis/v22/589>
Received 24 September 2015 |
Accepted 06 June 2016 |
Published 08 June 2016
Dan Li,1,2 Siying Wang,3 Hongfei Ye,2,4 Yating Tang,2,4 Xiaodi Qiu,2,4 Qi Fan,2,4 Xianfang Rong,2,4 Xin Liu,2,4 Yuhong Chen,2,4 Jin Yang,2,4 Yi Lu2,4
The first two authors contributed equally to this work.
1Research Center, Eye & ENT Hospital of Fudan University, Shanghai, China; 2Key Laboratory of myopia,Ministry of Health, Shanghai, China; 3Department of Ophthalmology, Shanghai Children’s Hospital, Shanghai Jiao Tong University; Shanghai,China; 4Department of Ophthalmology, Eye & ENT Hospital of Fudan University, Shanghai, China
Correspondence to: Yi Lu, Department of Ophthalmology, Eye & ENT Hospital of Fudan University, 83 Fenyang Road, Shanghai, China, 200031; Phone: 86-21-64377134; FAX: 86-21-64377151; email: luyieent@126.com
Purpose: This study aimed to investigate the genetic effects underlying non-familial sporadic congenital cataract (SCC).
Methods: We collected DNA samples from 74 patients with SCC and 20 patients with traumatic cataract (TC) in an age-matched group and performed genomic sequencing of 61 lens-related genes with target region capture and next-generation sequencing (NGS). The suspected SCC variants were validated with MassARRAY and Sanger sequencing. DNA samples from 103 healthy subjects were used as additional controls in the confirmation examination.
Results: By filtering against common variants in public databases and those associated with TC cases, we identified 23 SCC-specific variants in 17 genes from 19 patients, which were predicted to be functional. These mutations were further confirmed by examination of the 103 healthy controls. Among the mutated genes, CRYBB3 had the highest mutation frequency with mutations detected four times in four patients, followed by EPHA2, NHS, and WDR36, the mutation of which were detected two times in two patients. We observed that the four patients with CRYBB3 mutations had three different cataract phenotypes.
Conclusions: From this study, we concluded the clinical and genetic heterogeneity of SCC. This is the first study to report broad spectrum genotyping for patients with SCC.
Congenital cataract (CC) is defined as lens opacity that is present at birth or occurs within the first year after birth; this condition is also called infantile cataract. CC is one of the most common causes of childhood blindness (approximately 200,000 children worldwide are blind from cataracts) [1] and might result in permanent blindness due to the irreversible damage to the responsible brain zone during visual development. Previous studies indicated that about one-third of CC cases are hereditary [2] and, in most cases, represent an autosomal dominant trait, although autosomal recessive and X-linked cases are also observed [3,4]. Meanwhile, the majority of the unilateral cases and approximately half of the bilateral cases are not due to familial history [5]; these cases are called sporadic cases and mostly occur due to unknown causes. Little attention has been paid to these non-familial cases in terms of broad spectrum genetic identification.
With an extensive genetic heterogeneity, cataract has been linked to more than 20 genes, the majority of which are located in crystalline and connexin (gap junction) genes [6,7], but the pool is continuously expanding. The most acknowledged cataract genes encode crystallins (CRYAA-Gene ID 1409, OMIM 123580, CRYAB-Gene ID 1410, OMIM 123590, CRYBA1-Gene ID 1411, OMIM 123610, CRYBB1-Gene ID 1414, OMIM 600929, CRYBB2-gene ID 1415, OMIM 123620, CRYBB3-Gene ID 1417, OMIM 123630, CRYGC-Gene ID 1420, OMIM 123680, and CRYGD-gene ID 1421, OMIM 123690), membrane transport proteins (GJA1-Gene ID 2697, OMIM 121014, GJA3 -Gene ID 2700, OMIM 121015, GJA8-Gene ID 2703, OMIM 600897, and MIP-Gene ID 4284, OMIM 154050), a cytoskeletal protein (BFSP2-Gene ID 8419, OMIM 603212), and developmental regulators (PITX3-Gene ID 5309, OMIM 602669, MAF-Gene ID 4094, OMIM 177075, and HSF4-Gene ID 3299, OMIM 602438) [8]. In the present study, we examined 61 lens-related genes in 74 SCC cases using target capture and next-generation sequencing (NGS). In addition to the identified CC genes described, we included other lens disease-related genes: microphthalmia (SOX2-Gene ID 6657, OMIM 184429, CYP1B1-Gene ID 1545, OMIM 601771, RAX-Gene ID 30062, OMIM 601881, FOXE3-Gene ID 2301, OMIM 601094, VSX2-Gene ID 338917, OMIM 142993, and CRYBA4-Gene ID 1413, OMIM 123631) [9,10], aniridia (FOXC1-Gene ID 2296, OMIM 601090, PITX2-Gene ID 5308, OMIM 601542, and PITX3) [11-13], Marfan syndrome (FBN1-Gene ID 2200, OMIM 134797) [14], ectopia lentis (ADAMTSL4-Gene ID 54507, OMIM 610113) and ADAMTS17-Gene ID 170691, OMIM 607511) [15], Alport syndrome (COL4A5-Gene ID 1287, OMIM 303630) [16], glaucoma (CYP1B1-Gene ID 1545, OMIM 601771, MYOC-Gene ID 4653, OMIM 601652, OPTN-Gene ID 10133, OMIM 602432, and WDR36-Gene ID 134430, OMIM 609669) [17], Nance-Horan syndrome (NHS-Gene ID 4810, OMIM 300457) [18], and exfoliation syndrome (LOXL1-Gene ID 4016, OMIM 153456) [19]. The total gene list is provided in Appendix 1. The aim of this study was to investigate what proportion of the SCC cases had a genetic basis and the distribution of these tested genes with functional mutations. We showed that in 19 out of 74 patients with SCC, genetic mutations were detected in the lens-related genes, and CRYBB3 appeared to have the highest mutation frequency. Here, we provide the first report on the mutation distribution and frequency of known genes from SCC cases in a Han Chinese population.
Informed consent in written form to collect the blood sample and to publish case details was obtained from the parents of the patients. All procedures were reviewed and approved by the ethics committee of Eye & ENT Hospital of Fudan University. The study was adhered to ARVO statement on human subjects and the tenets of the Declaration of Helsinki. A total of 74 patients from the pediatric congenital cataract clinic were diagnosed with SCC with no familial history (i.e., lens opacity occurred at birth or within the first year after birth and no family member was diagnosed with congenital cataract within three generations; in addition, the lens of the parents were diagnosed as normal at the time of surgery) between September 2012 and April 2013. During the same period, 20 patients from the pediatric congenital cataract clinic were diagnosed with traumatic cataract (TC). Meticulous clinical examinations were performed by experienced specialists. All patients were Han Chinese from 11 provinces in China. To confirm the identified SCC variants, DNA samples from 103 healthy controls were obtained from Shanghai Children’s Hospital of Shanghai Jiao Tong University and used as additional controls.
A total of 1 to 2 ml peripheral blood was collected from patients and controls in EDTA blood collection tubes and stored in -80 °C prior to use. Genomic DNA was extracted from the whole blood cells using a genomic DNA extraction and purification kit (FlexiGene DNA kit #51206, QiagenChina, Shenzhen, China) according to the manufacturer’s protocol. The DNA concentration was measured using a NanoDrop 2000 (Thermo Fisher Scientific, Waltham, MA), and 100 ng DNA was used for molecular analysis.
Capture of the target DNA region was completed using the Agilent SureSelect DNA Design (Agilent Technologies, Santa Clara, CA). Each captured library was loaded on the HiSeq 2000 platform (Illumina, Santa Clara, CA); high-throughput sequencing for each captured library was performed independently to ensure that each sample met the desired average fold-coverage. The paired raw data were deposited in the NCBI Sequencing Read Archive under accession number SRP062799. The bioinformatics analysis began from the sequencing data (raw data) that had been generated from the Illumina pipeline. First, the adaptor sequence in the raw data was removed, and low-quality reads that had too many Ns or low base quality were discarded. This step produced clean data. Second, Burrows-Wheeler Aligner (BWA) was used to do the alignment. BWA gives results in BAM format files. Single nucleotide polymorphisms (SNPs) were investigated with SOAPsnp SAMtools or GATK. Then the software AnnoDB was used in-house to annotate the confident variant results. The SNPs with 5X or more coverage and insertions or deletions were annotated to genes using Ensembl release version 73. The minimum thresholds were set at 50X/45 mqV. Additional annotation was provided from Online NCBI dbSNP, as well as population frequencies from the 1000 Genomes Project (including the 1000 Genomes All, Asian, African, American, and European categories). Then we filtered the resulting SNPs in the following steps: 1) exclude database SNPs with minor allele frequency (MAF) equal to or greater than 0.01 in the 1000 Genome database, 2) exclude the variants that existed in the TC controls, 3) exclude the silent or synonymous variants without altering splicing sites in genes, and 4) exclude the missense variants with low effects in SIFT and Polyphen-2 predictions. We obtained 32 SCC-specific variants, 27 of which were novel (not deposited in dbSNP and not described in published reports). All of the novel SCC SNPs were examined in each patient and healthy controls with Sequenom MassARRAY genotyping [20]; Sanger sequencing was performed instead for the samples that failed in the MassARRAY. (The primers are listed in Appendix 2) The variants listed in the dbSNP were confirmed with the TaqMan SNP genotyping assay kit (Thermo Fisher Scientific, Carlsbad, CA; the product number of each SNP is listed in Appendix 2). SIFT [21] and PolyPhen [22] analysis were performed to predict whether the identified SNPs would affect the protein structure and function.
A total of 52.7% of the 74 patients with SCC were male and 47.3% were female; the average age was 4.6 years (range 0 to 11 years). A total of 43 patients were diagnosed with bilateral congenital cataract (58.1%), and 31 patients were diagnosed with unilateral congenital cataract (41.9%); this incidence was considerably higher than the rate observed in family hereditary cataract cases (only approximately 6% were unilateral cases in our database; data not shown). The 20 control subjects diagnosed with traumatic lens opacity included 13 boys and 7 girls, with an average age of 6.1 years (range from 3 to 11 years); all of the control subjects were diagnosed with unilateral cataract. Detailed clinical observations are presented in Table 1.
The average mean read depth for the target region (whole sequences from the 61 genes, including introns and exons) was 152.7X; moreover, 93.7% of the target region achieved 10X coverage. By filtering the variants following the four steps described in the Methods section, we identified 32 SCC-specific variants, all of which were heterozygous except one homozygous nonsense mutation in the NHS gene. Additional sequencing analysis of the 103 healthy controls confirmed that the 32 variants did not exist in the controls. In addition, we searched each variant in the ExAC East Asian samples database. None of the 32 SCC variants had an allele frequency of more than 0.1%: 16 were not available (NA), four were 0, and 12 were <0.1%. However, nine of the heterozygous variants were from six genes (GALK1, SIL1, ADAMTSL4, ADAMTS17, VSX2, and LIM2) responsible for an autosomal recessive pattern of cataract, which should not be considered causative mutation. However, we listed these variants separately (Appendix 3) because they still could be risk sites for CC as none existed in the TC and healthy controls. As a result, 23 predicted functional SCC mutations remained in the main list (Table 2), including 16 missense, five nonsense, one spicing site acceptor, and one splicing site donor mutations. These mutations were located in 17 genes related to lens disease; the mutation frequencies (the number of detected times) of these genes were analyzed and are presented in Figure 1A. CRYBB3 was the most mutated gene in the samples, followed by EPHA2, NHS, and WDR36. The six recessive genes were also analyzed, as shown in Figure 1B. GALK1 appeared to be the most frequently mutated gene (six mutations in six patients) among the recessive genes.
Compared to familial CC, the genetic effect is ambiguous in SCC cases. As displayed in Table 3, in the 19 patients with identified SCC variants, five were found to have mutations in more than one gene. These complexities suggested combined effects of multiple variants acting together. For example, patient 3, who presented with bilateral total CC, was found to have one missense mutation in EPHA2 and one splice site acceptor mutation in CRYGC; patient 51, who presented with bilateral nuclear CC, had missense mutations in CRYBB3 and BFSP2. Notably, with the only detected homozygous mutation, patient 59, who presented with bilateral nuclear CC, was found to have a homozygous mutation c.1693C>T, p.R565* in NHS located in chromosome X. Based on this result, we suspected this patient had Nance-Horan syndrome, a rare X-linked syndrome characterized by congenital cataract and dental anomalies [18]. Although we found dental anomalies in this patient (Appendix 4), we were not confident to attribute this feature to the syndrome because the patient was growing permanent teeth when examined. The patient was intellectually normal and did not appear with other obvious dysmorphic features. By comparing the patient’s DNA sequences with the parents’ DNA sequences at the corresponding sites, we found that the homozygous c.1693C>T, p.R565* in X-linked NHS was inherited from the mother (Figure 2). Both parents had normal transparent lenses. The mother’s cornea diameter was 11 mm and the ocular axis was 24.69 mm (D), 24.50 mm (S) on an ultrasound A-scan, both of which were within normal range. This case is an example indicating that genetic effects could be underestimated in sporadic cases when the patient presents a recessively inherited mutation (X-linked recessive in this case). Segregation analysis could not be performed on the other patients because the parents’ DNA samples were not available.
Six cataract subtypes (classified as total, punctate, nuclear, posterior polar, anterior polar, and cortical subtypes) associated with genes that have been mutated at least two times were listed in Table 4. The four patients with CRYBB3 mutations presented with three cataract subtypes, indicating the phenotype heterogeneity of SCC. Of note, patient 10 and patient 51 shared a mutation site, c.466G>A, p.G156R; both presented with bilateral nuclear cataract.
In the pediatric cataract clinic in our tertiary care hospital, we found that SCC cases account for 71% of total congenital cataract cases (data not shown), which presented with an unclear etiology without familial history. Genetic studies have revealed numerous mutations in diverse genes associated with congenital cataracts; these studies most frequently used whole exome sequencing followed by linkage analysis within family members [23-28] or screened only one specific gene in a larger group [29]. In a previous study conducted in an Indian population (100 congenital cataract cases), the exon sequencing of 14 genes (ten of which were crystalline genes) identified 18 variants, 14 of which were from crystalline genes [6]. In the present study, we extended the genetic analysis to 61 genes in 74 cases (the familial factor was excluded) using target capture and NGS. We found 32 SCC-specific SNPs in 29 patients, 31 of which were heterozygous. After nine variants located in genes responsible for recessive cataracts were removed, 23 mutations in 17 genes from 19 patients were predicted to be functional. Among the mutated genes, CRYBB3 appeared with the highest mutation frequency, followed by 16 genes that have been mutated at least once, suggesting a broad spectrum of genes could contribute to SCC (Figure 1A). The mutation pick-up rate in this study was 26% (19 out of 74), which was much lower than in a recent study of general CC cases in the United Kingdom (not only SCC) that was 75% (27 out of 35 patients) using a similar method [30]. This result suggested that a genetic factor has much less of an effect in sporadic cases. Notably, putative functional mutations in more than one gene were found in five patients, suggesting a combined genetic effect for SCC. However, we did not obtain the DNA samples from the parents of each patient (except patient 59), and thus, we could not confirm through segregation study whether these SNPs are disease-causing.
As the most mutated gene, CRYBB3 (encoding crystalline beta B3) was mutated four times in patients with SCC in the present results. The mutation in CRYBB3 has been reported in autosomal dominant [28] and autosomal recessive cataract [31]. The four mutations in CRYBB3 presented no significant association with any specific subtype of cataract phenotype (Table 4).
We propose that examining the whole genomic sequence of the selected genes rather than screening known SNPs represents an efficient method for discovering susceptible SNPs involved in SCC, because the genomic variants in most non-familial cases differ from previously described variants. The majority of the identified SCC-specific SNPs in the present results have not been reported. Compared to whole exome sequencing, this target region sequencing method is a better option for exploring the SNPs in sporadic cases because it is more economical and the results are easier to analyze (considering the sample size and the fact that these subjects are not related). Additionally, this method successfully identified cataract patient-specific mutations in three families when applied to genomic studies of familial cataracts (data not shown). However, novel genes cannot be revealed using this sequencing strategy. Taken together, our study may help enrich the knowledge of the etiology of congenital cataract, especially SCC.
Appendix 1. List of the 61 selected genes and related lens diseases.
Appendix 2. Primers used for Sanger sequencing and the product number of Taqman SNP genotyping.
Appendix 3. Summary of 9 SCC-specific mutations in recessive genes.
This work was supported by the National Natural Science Foundation of China, grant number 81200668 and 81270989. The samples used for the analyses described in this manuscript were obtained from the EENT Biobank; we would like to thank all the participants and the staffs for their valuable contribution to this research. We thank Dr. Jihong Wu for her precious advice. Dr. Jin Yang (jin_er76@hotmail.com) and Yi Lu (luyieent@126.com) are co-corresponding authors of this work.