Comprehensive analysis of candidate genes for photosensitivity using a complementary bioinformatic and experimental approach
Address correspondence to Sarah von Spiczak, MD, Department of Neuropediatrics, University Medical Centre Schleswig-Holstein, Arnold-Heller-Str. 3, Building 9, 24105 Kiel, Germany. E-mail: email@example.com
Photoparoxysmal response (PPR) is a highly heritable electroencephalographic trait characterized by an increased sensitivity to photic stimulation. It may serve as an endophenotype for idiopathic generalized epilepsy. Family linkage studies identified susceptibility loci for PPR on chromosomes 5q35.3, 8q21.13, and 16p13.3. This study aimed to identify key candidate genes within these loci. We used bioinformatics tools for gene prioritization integrating information on biologic function, sequence data, gene expression, and others. The prime candidate gene from this analysis was sequenced in 48 photopositive probands. Presumed functional implications of identified polymorphisms were investigated using bioinformatics methods. The glutamate receptor subunit gene GRIN2A was identified as a prime candidate gene. Sequence analysis revealed various new polymorphisms. None of the identified variants was predicted to be functionally relevant. We objectified the selection of candidate genes for PPR without an a priori hypothesis. Particularly among the various ion channel genes in the linkage regions, GRIN2A was identified as the prime candidate gene. GRIN2A mutations have recently been identified in various epilepsies. Even though our mutation analysis failed to demonstrate direct involvement of GRIN2A in photosensitivity, in silico gene prioritization may provide a useful tool for the identification of candidate genes within large genomic regions.
Despite efforts to unravel the genetics of idiopathic (genetic) generalized epilepsies (IGEs), success in identifying genetic markers for IGEs has been limited. Using photosensitivity (photoparoxysmal response, PPR) as an endophenotype has been advocated to reduce phenotypic and genetic heterogeneity and complexity (Helbig et al., 2008).
Photosensitivity is characterized by an abnormal visual sensitivity of the brain to photic stimulation and high incidences in patients with IGE. Recently, a meta-analysis of several published linkage studies and additional families revealed suggestive nonparametric linkage for three loci on chromosomes 5q35.3, 8q21.13, and 16p13.3 (De Kovel et al., 2010).
In total, 450 genes are located within these three linkage regions. Identifying key candidate genes is difficult when applying experimental methods alone such as traditional sequence analysis. Selection of candidate genes is fraught with personal bias based on the necessity of an a priori hypothesis.
Within this study, we explore a combined approach of candidate gene selection and analysis using both bioinformatics tools and candidate gene sequencing.
The study protocol was approved by the local ethics committee, and all participants or their parents/legal guardians, respectively, gave written informed consent. The study outline is demonstrated in Figure S1. Detailed information and relevant literature on all bioinformatic programs are presented as Supporting Information.
Computational prioritization of candidate genes
Four web-based, freely available bioinformatics tools were selected in order to use complementary methods covering different data sources as input.
The chromosomal locations for the linkage regions published by De Kovel et al. (2010)±5 Mb (5q35.3 = chr5:171.600.001-185.915.260; 8q21.13 = chr8:75.100.001-89.600.000; 16p13.3 = chr16:1-12.900.000, NCBIBuild37) were used as input for the different programs. For Endeavour and PROSPECTR and SUSPECT, where a training set of genes is needed, epilepsy genes listed in OMIM (Online Mendelian Inheritance in Man, search term “epilepsy”) were used.
The study cohort comprised 48 probands with PPR type II–IV (Waltz et al., 1992) recruited at the Department of Neuropediatrics, University Hospital Schleswig-Holstein (Kiel, Germany). Patients were diagnosed with IGEs or were without history of seizures.
DNA from individual blood samples was extracted using commercially available kits.
Mutation analysis of GRIN2A including all exons, exon–intron boundaries, and the promoter region was performed using the NCBI Primer Set RSS000057426.1 and additionally designed primers (Primer3, http://www.primer3.sourceforge.net, Table S1). Amplification by polymerase chain reaction (PCR) and bidirectional sequencing was performed following standard protocols. Polymorphisms and InDels were identified by NovoSNP 3.0 (Weckx et al., 2005).
In silico analysis to predict functional impact of polymorphisms
To assess possible functional implications of newly identified single nucleotide polymorphisms (SNPs), we used a complimentary set of bioinformatics tools. Default parameters were used for all programs.
Nonsynonymous coding polymorphisms:
Analysis of potential affection of transcription factor binding sites (TFBS) by polymorphisms within the promoter region:
Follow-up of variant chr16:g.10277068G>A
The polymorphism chr16:g.10277068G>A was investigated in a control cohort of 358 healthy blood donors of reported German descent, who were negative for neurologic and psychiatric diseases as screened by standardized questionnaires (Popgen cohort, http://www.popgen.de), using a custom-made TaqMan™ SNP assay (Applied Biosystems, Carlsbad, CA, U.S.A.).
Computational prioritization of candidate genes
The top 50 results generated by each program were compared for genes overlapping between at least three of the four programs. Detailed results are demonstrated in Table 1. Because of its biologic function as a subunit of the ionotropic glutamate receptor and recent reports of involvement in epilepsy (Endele et al., 2010; Reutlinger et al., 2010), GRIN2A was chosen for subsequent analysis by mutation screening.
Table 1. Results of candidate gene prioritization
|DDX41||DEAD (Asp-Glu-Ala-Asp) box polypeptide 41||Chr. 5q35.3:|| ||x||x||x|
|B4GALT7||Xylosylprotein beta 1,4-galactosyltransferase, polypeptide 7 (galactosyltransferase I)||Chr. 5q35.3:||x||x||x||x|
|CLCN7||Chloride channel 7||Chr. 16p13.3:||x||x||x||x|
|PKD1||Polycystic kidney disease 1 (autosomal dominant); transient receptor potential cation channel, subfamily P, member 1||Chr. 16p13.3:||x|| ||x||x|
|ABCA3||ATP-binding cassette, subfamily A (ABC1), member 3||Chr. 16p13.3:||x|| ||x||x|
|GRIN2A||Glutamate receptor, ionotropic, N-methyl-d-aspartate 2A||Chr. 16p13.3:||x|| ||x||x|
Details on diagnoses are given in Table S2.
Sequencing revealed seven previously unknown single nucleotide polymorphisms. Apart from one polymorphism within the promoter region (chr16:g.10277263G>A, NCBIBuild37), which was present in the heterozygous state in 8/48 probands (16.7%) and in the homozygous state in one proband (2.1%), all SNPs were found in the heterozygous state in a single patient each.
In silico analysis to predict functional impact of polymorphisms and follow-up of variant chr16:g.10277068G>A
Detailed results of the in silico analysis are demonstrated in Tables 2–4. For polymorphism chr16:g.10277068G>A, a new binding site for transcription factor MZF1 was predicted. Investigation of this variant in controls revealed a heterozygous status in 3 of 358 probands equivalent to a minor allele frequency of 0.4%. None of the other polymorphisms identified was rated as functionally relevant by at least two bioinformatics tools.
Table 2. In silico analysis of the nonsynonymous coding SNP g.9858211A>G
Table 3. In silico analysis of intronic noncoding SNPs
|5′UTR, exon 2||g.10275749||C>A||–||No difference||No difference|
|Intron 9/10||g.9923799||A>C||–||No difference||ASS 82.02||ASS 53.07a|
|DSS 80.32||DSS 75.57|
|Intron 13/14||g.9858894||T>C||–||No difference||No difference|
Table 4. In silico analysis of promoter SNPs
|5′UTR, promoter||g.10277263||G>A||–||TFBS not changed||New site for EEF2||0.863|
|5′UTR, promoter||g.10277068||G>A||–||New site for MZF1||94.8||New site for MZF1||0.991|
|Site lost for WT1||0.943|
|Site lost for XCPE1||0.801|
|5′UTR, promoter||g.10276998||T>C||–||TFBS not changed||Site lost for BCL6||0.76|
Within the present study we attempted candidate gene identification by using various complementary bioinformatics tools for candidate gene analysis in combination with traditional sequencing techniques.
Photosensitivity can be used as an endophenotype for idiopathic generalized epilepsies (Helbig et al., 2008). Endophenotypes have been used to investigate the genetic background of common complex diseases, assuming that the genetic basis of the endophenotype is less complex than the genetic basis of the disease in question. This concept has been applied to psychiatric disorders (Cannon & Keller, 2006) and neurologic syndromes (Stefansson et al., 2007).
A recent analysis of whole-genome linkage data for photosensitivity combining previous linkage studies and the analysis of additional families revealed several linkage peaks (De Kovel et al., 2010).
Former candidate genes studies on photosensitivity failed to demonstrate reproducible major effects of the investigated genes. The genes investigated were chosen due to their biologic functions (e.g., Von Spiczak et al., 2010) and presumed involvement in idiopathic epilepsies (e.g., Lorenz et al., 2006).
Traditional candidate gene selection identifies genes based on known biologic functions and considerations of possible involvement in pathophysiologic processes. Accordingly, this method is highly subjective (Zhu & Zhao, 2007). Systematically screening the available information is beyond the possibilities of individual researchers. Screening all genes located within a given linkage region is time consuming and cost-intensive. Although recent technical advances such as next-generation sequencing techniques are likely to overcome these problems, these methods are expensive and not yet available for most researchers. Accordingly, alternative approaches for candidate gene selection are necessary.
Several online bioinformatics tools have been developed to facilitate candidate gene prioritization (Tranchevent et al., 2010). These programs differ with respect to data sources (gene ontology, gene function, sequence data, protein–protein interactions, information on gene expression, and others), computational methods, and prioritization algorithms. The basic approach of selecting candidate genes for follow-up studies by computational prioritization is, therefore, to some degree similar to proceedings of researchers. However, the magnitude of data analyzed for prioritization is far more comprehensive than what is analyzable for researchers. The programs used within our study were selected based on the complementarity of data sources used for gene prioritization, aiming to avoid bias toward specific aspects and data categories and to increase overall reliability.
In our study, gene prioritization revealed six genes as prime candidates for photosensitivity identified by at least three of the four bioinformatics tools. Of these, GRIN2A is coding for the 2A subunit of the ionotropic (N-methyl-d-aspartate , NMDA) glutamate receptor. NMDA receptors (NMDARs) are critically involved in excitatory synaptic transmission, plasticity, and excitotoxicity in the central nervous system. Recently, involvement of GRIN2A in human epilepsies was suggested (Endele et al., 2010; Reutlinger et al., 2010). Given the functional implications of NMDARs and the involvement of GRIN2A in epilepsy, the identification of GRIN2A by candidate gene prioritization seems both plausible and reliable.
To assess the importance of GRIN2A in photosensitivity, we sequenced the gene in photosensitive probands. One promoter variation at chr16:g.10277068G>A was predicted to create a new binding site for transcription factor MZF-1 (myeloid zinc finger gene). Accordingly, this newly created MZF-1 binding site may impact on the expression of GRIN2A in the CNS. The polymorphism was found to be present at a minor allele frequency of 0.4% in a control population. Additional studies are needed to further evaluate this finding.
None of the other polymorphisms identified was concordantly rated to have an impact on protein function, splicing, or gene regulation by the applied programs.
In summary, a combination of bioinformatics tools and traditional sequencing efforts was applied to evaluate candidate genes for photosensitivity. Gene prioritization revealed GRIN2A as an intriguing candidate. Although sequence analysis failed to show a direct role of GRIN2A in photosensitivity, GRIN2A is increasingly recognised in human epilepsies. We suggest that in silico prioritization has the capability to identify exciting genes within a given linkage region.
We would like to thank I. Urbach, M. Newsky, A. Dietsch, S. Greve, and M. Depta for technical assistance. We are grateful to A. Ackerhans and K. Moldenhauer for database management. SvS receives institutional support from the Christian-Albrechts-University Kiel, Germany and received a scholarship from the German Epilepsy Society for research activities (Otfrid-Foerster-Stipendium).
None of the authors has any conflict of interest to disclose.
We confirm that we have read the Journal’s position on issues involved in ethical publication and affirm that this report is consistent with those guidelines.