Identification of CDH11 as an ASD Risk Gene by Matched-gene Co-expression Analysis and Mouse Behavioral Studies
Abstract Background: Gene co-expression analysis (GCA) has emerged as an important tool to identify convergent molecular pathways of ASD risk genes. The aim of this study is to identify ASD-relevant genes at the whole-genome level using GCA with consideration of the effect of confounding factors on GCA, including the size, expression level, and guanine-cytosine content of genes. Methods: Pearson’s correlation coefficient was computed to indicate the co-expression of a gene pair based on the BrainSpan human brain transcriptome dataset. Whether a gene is significantly co-expressed with a group of high-confidence ASD risk genes (hcASDs) was determined by statistically comparing the co-expression of this gene with the hcASD gene set to that of this gene with permuted gene sets of matched gene features. This method is referred to as "matched-gene co-expression analysis" (MGCA). Gene ontology (GO) analysis and construction of integrated GO enrichment networks were performed to reveal convergent pathways of co-expressed genes. Behavioral tests were carried out in gene knockout mice. Results: Gene size, mRNA length, mRNA abundance, and guanine-cytosine content were found to affect co-expression profiles of ASD genes. Using the MGCA method, we confirmed the convergence in the developmental expression profiles of hcASDs. MGCA also effectively revealed convergent molecular pathways of ASD risk genes and determined that CDH11, but not CDH9, is associated with ASD. Mouse behavioral studies showed that Cdh11-null mice, but not Cdh9-null mice, have multiple autistic-like behavioral alterations.Limitations: The use of tissue-derived transcriptomes instead of single-cell transcriptomes may have detected coincident expression of some functionally irrelevant genes in different cell types. Some ASD risk genes may have been missed due to the highly stringent statistical standard of MGCA. Another limitation is the relatively small number of animals analyzed in behavioral tests. Conclusions: Results of this study revealed the importance of considering matched gene features in GCA. CDH11 was confirmed to be an important ASD risk gene and Cdh11-null mice were found to be a very useful animal model for investigation of ASD.