Identification of prognostic signature based on the copy number variation (CNV) and expression in acute myeloid leukemia
Abstract Background Acute myeloid leukemia (AML) is caused by multiple genetic alterations in the hematopoietic progenitors, and molecular genetic analysis has provided useful information for AML diagnosis and prognosis. However, an integrative understanding of the prognostic value of specific copy number variation (CNV) and CNV-modulated gene expression has been limited. Methods We conducted an integrative analysis of CNV profiling and gene expression using data from the TARGET and TCGA AML cohorts. The CNV data from TCGA were analyzed using the GISTIC and all CNV data by genes on every patient were obtained. CNV survival analysis and mRNA survival analysis were conducted with the Multivariate Cox proportional hazards regression model using R software with “survminer” and “survival” packages. KEGG cancer panel genes were extracted from the cancer-related pathways from Kyoto Encyclopedia of Genes and Genomes (KEGG). The R package “circlize” was used for mapping the CNV genes to chromosomes. Results From this investigation, we observed distinct CNV patterns in the AML risk groups as well as the expression of 251 genes significantly modulated by CNV in both cohorts. There were 102 CNV genes (located at 7q31-34, 16q24) associated with clinical outcomes in AML, which were identified in the TARGET cohort and validated in the TCGA cohort, three of which being miRNA genes (MIR29A, MIR183, MIR335) that overlapped with a KEGG cancer panel. Five genes were identified whose expression was modulated by CNV and significantly associated with clinical outcomes, and among them, the deletion of SEMA4D and CBFB were found to potentially have protective effects against AML. The result was also validated with patient marrow samples. Moreover, the distribution of CNV in these five CNV-modulated genes was independent of the risk groups, which suggests that they are independent prognosis factors. Conclusion Overall, this study identified 102 CNV genes and five CNV-modulated gene expression that is crucial for developing new modes of prognosis evaluation and target therapy for AML.