Facial reconstruction

Search LJMU Research Online

Browse Repository | Browse E-Theses

Deep Learning Classification of Polygenic Obesity using Genome Wide Association Study SNPs

Curbelo Montañez, CA, Fergus, P, Montañez, AC, Hussain, A, Al-Jumeily, D and Chalmers, C (2018) Deep Learning Classification of Polygenic Obesity using Genome Wide Association Study SNPs. In: 2018 International Joint Conference on Neural Networks (IJCNN) . (IEEE World Congress on Computational Intelligence, 8th - 13th July 2018, Rio de Janeiro, Brazil).

[img]
Preview
Text
1804.03198v1.pdf - Accepted Version

Download (539kB) | Preview

Abstract

In this paper, association results from genome-wide association studies (GWAS) are combined with a deep learning framework to test the predictive capacity of statistically significant single nucleotide polymorphism (SNPs) associated with obesity phenotype. Our approach demonstrates the potential of deep learning as a powerful framework for GWAS analysis that can capture information about SNPs and the important interactions between them. Basic statistical methods and techniques for the analysis of genetic SNP data from population-based genome-wide studies have been considered. Statistical association testing between individual SNPs and obesity was conducted under an additive model using logistic regression. Four subsets of loci after quality-control (QC) and association analysis were selected: P-values lower than 1x10-5 (5 SNPs), 1x10-4 (32 SNPs), 1x10-3 (248 SNPs) and 1x10-2 (2465 SNPs). A deep learning classifier is initialised using these sets of SNPs and fine-tuned to classify obese and non-obese observations. Using a deep learning classifier model and genetic variants with P-value < 1x10-2 (2465 SNPs) it was possible to obtain results (SE=0.9604, SP=0.9712, Gini=0.9817, LogLoss=0.1150, AUC=0.9908 and MSE=0.0300). As the P-value increased, an evident deterioration in performance was observed. Results demonstrate that single SNP analysis fails to capture the cumulative effect of less significant variants and their overall contribution to the outcome in disease prediction, which is captured using a deep learning framework.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: cs.CY; cs.CY; cs.CE; cs.LG; q-bio.GN
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science & Mathematics
Publisher: IEEE
Related URLs:
Date Deposited: 03 May 2018 11:41
Last Modified: 13 Apr 2022 15:16
URI: https://researchonline.ljmu.ac.uk/id/eprint/8495
View Item View Item