Facial reconstruction

Search LJMU Research Online

Browse Repository | Browse E-Theses

Identification of Data Structure with Machine Learning: From Fisher to Bayesian networks

Casana Eslava, R (2019) Identification of Data Structure with Machine Learning: From Fisher to Bayesian networks. Doctoral thesis, Liverpool John Moores University.

2018raulcasanaphd.pdf - Published Version

Download (14MB) | Preview


This thesis proposes a theoretical framework to thoroughly analyse the structure of a dataset in terms of a) metric, b) density and c) feature associations. To look into the first aspect, Fisher's metric learning algorithms are the foundations of a novel manifold based on the information and complexity of a classification model. When looking at the density aspect, the Probabilistic Quantum clustering, a Bayesian version of the original Quantum Clustering is proposed. The clustering results will depend on local density variations, which is a desired feature when dealing with heteroscedastic data. To address the third aspect, the constraint-based PC-algorithm is the starting point of many structure learning algorithms, it is focused on finding feature associations by means of conditional independent tests. This is then used to select Bayesian networks, based on a regularized likelihood score. These three topics of data structure analysis were fully tested with synthetic data examples and real cases, which allowed us to unravel and discuss the advantages and limitations of these algorithms. One of the biggest challenges encountered was related to the application of these methods to a Big Data dataset that was analysed within the framework of a collaboration with a large UK retailer, where the interest was in the identification of the data structure underlying customer shopping baskets.

Item Type: Thesis (Doctoral)
Uncontrolled Keywords: Quantum Clustering; Fisher manifold; Bayesian Networks
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science & Mathematics
Date Deposited: 18 Jun 2019 08:14
Last Modified: 21 Nov 2022 14:23
DOI or ID number: 10.24377/LJMU.t.00010869
Supervisors: Jarman, IH, Lisboa, PJ and Ortega-Martorell, S
URI: https://researchonline.ljmu.ac.uk/id/eprint/10869
View Item View Item