Casana Eslava, R (2019) Identification of Data Structure with Machine Learning: From Fisher to Bayesian networks. Doctoral thesis, Liverpool John Moores University.
|
Text
2018raulcasanaphd.pdf - Published Version Download (14MB) | Preview |
Abstract
This thesis proposes a theoretical framework to thoroughly analyse the structure of a dataset in terms of a) metric, b) density and c) feature associations. To look into the first aspect, Fisher's metric learning algorithms are the foundations of a novel manifold based on the information and complexity of a classification model. When looking at the density aspect, the Probabilistic Quantum clustering, a Bayesian version of the original Quantum Clustering is proposed. The clustering results will depend on local density variations, which is a desired feature when dealing with heteroscedastic data. To address the third aspect, the constraint-based PC-algorithm is the starting point of many structure learning algorithms, it is focused on finding feature associations by means of conditional independent tests. This is then used to select Bayesian networks, based on a regularized likelihood score. These three topics of data structure analysis were fully tested with synthetic data examples and real cases, which allowed us to unravel and discuss the advantages and limitations of these algorithms. One of the biggest challenges encountered was related to the application of these methods to a Big Data dataset that was analysed within the framework of a collaboration with a large UK retailer, where the interest was in the identification of the data structure underlying customer shopping baskets.
Item Type: | Thesis (Doctoral) |
---|---|
Uncontrolled Keywords: | Quantum Clustering; Fisher manifold; Bayesian Networks |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computer Science & Mathematics |
Date Deposited: | 18 Jun 2019 08:14 |
Last Modified: | 21 Nov 2022 14:23 |
DOI or ID number: | 10.24377/LJMU.t.00010869 |
Supervisors: | Jarman, IH, Lisboa, PJ and Ortega-Martorell, S |
URI: | https://researchonline.ljmu.ac.uk/id/eprint/10869 |
View Item |