Facial reconstruction

Search LJMU Research Online

Browse Repository | Browse E-Theses

Technical Document Query System using Transformer Model-based Machine Reading Comprehension

Kumar, A, Natalia, F, Sudirman, S and Al-Jumeily, D (2025) Technical Document Query System using Transformer Model-based Machine Reading Comprehension. In: 2024 17th International Conference on Development in eSystem Engineering (DeSE) . pp. 497-502. (2024 17th International Conference on Development in eSystem Engineering (DeSE), 6th - 8th Nov 2024, Khorfakkan, United Arab Emirates).

[img]
Preview
Text
Amit Kumar Final.pdf - Accepted Version

Download (988kB) | Preview

Abstract

Constructing a Question Answering system is a challenging task despite a significant amount of study that has been conducted in recent times on this topic. It is even more difficult to provide satisfactory responses to the inquiries raised by users in an organizational setting as opposed to in an informal setting. We present in this paper, the results of our study into the use of a transformer-based model in the development of a technical document query system with machine reading comprehension. Our method fine-tunes a pre-trained transformer model with hyperparameter optimization using a pre-processed training dataset and tested on a different dataset. We experimented using eight pre-trained models from seven different variations of the BERT transformer architecture including BERT, RoBERTa, XLM-RoBERTa, ELECTRA, ALBERT, MobileBERT, and MPNet using the SQuAD1.1 dataset for fine-tuning and the Oracle Knowledge Documentation for testing. We found that the ALBERT pre-trained model is the best model achieving 0.891, 0.950, and 0.882 performance when measured using the Exact Match, F1 score, and Confidence Score metrics - despite its relatively small model size.

Item Type: Conference or Workshop Item (Paper)
Additional Information: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science and Mathematics
Publisher: IEEE
SWORD Depositor: A Symplectic
Date Deposited: 13 Mar 2025 13:41
Last Modified: 13 Mar 2025 13:41
DOI or ID number: 10.1109/dese63988.2024.10911893
URI: https://researchonline.ljmu.ac.uk/id/eprint/25871
View Item View Item