Facial reconstruction

Search LJMU Research Online

Browse Repository | Browse E-Theses

GSWO: A Programming Model for GPU-enabled Parallelization of Sliding Window Operations in Image Processing

Yang, P, Clapworthy, G, Dong, F, Codreanu, VB, Williams, DP, Liu, BQ, Roerdink, JBTM and Deng, ZK (2016) GSWO: A Programming Model for GPU-enabled Parallelization of Sliding Window Operations in Image Processing. Signal Processing: Image Communication. ISSN 1879-2677

GSWO_IC.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview


Sliding Window Operations (SWOs) are widely used in image processing applications. They often have to be performed repeatedly across the target image, which can demand significant computing resources when processing large images with large windows. In applications in which real-time performance is essential, running these filters on a CPU often fails to deliver results within an acceptable timeframe. The emergence of sophisticated graphic processing units (GPUs) presents an opportunity to address this challenge. However, GPU programming requires a steep learning curve and is error-prone for novices, so the availability of a tool that can produce a GPU implementation automatically from the original CPU source code can provide an attractive means by which the GPU power can be harnessed effectively. This paper presents a GPUenabled programming model, called GSWO, which can assist GPU novices by converting their SWO-based image processing applications from the original C/C++ source code to CUDA code in a highly automated manner. This model includes a new set of simple SWO pragmas to generate GPU kernels and to support effective GPU memory management. We have implemented this programming model based on a CPU-to-GPU translator (C2GPU). Evaluations have been performed on a number of typical SWO image filters and applications. The experimental results show that the GSWO model is capable of efficiently accelerating these applications, with improved applicability and a speed-up of performance compared to several leading CPU-to- GPU source-to-source translators.

Item Type: Article
Subjects: Q Science > QA Mathematics > QA76 Computer software
Divisions: Computer Science & Mathematics
Publisher: Elsevier
Date Deposited: 06 May 2016 10:41
Last Modified: 04 Sep 2021 04:16
DOI or ID number: 10.1016/j.image.2016.05.003
URI: https://researchonline.ljmu.ac.uk/id/eprint/3573
View Item View Item