Institute of Electrical & Electronic Engineers
Non-Local Means (NLM) algorithm is widely considered as a state-of-the-art denoising filter in many research fields. High computational complexity led to implementations on Graphic Processor Unit (GPU) architectures, which achieve reasonable running times by filtering, slice-by-slice, 3D datasets with a 2D NLM approach. Here the authors present a fully 3D NLM implementation on a multi-GPU architecture and suggest its high scalability. The performance results they discuss encourage the coding of further filter improvements and the investigation of a large spectrum of applicative scenarios.