Maybe something you'd like to check, could be replicated with OpenCL perhaps? If you ofc decide to take a GPGPU route.

http://www.geeks3d.com/20090827/deferred-shading-with-compute-shaders-in-frostbite-2-engine/

The slide from page 19 and onwards is about the pros and cons and other info.