Ok, I found a suitable solution which only requires a multiplication to get the screen space texcoords in the loop. I'll clean up and post the code later. Perhaps I'll even be able to take the last step to pure ss-texcoords in the meantime...