currently i test the CUDA.NET wrapper. i´m researching for the best way to speed up my calculations (i already optimized the algorithm-layer)