Using Device 0: Tesla M2090

Reducing array of type int

16777216 elements
256 threads (max)
64 blocks

Reduction, Throughput = 109.3834 GB/s, Time = 0.00061 s, Size = 16777216 Elements, NumDevsUsed = 1, Workgroup = 256

GPU result = 2139353471
CPU result = 2139353471
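
The throughput and time on the "Reduction" line can be cross-checked against each other with a little arithmetic. The sketch below is only a sanity check and rests on two assumptions that the log does not state: that each int element is 4 bytes, and that "GB" means 10^9 bytes. The variable names are illustrative and are not taken from the benchmark's source.

```python
# Figures copied from the log above.
num_elements = 16_777_216
reported_gb_per_s = 109.3834
reported_time_s = 0.00061

# Assumptions (not stated in the log): 4-byte int, 1 GB = 1e9 bytes.
bytes_per_element = 4
bytes_per_gb = 1e9

# Total bytes read by one pass over the input array.
bytes_read = num_elements * bytes_per_element

# Time implied by the reported throughput.
implied_time_s = bytes_read / (reported_gb_per_s * bytes_per_gb)

# Throughput implied by the (rounded) reported time.
implied_gb_per_s = bytes_read / reported_time_s / bytes_per_gb

print(f"bytes read: {bytes_read}")
print(f"time implied by throughput: {implied_time_s:.5f} s")
print(f"throughput implied by rounded time: {implied_gb_per_s:.1f} GB/s")
```

Under those assumptions the implied time is about 0.000614 s, which rounds to the reported 0.00061 s. Going the other way, the rounded time gives roughly 110.0 GB/s, so the small gap from 109.3834 GB/s is consistent with the time having been printed to only two significant digits.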

