Performs numRows parallel scan operations of numElements each on its
input (d_in) and places the output in d_out, with the scan parameters
set by config.
Performs a scan operation of numElements on its input in GPU memory
(d_in) and places the output in GPU memory (d_out), with the scan
parameters specified in the plan pointed to by planHandle.
Performs a segmented scan operation of numElements on its input in GPU
memory (d_idata) and places the output in GPU memory (d_out), with the
scan parameters specified in the plan pointed to by planHandle.