Describe the enhancement requested
When running a take function over a chunked array, the chunked array gets concatenated first to create a single contiguous array and then the take kernel is invoked on the resulting array. This is usually unnecessary as we could directly run the take kernel on the individual chunks.
This might also be related to #25822
Component(s)
C++