Closed
Description
Given an array of integer values which may be the labels of some list elements, we want to generate an array of offsets so we can create a lists column from these offsets and elements gathered from another lists column using the input labels as markup for gather map.
For example:
input_labels = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2, 5, 5]
output_offsets = [0, 4, 6, 10, 12]
This is basically extracting the existing function from drop_list_duplicates
(
cudf/cpp/src/lists/drop_list_duplicates.cu
Line 486 in ca952f8
drop_list_duplicates
. In addition, it should be used to implement the set-like operations (#10409).
Activity