Skip to content

[FEA] Replace cuco::static_multimap by cuco::static_map in semi-anti-join #11313

Closed
@ttnghia

Description

The implementation of semi-anti-join was refactored in #11100. One of the changes was to use cuco::static_multimap, which was later discovered that it has performance issue when the input tables have too many duplicate rows (#11299).

We should use cuco::static_map to avoid the performance issue. However, this is not just a simple change in implementation but needs a new FEA from cuco which adds pair_contains into static_map: NVIDIA/cuCollections#191.

Activity

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Assignees

Labels

0 - BacklogIn queue waiting for assignmentPerformancePerformance related issueSparkFunctionality that helps Spark RAPIDSfeature requestNew feature or requestlibcudfAffects libcudf (C++/CUDA) code.non-breakingNon-breaking change

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions