Open
Description
A user process group has to be created before nvFuser can even initialize a device mesh. Currently, nvFuser's communicator creates its own process groups, which listen on a different port. This wastes resources and has probably triggered some of the conflicts that led to NVFUSER_DISABLE=multidevice.
cc @syed-ahmed: I'm pretty sure I asked you about this and then forgot the answer. Did you recommend a way to register the user process group so that it's accessible to the C++ side of nvFuser?
Activity