The failure looks like this: "ValueError: Expecting a ProcessGroup, but got a <class 'text_generation_server.utils.dist.FakeGroup'>. rank=0"
@OlivierDehaene OR @Narsil
│ ╭───────────────────────────────── locals ─────────────────────────────────╮ │
│ │ async_op = False │ │
│ │ device_ids = None │ │
│ │ group = <text_generation_server.utils.dist.FakeGroup object at │ │
│ │ 0x7f20935d8d90> │ │
│ │ opts = <torch.distributed.distributed_c10d.BarrierOptions object │ │
│ │ at 0x7f2093631130> │ │
│ ╰──────────────────────────────────────────────────────────────────────────╯ │
│ │
│ /opt/conda/lib/python3.11/site-packages/torch/distributed/distributed_c10d.p │
│ y:810 in _get_object_coll_device │
│ │
│ 807 │ │ │ # RPC uses Gloo for object collectives │
│ 808 │ │ │ return "cpu" │
│ 809 │ │ else: │
│ ❱ 810 │ │ │ raise ValueError(f"Expecting a ProcessGroup, but got a {t │
│ 811 │ │
│ 812 │ """ │
│ 813 │ group._device_types is a property pybind that returns the dev │
│ │
│ ╭───────────────────────────────── locals ─────────────────────────────────╮ │
│ │ group = <text_generation_server.utils.dist.FakeGroup object at │ │
│ │ 0x7f20935d8d90> │ │
│ ╰──────────────────────────────────────────────────────────────────────────╯ │
╰──────────────────────────────────────────────────────────────────────────────╯
ValueError: Expecting a ProcessGroup, but got a <class
'text_generation_server.utils.dist.FakeGroup'>. rank=0
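
For context, the traceback suggests the failure comes from an `isinstance` check: the barrier path receives the `FakeGroup` object, which does not appear to be a `torch.distributed` `ProcessGroup`. Below is a minimal sketch of that mechanism using stand-in classes, not the real torch or text_generation_server types. Every name in it is an illustrative assumption, and `safe_barrier` is a hypothetical guard, not an existing API or the project's fix.

```python
class ProcessGroup:
    """Stand-in for torch.distributed.ProcessGroup (assumption, not the real class)."""


class FakeGroup:
    """Stand-in for text_generation_server.utils.dist.FakeGroup (assumption)."""


def get_object_coll_device(group: object) -> str:
    """Simplified imitation of the type check visible in the traceback."""
    if not isinstance(group, ProcessGroup):
        raise ValueError(f"Expecting a ProcessGroup, but got a {type(group)}.")
    return "cpu"


def safe_barrier(group: object) -> bool:
    """Hypothetical guard: skip the barrier for anything that is not a ProcessGroup.

    Returns True if the barrier path would run, False if it is skipped.
    """
    if not isinstance(group, ProcessGroup):
        return False
    get_object_coll_device(group)
    return True
```

With these stand-ins, `get_object_coll_device(FakeGroup())` raises the same kind of `ValueError` as above, while `safe_barrier(FakeGroup())` returns `False` without raising. Whether skipping the barrier is acceptable in the single-process case is for the maintainers to confirm.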