[QST] What is the proper way to load checkpoint to merlin.models.torch.DLRMModel #1245

breadbread1984 · 2024-11-27T01:38:57Z

❓ Questions & Help

DLRMModel cannot load trained checkpoint properly.

Details

the torch implement of DLRMModel has an interaction layer implement with a member defined in register_buffer in condition block (see

models/merlin/models/torch/blocks/dlrm.py

Line 67 in eb1e541

self.register_buffer(

). the newly created interaction has no such member until the forward is called. therefore, the checkpoint cannot be loaded properly. what is the recommended way of loading?

breadbread1984 added the status/needs-triage label Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QST] What is the proper way to load checkpoint to merlin.models.torch.DLRMModel #1245

[QST] What is the proper way to load checkpoint to merlin.models.torch.DLRMModel #1245

breadbread1984 commented Nov 27, 2024

[QST] What is the proper way to load checkpoint to merlin.models.torch.DLRMModel #1245

[QST] What is the proper way to load checkpoint to merlin.models.torch.DLRMModel #1245

Comments

breadbread1984 commented Nov 27, 2024

❓ Questions & Help

Details