
Getting "RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor" #240

Closed
Jeriousman opened this issue Mar 17, 2022 · 8 comments · Fixed by #260
Labels
bug Something isn't working

Comments

@Jeriousman

Describe the bug(问题描述)
history = model.fit(x, y, batch_size=256, epochs=20, verbose=1, validation_split=0.4, shuffle=True)
When I call model.fit for the DIEN model using your default example run_dien.py, it works when I set the device to cpu, but with cuda I get the error below.

cuda ready...
0it [00:00, ?it/s]cuda:0
Train on 4 samples, validate on 0 samples, 2 steps per epoch

Traceback (most recent call last):

  File "<ipython-input-1-e985ce1c0aa2>", line 69, in <module>
    history = model.fit(x, y, batch_size=2, epochs=10, verbose=1, validation_split=0, shuffle=False)

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/deepctr_torch/models/basemodel.py", line 244, in fit
    y_pred = model(x).squeeze()

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/deepctr_torch/models/dien.py", line 92, in forward
    masked_interest, aux_loss = self.interest_extractor(keys_emb, keys_length, neg_keys_emb)

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/torch/nn/modules/module.py", line 727, in _call_impl
    result = self.forward(*input, **kwargs)

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/deepctr_torch/models/dien.py", line 221, in forward
    enforce_sorted=False)

  File "/home/hojun/anaconda3/envs/ai/lib/python3.6/site-packages/torch/nn/utils/rnn.py", line 244, in pack_padded_sequence
    _VF._pack_padded_sequence(input, lengths, batch_first)

RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor

So I tried lengths.cpu() and lengths.to('cpu'), but neither of them solved the problem. Can you please provide a solution?

Operating environment(运行环境):

  • python version 3.6
  • torch version 1.7.1
  • deepctr-torch version 0.2.7
@zanshuxun
Collaborator

zanshuxun commented Apr 4, 2022

In newer versions of PyTorch, the lengths parameter of torch.nn.utils.rnn.pack_padded_sequence has changed: it must now be a 1-D CPU int64 tensor. (Details can be found in pytorch/pytorch#43227.)

(two screenshots of the pack_padded_sequence documentation illustrating the change)
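The requirement can be demonstrated in isolation. This is a minimal sketch (assuming PyTorch >= 1.7, with made-up shapes): keeping lengths on CPU before packing avoids the RuntimeError that moving it to CUDA would trigger.

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

seqs = torch.randn(3, 5, 4)        # (batch, max_len, features), zero-padded
lengths = torch.tensor([5, 3, 2])  # must stay a 1-D int64 CPU tensor

# On newer PyTorch, passing a CUDA `lengths` tensor raises the RuntimeError,
# so move (or keep) it on CPU before packing:
packed = pack_padded_sequence(seqs, lengths.cpu(),
                              batch_first=True, enforce_sorted=False)
```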

@zanshuxun zanshuxun added the bug Something isn't working label Apr 4, 2022
@Jeriousman
Author

Obviously I tried that. But as I said, none of those worked; I had to downgrade all the way to torch 1.4.0 to get it running.

@zanshuxun
Collaborator

Obviously I tried that. But as I said, none of those worked; I had to downgrade all the way to torch 1.4.0 to get it running.

Where did you use .cpu()? Did the device of the tensor actually change after you called .cpu()?

@Jeriousman
Author

Jeriousman commented Apr 6, 2022

Yes, I did, as I mentioned above:

So I tried lengths.cpu() and lengths.to('cpu'), but neither of them solved the problem.

The lengths argument is exactly the tensor I moved to CPU, as mruberry and ngimel suggested in that same issue. That was also the first page I found when I was trying to fix the problem.

@zanshuxun
Collaborator

  1. Where did you use .cpu()?

Could you tell me the corresponding line number in the code? For example:

packed_keys = pack_padded_sequence(masked_keys, lengths=masked_keys_length, batch_first=True,
enforce_sorted=False)

Did you call .cpu() on masked_keys_length here?

or other places like

packed_keys = pack_padded_sequence(keys, lengths=keys_length, batch_first=True, enforce_sorted=False)

or

packed_interests = pack_padded_sequence(interests, lengths=keys_length, batch_first=True,

  1. Did the device of the tensor change after you use .cpu()?

Could you print the device of the tensor before and after your .cpu() call, to check whether it actually worked? If it did, you should not see the error "RuntimeError: 'lengths' argument should be a 1D CPU int64 tensor, but got 1D cuda:0 Long tensor".
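One thing worth checking with such a print: .cpu() returns a new tensor and does not move the original in place, so calling it without re-assigning the result has no effect. A small sketch (device-agnostic, so it also runs on a CPU-only machine):

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
lengths = torch.tensor([5, 3, 2], device=device)

lengths.cpu()          # returns a NEW tensor; `lengths` itself is unchanged
moved = lengths.cpu()  # the result must be captured to get the CPU copy

print(lengths.device, moved.device)
```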

@Jeriousman
Author

Jeriousman commented Apr 7, 2022

Hello. I added .cpu() to all the pack_padded_sequence calls, e.g. masked_keys_length.cpu(). When I did this, the tensor was converted to a CPU tensor, but the error was still there. For me, only downgrading the torch version worked. It is strange, though; that was the whole point of my question: it became a CPU tensor, but it still didn't work. Is it working on your side?

@zanshuxun zanshuxun pinned this issue Jun 20, 2022
@zanshuxun
Collaborator

@Jeriousman I added .cpu() to all the pack_padded_sequence(...) calls in dien.py, and it works. Maybe you missed something. Could you paste the traceback and your dien.py file?
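For reference, the fix amounts to moving only the lengths argument to CPU at each call site quoted above. This is a hedged, self-contained sketch (the tensor shapes are stand-ins; in the real model they come from DIEN's embedding layers and may live on CUDA):

```python
import torch
from torch.nn.utils.rnn import pack_padded_sequence

# Hypothetical stand-ins for the tensors inside DIEN's interest extractor.
masked_keys = torch.randn(4, 6, 8)               # (batch, max_len, emb_dim)
masked_keys_length = torch.tensor([6, 4, 3, 1])  # may be on CUDA in the model

# The patched call: only `lengths` is moved to CPU; the sequence data
# itself can stay on whatever device the model runs on.
packed_keys = pack_padded_sequence(masked_keys,
                                   lengths=masked_keys_length.cpu(),
                                   batch_first=True, enforce_sorted=False)
```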

shenweichen pushed a commit that referenced this issue Aug 15, 2022
1. Add multi-task models: SharedBottom, ESMM, MMOE, PLE
2. Bugfix:
#240
#232
shenweichen added a commit that referenced this issue Oct 21, 2022
* add multitask models

1. Add multi-task models: SharedBottom, ESMM, MMOE, PLE
2. Bugfix:
#240
#232

* support python 3.9/3.10 (#259)
* fix: variable name typo (#257)
Co-authored-by: Jason Zan <[email protected]>
Co-authored-by: Yi-Xuan Xu <[email protected]>
@shenweichen shenweichen unpinned this issue Oct 22, 2022
@umanniyaz

Hi, can anyone tell me how to handle this same error on torch==1.8.0?
