Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some errors when using multi-gpus #62

Open
lcf000000 opened this issue Sep 21, 2018 · 1 comment
Open

Some errors when using multi-gpus #62

lcf000000 opened this issue Sep 21, 2018 · 1 comment

Comments

@lcf000000
Copy link

lcf000000 commented Sep 21, 2018

Training is too slow, and I want using a 4xTian xp to speed up it.
when I set
opts.gpus=[1,2,3,4]
then I get some errors as:

`Warning: Function labindex has the same name as a MATLAB builtin. We suggest you rename the function to avoid a potential name
conflict.

In path at 109
In addpath at 86
In cnn_train_dag at 11
In trainig_siameseFC at 97
In run_increasingData at 6
Warning: Function numlabs has the same name as a MATLAB builtin. We suggest you rename the function to avoid a potential name
conflict.
In path at 109
In addpath at 86
In cnn_train_dag at 11
In trainig_siameseFC at 97
In run_increasingData at 6
Starting parallel pool (parpool) using the 'local' profile ... connected to 1 workers.
cnn_train_dag: resetting GPU
Lab 1:
ans =
Error using cnn_train_dag>(spmd) (line 492)
The parallel pool that SPMD was using has been shut down.

Error in cnn_train_dag>prepareGPUs (line 492)
spmd

Error in cnn_train_dag (line 92)
prepareGPUs(opts, epoch == start+1) ;

Error in trainig_siameseFC (line 97)
[net, stats] = cnn_train_dag(net, imdb, batch_fn, opts.train);

Error in run_increasingData (line 6)
trainig_siameseFC(imdb_data, 86);

The client lost connection to worker 1. This might be due to network problems, or the interactive communicating job might have
errored.
`

@lcf000000 lcf000000 changed the title I get some errors when I use multi-gpus Some errors when using multi-gpus Sep 21, 2018
@jvlmdr
Copy link

jvlmdr commented Sep 21, 2018

Sorry, we haven't tested support for training with multiple GPUs. These look like errors coming from matconvnet and/or the parallel toolbox.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants