我正在尝试使用parapooling在matconvnet中运行mnist示例。我已经分配了 2 个 GPU,但它在尝试运行cnn_mnist_experiments时显示此错误:
Error using cnn_train>(spmd) (line 157)
Error detected on workers 1 2.
Error in cnn_train (line 157)
spmd
Error in cnn_mnist (line 55)
[net, info] = trainfn(net, imdb, getBatch(opts), ...
Error in cnn_mnist_experiments (line 3)
[net_bn, info_bn] = cnn_mnist(...
Caused by:
Error using ParameterServer/startWithMMap (line 170)
An UndefinedFunction error was thrown on the workers for 'vl_cudatool'. This may be because the
file containing 'vl_cudatool' is not accessible on the workers. Specify the required files for
this parallel pool using the command: addAttachedFiles(pool, ...). See the documentation for
parpool for more details.
Undefined function 'vl_cudatool' for input arguments of type 'single'.
Error using ParameterServer/startWithMMap (line 170)
An UndefinedFunction error was thrown on the workers for 'vl_cudatool'. This may be because the
file containing 'vl_cudatool' is not accessible on the workers. Specify the required files for
this parallel pool using the command: addAttachedFiles(pool, ...). See the documentation for
parpool for more details.
Undefined function 'vl_cudatool' for input arguments of type 'single'.
显然,工作人员无法访问"vl_cudatool"文件。这里可能有什么问题?
类型为"single"的输入参数的未定义函数"vl_cudatool"。
这意味着您的vl_cudatool函数不在您的路径中
尝试
which vl_cudatool
我不知道vl_cudatool是什么。也许你可以尝试一些早期版本的MatConvNet?