如何在AI平台管道上使用GPU?我的管道在其中一个操作中使用了set_gpu_limit(1(,但我最终得到了一个This step is in Pending state with this message: Unschedulable: 0/3 nodes are available: 3 Insufficient nvidia.com/gpu.
错误。
几分钟后收到。。。我遵循GPU上的正常Kubeflow指令
export GPU_POOL_NAME=gpu-pool
export CLUSTER_NAME=cluster-1
gcloud container node-pools create ${GPU_POOL_NAME}
--accelerator type=nvidia-tesla-k80,count=1
--zone us-central1-a --cluster ${CLUSTER_NAME}
--num-nodes=0 --machine-type=n1-standard-4 --min-nodes=0 --max-nodes=1 --enable-autoscaling
kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded.yaml