Google Cloud的Python文档具有一个具有以下功能的脚本(Python-docs-samples/dataproc/dataproc/dataproc/dataproc/submit_job_to_cluster.py):
def create_cluster(dataproc, project, zone, region, cluster_name):
print('Creating cluster...')
zone_uri = 'https://www.googleapis.com/compute/v1/projects/{}/zones/{}'.format(
project, zone)
cluster_data = {
'projectId': project,
'clusterName': cluster_name,
'config': {
'gceClusterConfig': {
'zoneUri': zone_uri
}
}
}
result = dataproc.projects().regions().clusters().create(
projectId=project,
region=region,
body=cluster_data).execute()
return result
我想知道是否可以在此功能中指定群集的主和工人节点的机器类型?
以下应有作用:
def create_cluster(dataproc, project, zone, region, cluster_name):
print('Creating cluster...')
zone_uri = 'https://www.googleapis.com/compute/v1/projects/{}/zones/{}'.format(
project, zone)
cluster_data = {
'projectId': project,
'clusterName': cluster_name,
'config': {
'gceClusterConfig': {
'zoneUri': zone_uri
},
'masterConfig': {
'machineTypeUri' : 'n1-standard-1',
},
'workerConfig': {
'machineTypeUri' : 'n1-standard-4',
},
}
}
}
result = dataproc.projects().regions().clusters().create(
projectId=project,
region=region,
body=cluster_data).execute()
return result
https://cloud.google.com/dataproc/docs/reference/Rest/Rest/v1/projects.regions.clusters.clusters#clusterconfig