My training data is in BigQuery. How can I use it to train a model in Cloud ML?
Change your preprocessing pipeline to read from a BigQuerySource, keeping the same Features class that the CSV example uses. Here is an example:
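# Assumed imports, matching the beam. and ml. prefixes used below.
import apache_beam as beam
import google.cloud.ml as ml
# CsvFeatures is the Features class from the CSV example; pipeline is the existing beam.Pipeline.
feature_set = CsvFeatures()
train_query = "SELECT …"
valid_query = "SELECT …"
# Read each split from BigQuery instead of from CSV files.
train = pipeline | 'read_train' >> beam.Read(beam.io.BigQuerySource(query=train_query))
valid = pipeline | 'read_valid' >> beam.Read(beam.io.BigQuerySource(query=valid_query))
# Preprocess both splits together with the shared feature set.
(metadata, train_features, eval_features) = ((train, valid) |
    ml.Preprocess('Preprocess', feature_set))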
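
The example leaves the contents of train_query and valid_query open. If both splits come from a single table, one possible approach is to bucket rows by a hash of a key column so that the two queries never overlap. The sketch below is only an illustration, not part of the original answer: the helper name build_split_queries, the table name, the key column, and the 80/20 split are all hypothetical. It also assumes BigQuery standard SQL (FARM_FINGERPRINT is a standard SQL function), so the source would need to be configured to run standard SQL queries.

```python
def build_split_queries(table, key_column, train_buckets=8, total_buckets=10):
    """Return (train_query, valid_query) that partition `table` by a hash of `key_column`.

    Hypothetical helper: rows whose hash bucket is below `train_buckets` go to
    training, the remaining rows go to validation.
    """
    # MOD is applied before ABS so the absolute value cannot overflow INT64.
    bucket = "ABS(MOD(FARM_FINGERPRINT(CAST({key} AS STRING)), {n}))".format(
        key=key_column, n=total_buckets)
    template = "SELECT * FROM `{table}` WHERE {bucket} {op} {k}"
    train_query = template.format(table=table, bucket=bucket, op="<", k=train_buckets)
    valid_query = template.format(table=table, bucket=bucket, op=">=", k=train_buckets)
    return train_query, valid_query


# Hypothetical table and key column, for illustration only.
train_query, valid_query = build_split_queries(
    "my-project.my_dataset.examples", "example_id")
```

Because the bucket depends only on the key column, a given row always lands in the same split, no matter how many times the pipeline is rerun.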