与GridSearchCV的工作混淆

GridSearchCV实现了一个拟合方法，在该方法中，它执行n折交叉验证以确定最佳参数。在此之后，我们可以使用 predict（）直接将最佳估计器应用于测试数据 - 点击此链接： - http://scikit-learn.org/stable/auto_examples/grid_search_digits.html

它在这里说"模型在完整的开发集上训练"

但是，我们在这里只应用了 n 折交叉验证。分类器是否也以某种方式在整个数据上进行自我训练？还是只是在应用预测时选择具有 n 折中参数的最佳训练估计器？

如果要使用 predict ，则需要将'refit'设置为 True 。从文档中：

refit : boolean
    Refit the best estimator with the entire dataset. 
    If “False”, it is impossible to make predictions using 
    this GridSearchCV instance after fitting.

默认情况下看起来是正确的，因此在示例中，predict基于整个训练集。

相关内容

最新更新

热门标签：