如何找到多类Catboost分类器的每个类的F1分数?我已经通读了文档和github repo,其中有人问了同样的问题。然而,我无法找出实现这一目标的密码。我知道我必须在CatBoostClassifier()
中使用custom_metric
参数,但当我想要多类数据集的每个类的F1
分数时,我不知道custom_metric
可以接受哪些参数。
假设您有一个玩具数据集(来自文档(:
from catboost import Pool
cat_features = [0, 1, 2]
data = [["a","b", 1, 4, 5, 6],
["a","b", 4, 5, 6, 7],
["c","d", 30, 40, 50, 60]]
label = [0, 1, 2]
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(data, labels, test_size=0.2)
train_pool = Pool(X_train, y_train, cat_features=categorical_features_indices)
validate_pool = Pool(X_test, y_test, cat_features=categorical_features_indices)
params = {"loss_function": "MultiClass",
"depth": symmetric_tree_depth,
"num_trees": 500,
# "eval_metric": "F1", # this doesn't work
"verbose": False}
model = CatBoostClassifier(**params)
model.fit(train_pool, eval_set=validate_pool)
您应该使用TotalF1
params = {
'leaf_estimation_method': 'Gradient',
'learning_rate': 0.01,
'max_depth': 8,
'bootstrap_type': 'Bernoulli',
'objective': 'MultiClass',
'subsample': 0.8,
'random_state': 42,
'verbose': 0,
"eval_metric" : 'TotalF1',
"early_stopping_rounds" : 100
}
https://catboost.ai/docs/concepts/loss-functions-multiclassification.html