ValueError: Found input variables with inconsistent numbers of samples: [1319, 245]



I am running into a problem with train_test_split:

final = []
final.append(dataset)
final.append(dataset1)
X = dataset[:,0:2]
y = dataset1[:,2]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15, random_state=42)

Error:

Traceback (most recent call last):
  File "C:\Users\Lenovo\anaconda3\thesis code\TC_code.py", line 73, in <module>
    X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15, random_state=42)
  File "C:\Users\Lenovo\anaconda3\lib\site-packages\sklearn\model_selection\_split.py", line 2172, in train_test_split
    arrays = indexable(*arrays)
  File "C:\Users\Lenovo\anaconda3\lib\site-packages\sklearn\utils\validation.py", line 299, in indexable
    check_consistent_length(*result)
  File "C:\Users\Lenovo\anaconda3\lib\site-packages\sklearn\utils\validation.py", line 262, in check_consistent_length
    raise ValueError("Found input variables with inconsistent numbers of"
ValueError: Found input variables with inconsistent numbers of samples: [1319, 245]

Check the shapes of X and y. They must have the same number of rows.

print(X.shape)
print(y.shape)
if X.shape[0] != y.shape[0]:
    print("X and y rows are mismatched, check dataset again")

Note:

  • X and y must have the same number of rows
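
The error can be reproduced with a minimal sketch. It assumes NumPy arrays with the row counts shown in the error message (1319 and 245); the array contents are random placeholders, not the original data:

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
dataset = rng.random((1319, 3))   # placeholder standing in for the asker's dataset
dataset1 = rng.random((245, 3))   # placeholder standing in for the asker's dataset1

X = dataset[:, 0:2]    # 1319 rows
y = dataset1[:, 2]     # 245 rows, taken from a different array

try:
    train_test_split(X, y, test_size=0.15, random_state=42)
    error_message = None
except ValueError as exc:
    # train_test_split refuses inputs whose first dimensions differ
    error_message = str(exc)

print(error_message)
```

Because X and y are sliced from arrays of different lengths, train_test_split cannot pair each row of X with a label and raises the ValueError.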

You used dataset for X but dataset1 for y, which is most likely the main error in your code.

See here:

X = dataset[:,0:2]
y = dataset1[:,2]

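dataset and dataset1 are two different data frames that probably hold different data, which is why they have different numbers of rows (1319 and 245).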
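
One possible fix, sketched under the assumption that the target is the third column of dataset itself (the question does not confirm this), is to slice X and y from the same array. The data below is a random placeholder:

```python
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
# Placeholder: two feature columns plus one target column (assumed layout)
dataset = rng.random((1319, 3))

X = dataset[:, 0:2]   # features: columns 0 and 1
y = dataset[:, 2]     # target: column 2 of the SAME array, so row counts match

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.15, random_state=42
)

print(X_train.shape, y_train.shape)
print(X_test.shape, y_test.shape)
```

If the labels really do live in dataset1, the two arrays would first have to be aligned so that row i of one corresponds to row i of the other; with 1319 versus 245 rows that is not possible without knowing how the two were produced.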
