值错误:"dimensions_0"的值不是"data_frame"中列的名称。显示散点图后



我正在尝试可视化我的数据,所以我尝试了以下代码,但我得到了一个错误。

dataset = pd.read_csv(r'/Users/Downloads/dataset/datasets/mydatasets/out_4.csv')
df = dataset[["diffTime","diffP","diffS","diffH","diffE","diffA"]].to_numpy()
out=dataset["labels"]
import plotly.express as px
df = df.reshape(-1)
features = ["diffTime","diffP","diffS","diffH","diffE","diffA"]

fig = px.scatter_matrix(
df,
dimensions=features,
color="out"
)
fig.update_traces(diagonal_visible=False)
fig.show()

以下是我得到的错误:

ValueError:"dimensions_0"的值不是"data_frame"。应为[0]之一,但收到:diffTime

如果有人帮我解决问题,我将不胜感激。

您需要在df中保持相同的列名。当您将其转换为numpy时,结构会发生变化。或者,在整形之后,将其更改为panda并添加列。

df = dataset[["diffTime","diffP","diffS","diffH","diffE","diffA"]]
dim = df.columns
fig = px.scatter_matrix(
df,
dimensions=dim ,
color="out"
)

它应该起作用。

最新更新