我正在尝试可视化我的数据,所以我尝试了以下代码,但我得到了一个错误。
dataset = pd.read_csv(r'/Users/Downloads/dataset/datasets/mydatasets/out_4.csv')
df = dataset[["diffTime","diffP","diffS","diffH","diffE","diffA"]].to_numpy()
out=dataset["labels"]
import plotly.express as px
df = df.reshape(-1)
features = ["diffTime","diffP","diffS","diffH","diffE","diffA"]
fig = px.scatter_matrix(
df,
dimensions=features,
color="out"
)
fig.update_traces(diagonal_visible=False)
fig.show()
以下是我得到的错误:
ValueError:"dimensions_0"的值不是"data_frame"。应为[0]之一,但收到:diffTime
如果有人帮我解决问题,我将不胜感激。
您需要在df
中保持相同的列名。当您将其转换为numpy时,结构会发生变化。或者,在整形之后,将其更改为panda并添加列。
df = dataset[["diffTime","diffP","diffS","diffH","diffE","diffA"]]
dim = df.columns
fig = px.scatter_matrix(
df,
dimensions=dim ,
color="out"
)
它应该起作用。