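I have 3 dataframes: train, validation, and test. I want to build a dictionary from these 3 dataframes that produces the output below, where features is the list of each dataframe's column names.
How do I build this dictionary from the dataframes?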
DatasetDict({
    train: Dataset({
        features: ['ID', 'Tweet', 'anger', 'anticipation', 'disgust', 'fear', 'joy', 'love', 'optimism', 'pessimism', 'sadness', 'surprise', 'trust'],
        num_rows: 6838
    })
    test: Dataset({
        features: ['ID', 'Tweet', 'anger', 'anticipation', 'disgust', 'fear', 'joy', 'love', 'optimism', 'pessimism', 'sadness', 'surprise', 'trust'],
        num_rows: 3259
    })
    validation: Dataset({
        features: ['ID', 'Tweet', 'anger', 'anticipation', 'disgust', 'fear', 'joy', 'love', 'optimism', 'pessimism', 'sadness', 'surprise', 'trust'],
        num_rows: 886
    })
})
This is what I am trying:
DatasetDict = {}
dataframes = [train, validation, test]
for grp in dataframes:
    DatasetDict[grp] = df
But it doesn't work.
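# Tag each dataframe with its split name (a plain Python attribute; pandas does not carry it over to copies)
train.name = 'train'
test.name = 'test'
validation.name = 'validation'

# Build one summary entry per split, keyed by that name
datasetdict = {}
dataframes = [train, validation, test]
for df in dataframes:
    datasetdict[df.name] = {'features': df.columns.to_list(), 'num_rows': len(df)}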
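As a variation on the loop above, here is a minimal sketch that keeps the split names in an ordinary dict instead of attaching a `name` attribute to each dataframe. The three small dataframes and the `splits` mapping are hypothetical stand-ins for illustration only; the real `train`, `validation`, and `test` would take their place.

```python
import pandas as pd

# Hypothetical stand-ins for the real train/validation/test dataframes.
columns = ["ID", "Tweet", "anger", "joy"]
train = pd.DataFrame([[1, "a", 0, 1], [2, "b", 1, 0], [3, "c", 0, 0]], columns=columns)
validation = pd.DataFrame([[4, "d", 1, 1]], columns=columns)
test = pd.DataFrame([[5, "e", 0, 1], [6, "f", 1, 0]], columns=columns)

# Map each split name to its dataframe explicitly (assumed helper mapping),
# so nothing depends on a custom attribute set on the dataframe.
splits = {"train": train, "validation": validation, "test": test}

# One summary entry per split: column names and row count.
datasetdict = {
    name: {"features": df.columns.to_list(), "num_rows": len(df)}
    for name, df in splits.items()
}

print(datasetdict["train"])
```

This produces a plain Python dict that summarizes each split. If the goal is an actual Hugging Face `DatasetDict` object rather than a summary, the `datasets` library provides `Dataset.from_pandas` and `DatasetDict` for that purpose; that route is not shown here.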