小贝子编程

如果被视为索引的列中的值多次出现，如何从数据帧获取保留所有值的字典?

本文关键字：数据帧获取保留字典索引如果 python pandas dataframe
更新时间 : 2023-09-16
英文 : How to get a dict from a DataFrame that keeps all the values if the values in the column considered as index appear multiple times?

有没有最好的方法来做这样的事情？

假设我有以下数据帧：

我想得到这样的字典：

{1: [1, 2], 2:[3, 4, 5]}

请记住，列表具有不同的长度，因为值1出现两次，值2出现三次。如果我尝试

df.set_index('A').to_dic('list')

Pandas 只保留 B 中每个值的最后一个值，返回以下字典：

{1:[2], 2:[5]

将DataFrame.groupby与GroupBy.apply一起使用list表示Series，然后Series.to_dict：

d = df.groupby('A')['B'].apply(list).to_dict()
print (d)
{1: [1, 2], 2: [3, 4, 5]}

您可以按A分组，并将B中的值转换为列表：

result = {key: group['B'].tolist() for key, group in df.groupby('A')}
print(result)

输出

{1: [1, 2], 2: [3, 4, 5]}

相关内容