循环从一个数据帧创建多个数据帧



我正在尝试创建一个循环,将从一个大数据帧输出到多个数据帧。

raw_df['names'] = [joe, joe, bob, john, john]
raw_df['order_id'] = [10, 12, 5, 20, 25]
raw_df['amount'] = [100, 1000, 200, 20 25]
for name in raw_df['name'].unique():
names = pd.DataFrame(raw_df.loc[raw_df['name'] == name])
name['cummulative_sum'] = owner_names['amount'].cumsum()

所有名称的预期结果:joe.head()

name   id   sum
joe    10   100    
joe    12   110

可以对感兴趣的变量执行.groupby,而不是检查每个唯一项:

for group_name, group_df in raw_df.groupby("name"):
print("Processing name:", group_name)
names = group_df # this is the same as "names" in your snippet
names["cum_sum"] = names["amount"].cumsum()

group_dfraw_df.loc[raw_df['name'] == name]相同。

你可以做

variables = locals()
for name, data in raw_df.groupby('names'):
variables[name] = data

joe
Out[607]: 
names  order_id  amount
0   joe        10     100
1   joe        12    1000

最新更新