I have several DataFrames, as shown below:
df1
ID Number Score Time Result
a 45 0.3 2535 0.9
b 46 0.5 345 0.8
c 34 0.94 346 0.6
d 36 1 356 0.7
df2
ID Image Video
a 0 0
b 0 0
c 1 0
d 0 1
e 1 0
f 1 0
df3
ID Length
a 35
b 57
c 75
d 57
e 85
f 48
How can I merge them into:
ID Time Image Video Length Score
a 2535 0 0 35 0.3
b 345 0 0 57 0.5
c 346 1 0 75 0.94
d 356 0 1 57 1
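My idea is to use pd.merge with on="ID" (this will only give me rows a to d, right?) and then drop the unnecessary columns. But how do I move Score to the end? Is there another way to do this?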
Chain multiple merge calls, then make Score the last column:
df = df1.merge(df2, on='ID').merge(df3, on='ID')
df['Score'] = df.pop('Score')
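Note that the chained merge keeps Number and Result, which the desired output does not have. A minimal sketch, assuming the sample data from the question, is to select only the wanted columns in their final order, which also puts Score last:

```python
import pandas as pd

# Sample data copied from the question
df1 = pd.DataFrame({
    "ID": ["a", "b", "c", "d"],
    "Number": [45, 46, 34, 36],
    "Score": [0.3, 0.5, 0.94, 1.0],
    "Time": [2535, 345, 346, 356],
    "Result": [0.9, 0.8, 0.6, 0.7],
})
df2 = pd.DataFrame({
    "ID": list("abcdef"),
    "Image": [0, 0, 1, 0, 1, 1],
    "Video": [0, 0, 0, 1, 0, 0],
})
df3 = pd.DataFrame({
    "ID": list("abcdef"),
    "Length": [35, 57, 75, 57, 85, 48],
})

# Inner-join all three frames on ID, then keep only the wanted columns
merged = df1.merge(df2, on="ID").merge(df3, on="ID")
result = merged[["ID", "Time", "Image", "Video", "Length", "Score"]]
print(result)
```

Selecting by an explicit column list drops the unwanted columns and fixes the order in one step, so no separate pop is needed in this variant.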
Or, if you have a list of DataFrames, use:
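from functools import reduce
import pandas as pd

dfs = [df1, df2, df3]
# merge the frames pairwise on ID, left to right
df = reduce(lambda left, right: pd.merge(left, right, on='ID'), dfs)
# move Score to the end
cols = df.columns.drop(['Score']).tolist() + ['Score']
df = df[cols]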
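As for which rows survive: merge defaults to how='inner', so only the IDs present in every frame are kept (a to d in the question). A small sketch with cut-down stand-in frames (the names left and right are only for illustration) comparing this with how='outer':

```python
import pandas as pd

# Cut-down stand-ins for df1 and df3 from the question
left = pd.DataFrame({"ID": ["a", "b"], "Score": [0.3, 0.5]})
right = pd.DataFrame({"ID": ["a", "b", "e"], "Length": [35, 57, 85]})

inner = left.merge(right, on="ID")               # default how="inner"
outer = left.merge(right, on="ID", how="outer")  # keeps IDs from both sides

print(inner)
print(outer)
```

With the outer join, the row for e is kept and its Score is filled with NaN, because e has no match in the left frame.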