我有两个excel文件,其中包含来自数据记录器的多行excel,我需要比较这两个文件的3个相似列(anum、bnum、date、time(,但列持续时间不同,然后将差异保存到第三个excel文件中。
***excel文件1:
anum bnum duration date time
02473082424 0969755655 12 2018-08-04 10:53:04
02473082424 02435543470 17 2018-08-04 10:53:04
02473082424 01653559999 19 2018-08-04 10:53:06
02473082424 02437633476 63 2018-08-04 10:52:46
02473082424 02432262638 23 2018-08-04 10:53:26
02473082424 02435537928 40 2018-08-04 10:53:18
02473082424 0936467084 20 2018-08-04 10:53:42
***excel文件2:
anum bnum duration date time
02473082424 0969755655 16 2018-08-04 10:53:04
02473082424 02435543470 17 2018-08-04 10:53:04
02473082424 01653559999 23 2018-08-04 10:53:06
02473082424 02437633476 63 2018-08-04 10:52:46
02473082424 02432262638 23 2018-08-04 10:53:26
02473082424 02435537928 10 2018-08-04 10:53:18
02473082424 0936467084 20 2018-08-04 10:53:42
您可以首先使用pandas.read_excel读取两个excel文件分成两个数据帧df1和df2然后:
df1.rename(columns={'duration':'duration1'},inplace=True)
df2.rename(columns={'duration':'duration2'},inplace=True)
df=df1.merge(df2)
df['duration']=df['duration2']-df['duration1']
writer = pd.ExcelWriter(excel_file_3)
df[['anum','bnum','duration','date','time']].to_excel(writer,'Sheet1')