如何从csv文件中按特定列的数据分组,并以arff格式将它们全部放在一行中



我是python编程的初学者。我有一个csv数据文件,我想在我的数据集中分组发件人,并获得相关的发件人不同的rcvTime的一些列数据,并保持它们都在一行发件人喜欢时间序列数据,但在arff格式。以下是我的部分数据:

row number,type,rcvTime,sender,pos_x,pos_y,pos_z,spd_x,spd_y,spd_z,acl_x,acl_y,acl_z,hed_x,hed_y,hed_z
0,2,25207.0,15,136.07,1118.46,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.09,-1.0,0.0
1,2,25208.0,15,136.19,1117.14,0.0,0.22,-2.31,0.0,0.14,-1.48,0.0,0.09,-1.0,0.0
2,3,25208.81,21,152.66,904.56,0.0,0.06,-0.75,0.0,0.18,-2.43,0.0,0.07,-1.0,0.0
3,2,25209.0,15,136.69,1113.79,0.0,0.39,-4.18,0.0,0.15,-1.64,0.0,0.09,-1.0,0.0
4,3,25209.81,21,152.98,902.59,0.0,0.22,-2.91,0.0,0.12,-1.68,0.0,0.07,-1.0,0.0
5,2,25210.0,15,133.77,1108.01,0.0,0.58,-6.17,0.0,0.16,-1.76,0.0,0.09,-1.0,0.0
6,3,25210.81,21,153.25,898.68,0.0,0.37,-4.65,0.0,0.11,-1.35,0.0,0.08,-1.0,0.0
7,2,25211.0,15,134.37,1100.75,0.0,0.76,-8.14,0.0,0.18,-1.93,0.0,0.09,-1.0,0.0
8,3,25211.81,21,153.82,893.0,0.0,0.65,-6.67,0.0,0.25,-2.54,0.0,0.1,-1.0,0.0
9,3,25211.93,27,122.87,892.12,0.0,5.63,0.32,0.0,-1.57,-0.09,0.0,1.0,0.04,0.0

例如,我想为sender=15提取列数据,并将它们全部放在一行中,然后对于sender=21,我想做同样的事情,... .如果有人能指导我如何用python来做,我将不胜感激。

对于列数据操作,Pandas是一个很棒的库,有大量文档(这里是一个入门指南)。

对于您的问题,您可能可以使用pandas作为从csv文件中提取数据的第一步,然后您可以使用pd.DataFrame以您想要的方式格式化数据。然后,您可以使用arff库将pd.DataFrame格式转换为arff格式(例如:https://stackoverflow.com…)

另外,如果你能提供一个代码的工作示例通常会更好,这样人们会更准确地知道问题出在哪里。

祝你好运!

最新更新