小贝子编程

将列中的连续数字行转换为具有开始和结束信息的两列(pandas dataframe)

本文关键字：结束信息两列 dataframe pandas 数字连续转换开始 python pandas
更新时间 : 2023-09-23
英文 : convert rows of consecutive numbers in a column into two columns with start and end information (pandas dataframe)

我有一个像下面这样的数据框架，每个索引i都有一个score。

当分数相同时，我想折叠第一列中的信息。预期的结果如下所示:

start   end   score
5       7      3.0
8       9     11.0
15     15     10.0
30     32     1.0
10     11     8.0
20     22     1.0

您可以按连续值分组并聚合查找端点。Even适用于单个组，其中开始和结束是相同的。

df.groupby(df["score"].ne(df["score"].shift()).cumsum()).agg(
start=("i", "first"), end=("i", "last"), score=("score", "first")
)

start  end  score
score
1          5    7    3.0
2          8    9   11.0
3         15   15   10.0
4         30   32    1.0
5         10   11    8.0
6         20   22    1.0

这里不需要什么魔术。我想说，for循环最不令人头疼。

将列中的连续数字行转换为具有开始和结束信息的两列(pandas dataframe)

相关内容

最新更新

热门标签：