在指定时间间隔内的行datetime中添加小时数



设为以下Python Panda DataFrame:

| country_ID | date                      | counter  | value  |
| --------   | ------------------------- | -------- | ------ |
| USA        | 2022-03-01 09:22:29+00:00 |    1     |  red   |
| UK         | 2022-03-01 11:21:20+00:00 |    1     |  blue  |
| USA        | 2022-04-02 12:15:23+00:00 |    1     |  red   |
| ITL        | 2022-04-03 11:13:31+00:00 |    1     |  red   |
| USA        | 2022-05-05 21:04:42+00:00 |    1     |  green |
| USA        | 2022-05-05 22:01:51+00:00 |    1     |  green |
| ITL        | 2022-06-06 13:00:41+00:00 |    1     |  red   |

给定日期和时间范围(开始和结束)以及country_ID,我想在该范围内的行中添加2小时:

的例子:

add_hours('USA', '2022-03-01 09:00:00', '2022-05-05 21:30:00', 2)
| country_ID | date                      | counter  | value  |
| --------   | ------------------------- | -------- | ------ |
| USA        | 2022-03-01 11:22:29+00:00 |    1     |  red   |
| UK         | 2022-03-01 11:21:20+00:00 |    1     |  blue  |
| USA        | 2022-04-02 14:15:23+00:00 |    1     |  red   |
| ITL        | 2022-04-03 11:13:31+00:00 |    1     |  red   |
| USA        | 2022-05-05 23:04:42+00:00 |    1     |  green |
| USA        | 2022-05-05 22:01:51+00:00 |    1     |  green |
| ITL        | 2022-06-06 13:00:41+00:00 |    1     |  red   |

尝试使用布尔索引的逻辑(date也必须是datetime对象,而不是字符串):

def add_hours(df, country, start, end):
is_country = df['country_ID'].eq(country)
valid_date = df['date'].between(start, end)
df.loc[is_country & valid_date, 'date'] += pd.Timedelta('2H')

最新更新