熊猫每小时 OHLC 重新采样到每日 OHLC

我有一个每小时 OHLC 的数据帧如下(请忽略 OHLC 的值，我输入它们以获得更好的说明(，

hr_df =
                        Close      High       Low      Open
2017-09-04 05:00:00  0.715035  0.715035  0.715035  0.715035
2017-09-04 06:00:00  0.715035  0.715045  0.715015  0.715035
2017-09-04 07:00:00  0.715040  0.715050  0.714035  0.715035
:
:
2017-09-05 05:00:00  0.715045  0.715105  0.714985  0.715035
2017-09-05 06:00:00  0.715040  0.716045  0.714605  0.715035
2017-09-05 07:00:00  0.715040  0.717045  0.713225  0.715035
:
:
2017-09-06 05:00:00  0.715040  0.714045  0.713355  0.715035

我想将其重新采样为每日 OHLC，例如，

day_df =
               Close      High       Low      Open
2017-09-04  0.715035  0.715035  0.715035  0.715035
2017-09-05  0.715035  0.715045  0.715015  0.715035
2017-09-06  0.715040  0.715050  0.714035  0.715035
2017-09-07  0.715045  0.715105  0.714985  0.715035
2017-09-08  0.715040  0.716045  0.714605  0.715035
2017-09-09  0.715040  0.714045  0.713355  0.715035
2017-09-10  0.715040  0.717045  0.713225  0.715035

我尝试使用熊猫重新采样方法，day_df = hr_df.resample('D').pad()或day_df = hr_df.resample('D').ohlc()但它不起作用。我知道我可能没有使用正确的方法。如果有人可以指导我找到替代解决方案或正确的使用方法，我将不胜感激。

我认为您需要通过Resampler.agg dictionary对列名的键和函数的值进行下采样：

day_df = (hr_df.resample('D')
               .agg({'Open': 'first', 'High': 'max', 'Low': 'min', 'Close': 'last'}))
print (day_df)
                Open      High    Close       Low
2017-09-04  0.715035  0.715050  0.71504  0.714035
2017-09-05  0.715035  0.717045  0.71504  0.713225
2017-09-06  0.715035  0.714045  0.71504  0.713355

尝试使用 pd.Grouper .例如，如果货币对是外汇对，您可以将其与新分组的日期一起使用作为索引：

hr_df.groupby([pd.Grouper(key='date',freq='D'), 'pair']).agg(
{'Open': 'first', 'High': 'max', 'Low': 'min', 'Close': 'last'})

这对

我有用。

quotes.set_index('end', inplace=True)
quotes.index = pd.to_datetime(quotes.index)
ohlc_dict = {
    'open': 'first',
    'high': 'max',
    'low': 'min',
    'close': 'last'
}
quotes.resample("1D", closed='left', label='left').apply(ohlc_dict).dropna()

相关内容

最新更新

热门标签：